Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_41800 |
Symbol | |
ID | 7763060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4214463 |
End bp | 4216310 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643807035 |
Product | hypothetical protein |
Protein accession | YP_002801284 |
Protein GI | 226946211 |
COG category | [S] Function unknown |
COG ID | [COG4529] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAATT TTAATGTTGC TATAATTGGT GGCGGCCCTC GAGGCTTAAA CGTTTTGGAG AGATTTATTG AAATAACAAA TAAAAATAAA ATGTCGTCAA GATTGAAAGT GCATTTTATT GATCCTGGGA TACCAGGCGA AGGTTGTCAT CCTTCCAGTC AGCCAGATCA CTTGTTGGTA AACACTGTTT CATCTCAGGT GACTATATAT GCCCCAGATA GTATAGCGCA CAGAGACAAC GGATTATCTT TTATTGATTG GGCTGTCTCA AAAAATTATC GTAAAATTGA TGGCAAATAT AAAAAAGGTT GTGGCGGTAA TCAATTGACC GATATGGATT ATTTGCCTAG AAGCATACTT GGAGAATATC TGGGTATGTT TTTCAATCAT CTGCTTTCAC TATTGCCAGA AAATGTGGAT GTTATTGTTC ACAAAGCAAA TGCGATAGAT ATTTCCAACG AAAATCCATA TAAGATTGAG CTTGATAATA ACAATATACT GGATGCTGAC TTTATATTTC TGACCACAGG ACATGTTTAC AGAAAACCTT CAAAGGTCGA TCAAGGCTTT GCTGATTTTG CGAAAAATCA TTGCCACAGA AATCGAAATT TGGCTTACTA TCAAACACCT TATCCCATCG GTAACCTGGA TTGCATTTCT AGTAATGCAA CTGTGCTGAT ACAAGGTTTT GGTCTTACTG CTCATGACAC CATTTCTGCT CTGACTATTG GGCGTGGTGG CCATTTTGAG GAGGACGGGA ATAAGCTTCG CTATTGTGCT TCTGGTTCAG AGCCAGATCT CCGACTCGTC TCAAGGCAGT GTTTGCCGTT TGCTGCACGA GGTACTAATC AGAAAGGTTT GACTGGACGA CATCAAAGTC AGTTCTTCAC CTTAGAAGCT ATTACTAAAC TTAGGCAACA AGCTATTCAG AAAACAGGGG ATTATCGCCT AAACTTTCAG AAAGATGTTA TGCCGTTAAT TCTCAAAGAA GTAATTTACG CCTGGAGGTG TGCGAAGTTT GCTAAGCGAG TGGATCCAAG GGGCTTTCAG CCCACAAAAG ACGAAATAAA TTCGATCAAC AAAATTTTGT GGCCATTGGA GGGAAATAGT TTTTCTGATT TCTCTAACTA TAGAGCCTTC TTTAAAAAAA TGGTCGATGA GGATCTACTG GAAGCAGTAA AGGGCAATAT GAATAGTGCT GTGAAAGCTT CAACTGATGT GTTGCGTGAT ACAAGAGATG TGTTGCGCAG TGTCATTGAG TTTGGTGGAT TGTTACCGGA GTCTCACAAA TATTTTATTG AGCACTTCAA CCCTATCATC AATCGTGTTT CCTTTGGGCC GCCACTTCGT CGTGTCAAGG AGGTTCAAGC GCTGTTTGAA GCAGGTGTTC TGGATCTGGC TGGTGGACCA GGAGCAATGG TCATTACCAA TGAGTCTACA TCTGAATATG AGGTTATCAC AAAGTTTGCG AATAAAATCT ACAAGCAGAA AGCAGACATT GTTATAAGTG CACGTCTTGA TGGCTATTCA CCTTTAACAG ATTCAAGTCG ATTAACAGAA AATTTATTGA AACGTAGTCT TATCAGACCT TGGATGAACG GTGATTATCA TCCTGGCGGT GTAGATATAG ATAAACATAT GCACCCGATT GATAGTTCTG GAAAACCACA GCCTCGGCTA TGGGCTATCG GATTTCCAGT AGAGGGAGCT CATTACTACA CCCATGCCTT ACCTCGGCCT CATGTTGACT CCCGTCAAAT AACTGAAGCT GAAAATTGCG TCGTGGATTG CTTAAAGCAA ATTACTGAAA TTCATAAGAG ATCTCAACAA TTATTGGAAA CCCTGTAA
|
Protein sequence | MNNFNVAIIG GGPRGLNVLE RFIEITNKNK MSSRLKVHFI DPGIPGEGCH PSSQPDHLLV NTVSSQVTIY APDSIAHRDN GLSFIDWAVS KNYRKIDGKY KKGCGGNQLT DMDYLPRSIL GEYLGMFFNH LLSLLPENVD VIVHKANAID ISNENPYKIE LDNNNILDAD FIFLTTGHVY RKPSKVDQGF ADFAKNHCHR NRNLAYYQTP YPIGNLDCIS SNATVLIQGF GLTAHDTISA LTIGRGGHFE EDGNKLRYCA SGSEPDLRLV SRQCLPFAAR GTNQKGLTGR HQSQFFTLEA ITKLRQQAIQ KTGDYRLNFQ KDVMPLILKE VIYAWRCAKF AKRVDPRGFQ PTKDEINSIN KILWPLEGNS FSDFSNYRAF FKKMVDEDLL EAVKGNMNSA VKASTDVLRD TRDVLRSVIE FGGLLPESHK YFIEHFNPII NRVSFGPPLR RVKEVQALFE AGVLDLAGGP GAMVITNEST SEYEVITKFA NKIYKQKADI VISARLDGYS PLTDSSRLTE NLLKRSLIRP WMNGDYHPGG VDIDKHMHPI DSSGKPQPRL WAIGFPVEGA HYYTHALPRP HVDSRQITEA ENCVVDCLKQ ITEIHKRSQQ LLETL
|
| |