Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_51980 |
Symbol | |
ID | 7764035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5304434 |
End bp | 5306179 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643808014 |
Product | hypothetical protein |
Protein accession | YP_002802248 |
Protein GI | 226947175 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0810] Periplasmic protein TonB, links inner and outer membranes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGAA TCACCCGTTC ATCCAGCGCC GGGCGCAGCG CGGCCGATGC CCTGGCCGGC CACGCGGAAC TGACGGCCAT CCTGCGCCGC CACCTGCCGC CGAGCACGGC GGCCCTGTTC GCCAAGCCCA GGCAGGCCGA GGGCGACATC GTCGAGTGGT ACTCGGACCT GGGCGGCCAG CCGGTGCCCT TCGCCGAGCT GGCGCCGCAG GAAGCGCGCC AGGTGCGCCA GCTGCTGGAC GAGCGCCTGG CCTCCATCGT GGAACTGGCC GGGCGTCTCG AAGGACTGGG CGAGGAGGGC AGTCGACAGG CCGAGCTGTT GCGCCAGGCC GCGCGTTACC CGGACACCGC CACCCTCTAT GCGCTGAACG GCCAGCCGCT GGTGACCTTC TGGGGCGGCG GCGAGCCGCC CGCACCGCCC GTGCCGCCGC CGGCCGCCGC CGGATTGCCG GGAGCGGCGC CGGCGGCCCT GGCTACCGCC GGCGCCGCGT CCCTCGCCGC CGCCGGGCCG CAACGCCGCT GGTGGCCCTG GCTGCTCCTC CTGCTGTTGC TGCTGGCCCT GCTCGCCGGC CTCTGGTGGT GGCTGTTCGG CCGCCAGGAG CCGCCGGTGG AGCCGATCGC CGCGACCGAG CCGCCGGCCG TCCAGGAACA ACCCGAGCCG CCGGTCGAGG AAAAGACTCC GCCGGTCGAG GAACCCGAGC CCGAACCGGA ACCCGTGCCC GAGGAAAAAG CGGAGCCCGA ACCCGTGGTG CCGCCGGCGC CCAAGGTCGA GGCGGTGCCG CCCAAGCCCG AGCCGAAACC GGAGCCCAAG CCCGAACCGA AGCCCGTGCC CAAGCCGGAA CCCAAGCCGG AGCCGAAACC CGAGCCCAAA CCGGACCCGG TCGAATTGGC GCGCAAGCGC ATCGTCGCCG CGGGTAGCAA CTGCAACAGC CTGCAGCAAC TGCTCAGCCA GGACCCGCAG GTCAAGCAGA ACCCGGCGCT GCGCGGCGAA GTCGAGGCGA AGCTCAAGCA GCAGTGTCGG CAGCAACTGA TCGTCAACGC CAAGAACCTC TGTCCGGACG AGCGACCCAA GGAACTGGCG CCGGAGCTGG CCATCGTGTT CGACGCCTCC GGCTCGATGG ACATCAGCCT GCTGGCGACC AAGCAGGAGA TCGACCGGGC GGTGATGGTG CAGGGCATGA CCGACATCGC CGCGCGCGTG CTGCTGGGCG GCAATCCGGG GATCGACACC ACCGGCCACC TGTACCGCGA GCCCAAGCGC ATCACCGCGG CCAAGCAGGC GACTACCGCG GTGGTCCAGC GCCTGCCCAG CGATGTCAGC GCCGGTCTGG TGCTGATCGA GCGCTGCCCC GCCTCGCGCA GCATGGGCTT CTTCACCCCG GTCCAGCGTG GCGGCCTGCT GGCGCGCCTG AATTCCATCC AGCCGGTGGA GGGCACGCCG CTGGCCGACG GCGTCGCCAA GGCCGGACAG ATGCTCGACG GCGTCAACCG CGAGTCGGTG ATGGTGGTGG TCTCCGACGG CGAGGAGAGC TGTCACCAGG ACCCCTGCGC GGTGGCCCGC GACCTGGCGC GGCGCAAGCC GCACCTGAAG ATCAACGTGG TCGACATCGC CGGTACCGGC GCCGGCAACT GCCTGGCCCA GGCCACCGGC GGCCGGGTGT TCAACGCCAG GAACGCCGGC GAACTGGCCG CGATGACCCG CCAGGCCGCC CAGGACGTGC TGCCGCCGGC GCATTGCCGA CAGTGA
|
Protein sequence | MKRITRSSSA GRSAADALAG HAELTAILRR HLPPSTAALF AKPRQAEGDI VEWYSDLGGQ PVPFAELAPQ EARQVRQLLD ERLASIVELA GRLEGLGEEG SRQAELLRQA ARYPDTATLY ALNGQPLVTF WGGGEPPAPP VPPPAAAGLP GAAPAALATA GAASLAAAGP QRRWWPWLLL LLLLLALLAG LWWWLFGRQE PPVEPIAATE PPAVQEQPEP PVEEKTPPVE EPEPEPEPVP EEKAEPEPVV PPAPKVEAVP PKPEPKPEPK PEPKPVPKPE PKPEPKPEPK PDPVELARKR IVAAGSNCNS LQQLLSQDPQ VKQNPALRGE VEAKLKQQCR QQLIVNAKNL CPDERPKELA PELAIVFDAS GSMDISLLAT KQEIDRAVMV QGMTDIAARV LLGGNPGIDT TGHLYREPKR ITAAKQATTA VVQRLPSDVS AGLVLIERCP ASRSMGFFTP VQRGGLLARL NSIQPVEGTP LADGVAKAGQ MLDGVNRESV MVVVSDGEES CHQDPCAVAR DLARRKPHLK INVVDIAGTG AGNCLAQATG GRVFNARNAG ELAAMTRQAA QDVLPPAHCR Q
|
| |