Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_37840 |
Symbol | |
ID | 7762676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3827319 |
End bp | 3828644 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643806650 |
Product | hypothetical protein |
Protein accession | YP_002800903 |
Protein GI | 226945830 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3487] Uncharacterized iron-regulated protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00426998 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCGCA TTTCCCTGGC CTCCGCCAGC CTGCTCGCCA TCGCCATCTC GCTCGCCGGT TGCGACGACA ACAAGGAACG GACCACCGCC CAGACCGCCA CCCCGGCAGC CAACGCTCCC GCCGCCACGC CACTGGCCGA CGAAGCGGCC GCCAGGGCGG TGGTGAATCA CTATGCCGAC CTGGCGCTGG CGGTGTTCGG CGACTCCCTG AGCACGGCCA AGGCCCTGCA GCAGGCGATC GACGCCCTGC TCGCCAAGCC GGCTGCCGAT ACCCTGCAGG CCGCCCGCGA GGCCTGGATC GCGGCCCGCG TCCCCTACAT GCAGAGCGAG GTGTTCCGCT TCGGCAACCC GGTGGTCGAC GAATGGGAAG GCCAGCTCAA CGCCTGGCCG CTGGACGAGG GGCTGATCGA CTATGTCGCC GCCGACTACC AGCACGCCCT CGGCAATCCC GGCGCCACCG CCAACATCAT CGCCAACACC GAGATCCAGG TGGGCGAGGA GAAGATCGAC GTCACCGAGA TCACCGGCGA AGCCCTGGCC AGTCTCAACG AACTGGCCGG CTCGGAAGCC AACGTGGCCA CCGGTTACCA CGCCATCGAG TTCCTGCTCT GGGGTCAGGA CCTGAACGGC ACCGCGCCCG GCGCCGGCCA ACGCCCGGCC AGCGACTTCC AGGCCGGAGA GGCCTGTACC GGCGGCCATT GCGAGCGCCG CCGCGCCTAC CTCGAGGCAG TCACCGACCT GCTGGTCAGC GACCTGCAAT ACATGGTCGA ACAGTGGCAA CCCCAGGTGG CCGACAACTA CCGCGCCAGC CTGGAAAAGG ATCCGGCCGC CAACGGCCTG CGCAAGATGC TTTTCGGCAT GGGCAGCCTG TCCCTCGGCG AACTGGCCGG CGAGCGCATG AAGGTGCCGC TGGAAGCCAA CTCCACCGAG GATGAACAGG ACTGCTTCAG CGACAATACC CACAACGCGC ACTTCTACAA CGGCAAAGGC ATCCGCAATG TCTATCTGGG CGAATACCGC AAGCTCGACG GCAGCCTGCT GAGCGGCCCG AACCTGTCGT CGCTGGTCGC CAAGGCCGAC GCCCAGGCCG ATGCCACCCT GAAGGCCGAT CTCGAGGCCA GCGAGGCCCG GCTGCAGGCG CTGGTCGACA ACGCCGCCAA GGATCAGCAC TTCGACCAGT TGATCGCCGC CGATAACGCC GCCGGGCAGC AACTGGTGCG CGACGCCATC GCCGCGCTGG TCAAGCAGAC CTCCTCCATC GAGCAGGCCG CGGCTCGACT GGGCATCGGC GACCTGAATC CGGACAGTGC CGACCACAGT TTCTGA
|
Protein sequence | MPRISLASAS LLAIAISLAG CDDNKERTTA QTATPAANAP AATPLADEAA ARAVVNHYAD LALAVFGDSL STAKALQQAI DALLAKPAAD TLQAAREAWI AARVPYMQSE VFRFGNPVVD EWEGQLNAWP LDEGLIDYVA ADYQHALGNP GATANIIANT EIQVGEEKID VTEITGEALA SLNELAGSEA NVATGYHAIE FLLWGQDLNG TAPGAGQRPA SDFQAGEACT GGHCERRRAY LEAVTDLLVS DLQYMVEQWQ PQVADNYRAS LEKDPAANGL RKMLFGMGSL SLGELAGERM KVPLEANSTE DEQDCFSDNT HNAHFYNGKG IRNVYLGEYR KLDGSLLSGP NLSSLVAKAD AQADATLKAD LEASEARLQA LVDNAAKDQH FDQLIAADNA AGQQLVRDAI AALVKQTSSI EQAAARLGIG DLNPDSADHS F
|
| |