Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_06770 |
Symbol | thiL |
ID | 7759631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 642806 |
End bp | 643771 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643803599 |
Product | thiamine monophosphate kinase |
Protein accession | YP_002797903 |
Protein GI | 226942830 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGAGT TCGAACTGAT CCGCCGCTAC TTCGCCGCCG CCGCCTGCGC GCAGGCGGCG CCCGGCGTGG CCCTCGGCAT CGGCGACGAC TGCGCCCTGC TGGAGGTTCC CGCCGGCGAG CGGCTGGCGA TCTCCACCGA CACCATGGTC GCCGGGGTGC ATTTCCCCGC TCCCTGCGAT CCCTTCCTGC TCGGCCAGCG AGCCCTGGCC GCGGCCGCCA GCGATCTTGC GGCGATGGGC GCCACGCCCC TCGGTTTCAC CCTCGCCCTG ACCCTGCCGG CGGCCGATCC GGCCTGGCTG GCCGCCTTCG CCCGCGGCCT GGACCACAAG GCGCGCGAGT GCGGCCTGGC GCTGATCGGC GGCGATACCA CCCGCGGTCC CCTGTGCATC AGCCTGAGCG TATTCGGCCG GGTACCGGCC GGCCTGGCGC TGTGCCGGAA TGGCGCGCGG CCCGGCGATC TGCTCTGCGT CGGCGGTCCG CTCGGCGATG CCGCCGGCGC CCTGGAACTG GTGCTCGACC GGCGCCAGGC CCCGGCCGAG GTCGCCAGCC CCCTGCTGGC ACGCTACTGG TCGCCACGGC CGCAGCTATC CTTGGGGGTG GCGCTGCGCG GCAGGGCGAG CGCGGCGCTG GATATCTCCG ATGGCCTGCT CGCCGATTGC GGGCACATCG CTTCGGCTTC CGGCGTGGCG CTGTGCATCG AACGCCGCCG CGTGCCGCTG TCGGTGCCGT TGCGCCGCCT GCTCGGCGAA GAGGCGGCGC TGGCCTGCGC CCTAGGCGGC GGCGACGACT ACCTGCTGGC CTTCACCCTG CCGCCGCGCT TCCTCGCCGC CCTGCAGGCC GAATGGCCGC TGCGGGTGAT CGGCCGGGTC GAGGCCGGTA CGGGCGTGCA TCTGCTCGAC GACGCTGGCC GCGAGGTCGC GCCGCCGGTC GGGGGCTATC AACATTTCGG GAGTCCGCGT GACTGA
|
Protein sequence | MGEFELIRRY FAAAACAQAA PGVALGIGDD CALLEVPAGE RLAISTDTMV AGVHFPAPCD PFLLGQRALA AAASDLAAMG ATPLGFTLAL TLPAADPAWL AAFARGLDHK ARECGLALIG GDTTRGPLCI SLSVFGRVPA GLALCRNGAR PGDLLCVGGP LGDAAGALEL VLDRRQAPAE VASPLLARYW SPRPQLSLGV ALRGRASAAL DISDGLLADC GHIASASGVA LCIERRRVPL SVPLRRLLGE EAALACALGG GDDYLLAFTL PPRFLAALQA EWPLRVIGRV EAGTGVHLLD DAGREVAPPV GGYQHFGSPR D
|
| |