Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU3194 |
Symbol | thiL |
ID | 2687582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 3502085 |
End bp | 3503071 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637127887 |
Product | thiamine monophosphate kinase |
Protein accession | NP_954235 |
Protein GI | 39998284 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGACTCG GCGAGATCGG TGAATTCGGC CTTATCGACA GGATTGCCGG CAAAGTAGCC GCCGGTGCCG GGGTTCGCCT CGGCATCGGC GACGACGCTG CCGTTACCGA AACGGAAGCG GGGCGCCTCC TGCTGTCCAC CGCCGACATG CTCGTTGAAG GCATCCACTT TGACCTCTCC TTCACCGACC CCTTCAGGCT CGGCCGCAAA TCCCTGGCGG TCAACGTCTC CGACATTGCC GCCATGGGGG GACACCCTCG CCATGCGCTG CTCTGTCTTG CCATTCCAAC CGATCTGCCG GTTGAATTTC TCGACCGGTT CGCCGACGGC GTCATCTCCC TGGCAGAGGA ATTCGGTGTC ACCCTCATCG GCGGCGATAC CTGCCGCTCC TCCTCCGGGC TTGTCATCTC CATCACCCTC CACGGCGAAC AGGTCCCCAC ACGGATCATC CCGCGCAACG GGGCTCGGCC GGGCGACGAC GTGTTCGTCA CCGGTACCGT CGGAGATTCA GCCCTGGGGC TCGAACTGCT GCGCAGAGGC GAACGCTCCG GACATGCCGT CGAACGCCAC CTCAACCCCT CGCCCCGCGT CTCCGCCGGC CTGAGCCTGG CCGAATCGGG CATGGCCTCG GCCATGATCG ACGTGAGCGA CGGTGTCCTT GCCGACCTGG GGCACATCCT GACCGGCTCG GGGGTCGGCG CCCGCATCGA CGCATCCCTC ATCCCCCTTT CTCCCTACTT CAGCCAACGG GCGCCCGACG TGGCACCTGA CCCCCTCTCT CTGGCACTGG CGGGAGGCGA GGACTACGAA CTACTCTTTA CTGCGGCGCC GGGACGGACG GCCGAGGTGG AAACGCTGCT GGCGGCGTGC GGCGTTACAG CGACCCGGAT CGGTTCCATC GTTGCCGGGT CGGACGTCAC GGTCACCGCA GCAGACGGGA CCCTTATCCC TCCGAGACGC CGCGGCTTCA ACCATTTCGC GCCGTAA
|
Protein sequence | MRLGEIGEFG LIDRIAGKVA AGAGVRLGIG DDAAVTETEA GRLLLSTADM LVEGIHFDLS FTDPFRLGRK SLAVNVSDIA AMGGHPRHAL LCLAIPTDLP VEFLDRFADG VISLAEEFGV TLIGGDTCRS SSGLVISITL HGEQVPTRII PRNGARPGDD VFVTGTVGDS ALGLELLRRG ERSGHAVERH LNPSPRVSAG LSLAESGMAS AMIDVSDGVL ADLGHILTGS GVGARIDASL IPLSPYFSQR APDVAPDPLS LALAGGEDYE LLFTAAPGRT AEVETLLAAC GVTATRIGSI VAGSDVTVTA ADGTLIPPRR RGFNHFAP
|
| |