Gene Moth_0198 details

Gene Information

Locus tag: Moth_0198
Symbol:
ID: 3832271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 193951
End bp: 195123
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 62%
IMG OID: 637828134
Product: thiamine biosynthesis protein ThiI
Protein accession: YP_429076
Protein GI: 83589067
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
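
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here for illustration; they are not part of the source database.

```python
# Values copied from the Gene Information table for Moth_0198.
START_BP = 193951
END_BP = 195123
GENE_LENGTH_BP = 1173
PROTEIN_LENGTH_AA = 390


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks agree with the table: 1173 bp and 390 aa.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the table is internally consistent: 195123 - 193951 + 1 = 1173 bp, and 1173 / 3 = 391 codons, one of which is the stop codon, leaving 390 amino acids.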


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATACAT CCCTACTGGT CCGTTACGGC GAGATCAGCC TCAAGGGCAA CAACCGCCCA 
TACTTTGAAG ATAAACTCCT GGCCAACATG CGCCGGGCCC TGGCCGGCCT GCCGCCCCGC
AGAATGCGCA AGACCTTCGG CCGCGTCTTC GTGGAGCTCC ATGACGACCT GGAAGCCGTA
GCCCGGCGTT TGCAGCGCGT CTTTGGCATC GTCTCCATGA GCCCGGTAGC CACAGCTCCC
CTGGAGCTGG AAGCCATCAA AAAAGCCGCC CTGGCCGTCT TAAAGGATTC CCCCGGCAGC
ACCTTTAAAG TCCAGGCCCA GCGGCCCAAT AAACGCTTTC CCCTCACCTC ACCTGAAGTC
AACCAGGAAC TGGGGGCCTA CCTCCTCACC CACAGCCAGG GCCAACGGGT AGACGTTCAC
CATCCCGACC GGGTTATCCA TGTGGAAATC CGCGACGAAG GGGCCTATAT CTACTCCCGC
ATCATCCCCG GACCCGGCGG CCTGCCCGTA GGGGTCACCG GCCGGGGGCT GCTCCTGATC
TCCGGCGGCA TCGACAGCCC GGTGGCCGGC TATATGGGCA TGAAACGGGG CCTGGAACTC
ACGGCCCTCC ATTTTCACAG CTTCCCTTTT ACCAGCGAGC GCTCCAAGGA AAAGGTCATC
GACCTCTGCC GGGTCCTGGC AGGCTACAGC GGACCTTTGC GCCTGGTGGT GGCCCCCTTT
ACCAATATCC AGAAGGCCAT CCGCCAGAAC TGCCCCCAGG AGTTTTACGT CACCATCATG
CGGCGGATGA TGTTCCGCAT CGCCAGGGCG GTGGCTGCCA AAGAGGAGGC CCCGGCCATC
CTCACGGGGG AGAGCCTGGG CCAGGTGGCC AGCCAGACCC TCCAGAGCAT GGCGGTAATC
AACAAGGTGG TCGACCTGCC GGTCTTAAGG CCCCTGGTGG CCTGGGACAA GAGCGAGATT
ATCGAGGTGG CCCGCCGCAT CGGCACCTAC GACATCTCCA TCCGGCCCTA CGAGGACTGC
TGCACCCTCT TTGTCCCCAA ACACCCGGCC ACCAAACCGC CCCTGGCCCG GGTGGAAGCG
GCCGAAAAGA ATCTGGCCGT GGTGGAACTC GTCGCGGAGT GCCTGGAAAA TCTAGAAATC
CTGACGGTGG AGCCCGAAGC TGACGTAGTG TAA
 
Protein sequence
MYTSLLVRYG EISLKGNNRP YFEDKLLANM RRALAGLPPR RMRKTFGRVF VELHDDLEAV 
ARRLQRVFGI VSMSPVATAP LELEAIKKAA LAVLKDSPGS TFKVQAQRPN KRFPLTSPEV
NQELGAYLLT HSQGQRVDVH HPDRVIHVEI RDEGAYIYSR IIPGPGGLPV GVTGRGLLLI
SGGIDSPVAG YMGMKRGLEL TALHFHSFPF TSERSKEKVI DLCRVLAGYS GPLRLVVAPF
TNIQKAIRQN CPQEFYVTIM RRMMFRIARA VAAKEEAPAI LTGESLGQVA SQTLQSMAVI
NKVVDLPVLR PLVAWDKSEI IEVARRIGTY DISIRPYEDC CTLFVPKHPA TKPPLARVEA
AEKNLAVVEL VAECLENLEI LTVEPEADVV
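
The gene and protein sequences above are related by translation table 11, and the reported GC content is a property of the gene sequence. The following is a minimal sketch of both calculations, assuming the sequence is pasted as shown (10-base blocks separated by whitespace). The names `clean`, `gc_content`, and `translate` are hypothetical helpers introduced for illustration. The codon-to-amino-acid assignments of table 11 are the same as those of the standard code; the sketch does not model the alternative start codons that table 11 allows, which does not matter here because this gene begins with ATG.

```python
# Codon table built in the conventional T, C, A, G order.
# Amino-acid assignments for table 11 match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record.
    first_line = (
        "ATGTATACAT CCCTACTGGT CCGTTACGGC GAGATCAGCC TCAAGGGCAA CAACCGCCCA"
    )
    print(translate(first_line))  # MYTSLLVRYGEISLKGNNRP
```

The translated first line matches the first 20 residues of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` in the same way would allow the reported GC content and protein sequence to be checked against the record.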