Gene Moth_1723 details

Gene Information

Locus tag: Moth_1723
Symbol:
ID: 3833023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1767185
End bp: 1768483
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 59%
IMG OID: 637829648
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_430568
Protein GI: 83590559
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
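
The start, end, and length fields above are consistent with one another if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both readings are assumptions, not something the record states. A minimal sketch of that arithmetic, with variable names chosen here for illustration:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1767185
end_bp = 1768483

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1299 432
```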


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000414436
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.578112
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAACCTGA TAGAAAGCGC CCGGGCCGGC TTGATTACCC CCGAAATGGA GCAGGTGGCC 
GTTCAGGAGG GTGTAACTCC GGAATTCGTG AGGCAGGGTG TGGCTGATGG GACGATAGTC
ATCCTGCGCA ATGCCCGCCG GCAGAACGTC ACTCCAGTTG GTGTCGGTAA AGGGCTGAGG
ACAAAGGTCA GCGCCAGCGT CGGTTTGTAC GGAGAGACGG GCGGTATTGA TGTGGAGGTT
GCCAAGATTA AAGCCGCCGT GGAAGCGGGA ACGGACGCCA TCATGGATCT GAGCGTCAGC
GGGGACATCG AGGCCATGCT TGCGGAAACG CTGGCCGTTT CCCCCAAGCC CGTCGGTACC
TTGCCCCTTT ACCAGGCCAT GGCCGAAGCC GGCAGAAAAT ACGGTTCTTC CGTTAACATG
AGAGATGAAG ACTTGTTTGA TGTAATTGAA CGCCACGCGG CCGCCGGGGT AGACTTTCTG
GCCCTGCACT GCGGGACTAC TATGAATATT GTAGAACGCG CCAGAAACGA GGGCCGGATC
GATCCTCTGG TAAGCTACGG GGGTTCCCAC CTGATCGGGT GGATGCTGGC GAACCGGAGG
GAAAACCCCC TTTATGAACA CTTTGACCGG GTTCTTGCGA TTGCCCGGAA GTACGATGTT
ACCATCAGCT TTGCCGACGG CATGCGACCG GGATGCCTGG CCGATTCCCT GGATGGCCCC
CAGGTGGAAG AGCTGGTTGT TTTGGGAGAG CTGGTCAGGC GGGCGAGGGA AGCCGGTGTA
CAGGTGATGG TAAAAGGGCC GGGTCATGTA CCCTTGCAGC AACTAAAGGC GACGGTTGTC
CTGGAAAAAA GTCTCTGCCA CGGGGCGCCG TATTTTGTCT TCGGCCCCCT GGTAACAGAT
ATCGCAATCG GTTATGACCA TATCAATGCT GCTATCGGGG GTGCCTTGAG CGCCTGGGCG
GGTGCGGAGT TTCTCTGTTA TGTAACTGCC GCCGAACATG TGGGGATTCC GGATATTGAC
CAGGTCCGGG AGGGAGTGAT TGCCGCTCGC ATTGCCGCCC ATGCCGCCGA CCTGGCCAAC
GGCCTTACCT GTGCCCGGGA ATGGGATCGG GAGCTTTCCC GGGCGAGAAA AGAACTGGAC
TGGAAGCGGC AGATTGCTCT CGCCATAGAC CCCGAACGGG CGGGAAGGCT GAGAGAAGAA
AGAAGCGACG CCGCGGCGGC GGGATGTGCC ATGTGCGGTA AATACTGCGC CATGGAAATC
GTATCCAGAT ACCTGGGCAC AGCCAGACAT ACATGTTAG
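
The GC content field of this record can in principle be recomputed from the gene sequence above. The sketch below shows one way to do so, assuming the value is simply the fraction of G and C bases over the whole sequence; the record does not say how it was computed. The function name `gc_fraction` is hypothetical. Only the first row of the sequence is used, to keep the example short.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)

# First row of the gene sequence shown above (60 bases).
first_row = "TTGAACCTGA TAGAAAGCGC CCGGGCCGGC TTGATTACCC CCGAAATGGA GCAGGTGGCC"
print(f"{gc_fraction(first_row):.0%}")  # 60% for this row alone
```

Passing the full 1299 bp sequence to the same function would give the value for the whole gene.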
 
Protein sequence
MNLIESARAG LITPEMEQVA VQEGVTPEFV RQGVADGTIV ILRNARRQNV TPVGVGKGLR 
TKVSASVGLY GETGGIDVEV AKIKAAVEAG TDAIMDLSVS GDIEAMLAET LAVSPKPVGT
LPLYQAMAEA GRKYGSSVNM RDEDLFDVIE RHAAAGVDFL ALHCGTTMNI VERARNEGRI
DPLVSYGGSH LIGWMLANRR ENPLYEHFDR VLAIARKYDV TISFADGMRP GCLADSLDGP
QVEELVVLGE LVRRAREAGV QVMVKGPGHV PLQQLKATVV LEKSLCHGAP YFVFGPLVTD
IAIGYDHINA AIGGALSAWA GAEFLCYVTA AEHVGIPDID QVREGVIAAR IAAHAADLAN
GLTCAREWDR ELSRARKELD WKRQIALAID PERAGRLREE RSDAAAAGCA MCGKYCAMEI
VSRYLGTARH TC
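
The protein sequence can be related back to the gene sequence by translating it codon by codon. The gene begins with TTG, one of the alternative initiation codons of translation table 11, while the protein begins with M. The sketch below therefore reads the first codon as methionine, assuming the input starts exactly at the start codon. The names `CODON_TABLE` and `translate_cds` are hypothetical, and the codon assignments are those of the standard code, which table 11 shares for non-initiator codons.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the sequence starts at the initiation codon, which is
    read as M even when it is an alternative start such as TTG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)

# First row of the gene sequence in this record (60 bases, 20 codons).
first_row = "TTGAACCTGA TAGAAAGCGC CCGGGCCGGC TTGATTACCC CCGAAATGGA GCAGGTGGCC"
print(translate_cds(first_row))  # MNLIESARAGLITPEMEQVA
```

The printed residues match the first two blocks of the protein sequence listed above.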