Gene Moth_1844 details

Gene Information

Locus tag: Moth_1844
Symbol:
ID: 3831705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1902337
End bp: 1903419
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 59%
IMG OID: 637829776
Product: hypothetical protein
Protein accession: YP_430687
Protein GI: 83590678
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR00423] radical SAM domain protein, CofH subfamily
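
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon. Both are assumptions, although they are consistent with the values listed in this record. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1902337, 1903419)
print(length)                     # 1083
print(protein_length_aa(length))  # 360
```

Both results match the Gene Length and Protein Length fields of the record.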


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAAA CTATCGCCTG CAAAGTGCTT GACGGTGAGC GTTTGAGTTT TCACGAAGCG 
GAGCGTCTTT ATCAGGAGGC CGGCCTGCTG GAACTGGGTT ACCTGGCCAA CCTGGTCCGC
CAGAAGCTGC ACCCCGAAGG GACGGTCACC TTTGTCGTGG ACCGGAATAT CAACTATACC
AATATTTGCG TCAATGCCTG CCGTTTCTGT GCCTTTTATC GCCTGCCCGG GGATCCGGAA
GGCTACCTCC TCAGCCGGGA GGAAATCGGC CGTAAAATCG AGGCGACCCT GGCTGCAGGG
GGAACCCAGA TCCTGATGCA GGGGGGCCTG CATCCCGATC TAGACCTGGC CTGGTTTGAG
GACCTCTTCT CCTGGATTAA ATCCCGCTAT CCGGTCACCC TTCATTCCCT ATCTCCGGTA
GAAATCGATG ACCTGGCCCG GAAGGAAGGG CTGCCGGTGA TAGAGGTTTT GCGGCGGCTG
AAGAAAGCAG GCCTCGATTC CCTGCCGGGC GGTGGGGCGG AGATCCTGGT CGACAGGGTT
CGCGGCCGGG TTAGCCCCAA AAAGACCGGT GCGGCCCGCT GGCTGGAAGT GATGCGCGCC
GCCCATGCCC TGGGGATGAA ATCAACCGCC ACCATGGTTT TCGGGTTGGG GGAAACTATG
GCTGAGCGGA TCGCTCACCT GGAGGCTATC CGGCAGCTCC AGGATGAAAC GGGAGGTTTT
ACAGCCTTCA TCCCCTGGAG TTTCCAGCCA GGCAATACCG AACTTGGCGG CGTGGAAGCT
AGCACCACCG AATACCTGAA ACTCCTGGCC CTTTCCCGGC TTTACCTGGA TAATATCCCC
AATATCCAGG TCTCCTGGGT AACCCAGGGA ACCAAGGTGG CTCAGGCGGC CCTTTTCTTT
GGCGCCAACG ATTTCGGTTC CACCATGCTG GAGGAAAACG TAGTCCGGGC GGCTGGTGCT
TCCTTCCGTG CCGATCGGGA GGAGATCCTT CGCTGCATCC AGGCGGCCGG TTTCCGGCCC
GCCCAGCGCG ATAATGAATA TCATATTTTG CGCTACTACG AGGCAGGGCA GGTTAACCGG
TGA
 
Protein sequence
MKETIACKVL DGERLSFHEA ERLYQEAGLL ELGYLANLVR QKLHPEGTVT FVVDRNINYT 
NICVNACRFC AFYRLPGDPE GYLLSREEIG RKIEATLAAG GTQILMQGGL HPDLDLAWFE
DLFSWIKSRY PVTLHSLSPV EIDDLARKEG LPVIEVLRRL KKAGLDSLPG GGAEILVDRV
RGRVSPKKTG AARWLEVMRA AHALGMKSTA TMVFGLGETM AERIAHLEAI RQLQDETGGF
TAFIPWSFQP GNTELGGVEA STTEYLKLLA LSRLYLDNIP NIQVSWVTQG TKVAQAALFF
GANDFGSTML EENVVRAAGA SFRADREEIL RCIQAAGFRP AQRDNEYHIL RYYEAGQVNR
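
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any computation such as a GC content check. The following is a minimal sketch of that step. It assumes GC content is the fraction of G and C bases over the full gene sequence including the stop codon, which may differ from how the database computes its rounded figure. The helper names are hypothetical.

```python
def clean_sequence(blocked_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGAAAGAAA CTATCGCCTG CAAAGTGCTT GACGGTGAGC GTTTGAGTTT TCACGAAGCG"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Passing the whole gene sequence listing through `clean_sequence` and then `gc_percent` gives a value that can be compared against the GC content field.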