Gene Moth_1846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1846 
Symbol 
ID3831707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1904263 
End bp1905384 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content60% 
IMG OID637829778 
Producthypothetical protein 
Protein accessionYP_430689 
Protein GI83590680 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCACCA CCACGGCAGA ATTTATACGC CGGGAGCTCT CCCGGGGGGA ACTGGCGGAC 
ATAGCTGCCA GGGTATATGC CGGGGAAAGG CTGACCCGGG AGGACGGTAT ACGTCTCTGG
GAGAGTCAGG ACCTCCTTGG TATTGGCTAC CTGGCCGACC TGGCCCGGCA GCGGACCTGC
GGTGATACCG TATATTTTAT TAATAATGCC CACATCAATT ATACCAATAT CTGTCAAAAT
CTCTGCGATT TATGTGCCTT TGGGAGACAG ACAGGTACTC CGGGGGCTTA TACCCTCACC
CTGGCGGAGA TTGAAGCGAA GGCCAGGGCG GCGGCTGCCG CAGGTGTCAC CGAGATCCAT
ATTGTCGGCG GCCTGAATCC GGAATTACCC TATGATTACT ACCTGGAACT AATCCGAACC
GTGCGCCGGG CGGCACCCGG GGCCTGCATC CAGGCCTTTG ACGCTGTGGA AATTGACTTT
ATAGCCTCCC GGGCCGGGCG CCCCGTAGCT GACGTCCTCC AGGAACTCCG GCAGGCGGGC
CTGGACTCCC TTCCCGGGGG TGGGGCCGAA GTTTTTGCCC CGGAGGTGCG CCGGCGCCTT
TGCGCCAAAA AGATCGATGG ACAGCGATGG CTCCAGATCC ACGAGACGGC CCACCGGCTG
GGCATACCCA CCAACGCTAC CATGCTCTAC GGCCACCTGG AGACGGCAGC CGACCGGGTG
GATCACCTCC TGGCCCTCCG GGAACTCCAG GACCGGACAG GGGGCTTCCT GGCTTTTATT
CCCCTGGCCT TTCACCCGGC CAACACGGCC TTCAGCAACC TGCCGGGAAC CACCGGCGTA
GATGACTTGA AGATGTTGGC CATCAGCCGC CTGCTTCTGG ACAACTTCCG TCACATCAAA
GCCTTCTGGA TTATGATTGG GCCGAAGTTA GCCCAGGTGG CCCTGCATTT TGGGGTAAAC
GACCTGGACG GTACCGTCCG GGAAGAGCAT ATCTTTCACG ACGCCGGGGC TGAAACGCCC
CAGTACCAGC CTGCTGAAAG CTTTTTACAG AGTATCCGGG CGGCCGGCCG GATCCCGGTG
GAAAGGGATA CTTTATACCG GGAAATCCGC CGCTATGCCT GA
 
Protein sequence
MPTTTAEFIR RELSRGELAD IAARVYAGER LTREDGIRLW ESQDLLGIGY LADLARQRTC 
GDTVYFINNA HINYTNICQN LCDLCAFGRQ TGTPGAYTLT LAEIEAKARA AAAAGVTEIH
IVGGLNPELP YDYYLELIRT VRRAAPGACI QAFDAVEIDF IASRAGRPVA DVLQELRQAG
LDSLPGGGAE VFAPEVRRRL CAKKIDGQRW LQIHETAHRL GIPTNATMLY GHLETAADRV
DHLLALRELQ DRTGGFLAFI PLAFHPANTA FSNLPGTTGV DDLKMLAISR LLLDNFRHIK
AFWIMIGPKL AQVALHFGVN DLDGTVREEH IFHDAGAETP QYQPAESFLQ SIRAAGRIPV
ERDTLYREIR RYA