Gene Moth_1245 details

Gene Information

Locus tag: Moth_1245
Symbol:
ID: 3833040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1285978
End bp: 1286985
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 63%
IMG OID: 637829181
Product: radical SAM family protein
Protein accession: YP_430102
Protein GI: 83590093
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID:
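
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent. The short Python sketch below checks this, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1285978
end_bp = 1286985

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1008
print(protein_length)  # 335
```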


Plasmid Coverage information

Num covering plasmid clones: 66
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.394622
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTTTTAT CCTGGAATAC GACCAACCAG TGCAATCTTT ACTGCGATCA CTGCTACCGG 
GATGCCGGCG CCAGGGTAGA GGACGAGTTG ACCACCGCCG AGGCCGGCAA TCTAATAGAC
GAAGCCGCCA AAGCCGGCTT TAGGATTATG ATCTTTAGCG GCGGCGAACC CCTGCTGCGG
CCCGACCTGC CGGAGCTGGT GAGCCGGGCG GCAGCCAGGG GGTTGCGCCC GGTCCTGGGA
AGCAATGGTA CCCTCCTCAC CACCGAACTG GCCCGAGAAT TAAAGGCTGC CGGAGCCCTG
GCCGTTGGCA TCTCCCTGGA CAGCTGCGAT CCCGCCCGCC ACGACCGCCT GCGGCAAAAG
GAGGGTGCCT GGCGAAAGGC CGTCGCCGGA ATGGCGGCCT GCCGGGAAGC CGGCCTTCCC
TTCCAGGTCC ATACAACTGT ATTTGATTGG AATCAGGACG AACTGGAAAA ACTGACCGAT
CTGGCGGTGG AACTGGGAGC CGTGGCCCAT CACTTCTTTT TCCTGGTGCC CACCGGCCGG
GCAGCGAGTA TCGAAGCCGA GTCGCTGCGG GCCGCCGAAT ACGAGGCCAC CCTTAAACGC
ATTTTACAAA AGCAGCAACA GGTGAAGATC GAGTTAAAGC CTACCTGTGC TCCCCAGTTT
ATGCGTCTGG CCCGCCAGCT GGGGATACCG GTGCGCTACC AGCGCGGCTG CCTGGCCGGT
ATCGCCTATT GCATCATCAG CCCCCGGGGG GATGTCCAGC CCTGCGCCTA CTTGAACCTG
CCGGTGGGCA ACGTGCGGGA GGTACCCTTC AGCCAACTCT GGCGGGAGAG CCCGGTCTTC
CAGCGCCTGC GCACGGAAGA GTACAGCGGC GGTTGCGGTC GCTGCGGCTA TAAAAAGATA
TGCGGCGGCT GCCGGGCCCG GGCCTGGTAT TATCACGGCG ATTATATGGC CGAAGAACCC
TGGTGCCTCT ACCAGGGCCG GCAGGACGCG GCGGCGCACG ACAATTAA
 
Protein sequence
MLLSWNTTNQ CNLYCDHCYR DAGARVEDEL TTAEAGNLID EAAKAGFRIM IFSGGEPLLR 
PDLPELVSRA AARGLRPVLG SNGTLLTTEL ARELKAAGAL AVGISLDSCD PARHDRLRQK
EGAWRKAVAG MAACREAGLP FQVHTTVFDW NQDELEKLTD LAVELGAVAH HFFFLVPTGR
AASIEAESLR AAEYEATLKR ILQKQQQVKI ELKPTCAPQF MRLARQLGIP VRYQRGCLAG
IAYCIISPRG DVQPCAYLNL PVGNVREVPF SQLWRESPVF QRLRTEEYSG GCGRCGYKKI
CGGCRARAWY YHGDYMAEEP WCLYQGRQDA AAHDN
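
The gene sequence starts with GTG, yet the protein sequence starts with M. This is expected under translation table 11, where GTG is an accepted alternative start codon and the initiating codon is read as methionine. The Python sketch below illustrates the idea on the first 30 bases of the gene sequence. It is a minimal illustration, not the database's own pipeline: the function name `translate_cds` is hypothetical, and the codon table is the standard one that table 11 uses for non-initiating codons.

```python
# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses the
# same amino acid assignments and differs in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a valid first codon as M.

    Whitespace (such as the 10-base blocks in this record) is ignored.
    Translation stops at the first stop codon. Codons containing letters
    other than A, C, G, T raise KeyError in this simple sketch.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiating codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
first_block = "GTGCTTTTAT CCTGGAATAC GACCAACCAG"
print(translate_cds(first_block))  # MLLSWNTTNQ
```

The printed peptide matches the first ten residues of the protein sequence in this record. Passing the full gene sequence to the same function should reproduce the whole protein under the assumptions stated above.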