Gene Moth_0626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0626 
Symbol 
ID3832522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp650478 
End bp651590 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content66% 
IMG OID637828569 
Producthypothetical protein 
Protein accessionYP_429499 
Protein GI83589490 
COG category[S] Function unknown 
COG ID[COG3323] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR00486] dinuclear metal center protein, YbgI/SA1388 family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.743702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCAA AGTGCGGCGA GATCATAGCC ATTATGGAAG CCCTGGCCCC GCCGGAACTG 
GCTGCCGGCT GGGACAACGT CGGCCTCATG CTCGGCTCGC CTGAGGCGGA GGTCAGACGC
GTCCTGGTAT GCCTGGACGT AACCCCGTCG GTAGCCGCCG AGGCTGCCGC CCGGGCCGTT
AACCTGATTA TCAGCCACCA CCCCCTCTTC TTCCGGCCGG TGAAGAACCT GCGCTTTGAC
GAGCCCGTGG GAGAACTGGT GCGGCGCCTC CTCCAGGATA ACATCATGGT CTACTCGGCC
CACACCAATA TGGATAGCGC CGACCTGGGG GTCAGCTACC ACCTGGCCTC CAGGCTGGAG
CTGGAGGACA TCCGGGTCCT GGTCCCCACC CACCGTGAGA AGTATTACAA GCTCGTCACC
TTCGTCCCCG AAGACCACGA AAAGGTCGTT CGCGAAGCCC TCACCCGGGC CGGAGCCGGC
TGGATCGGCA ACTACTCCGA CTGCACCTTC CGGGTGGCCG GTACCGGCAC CTTCATGCCC
CTGGCCGGCA CCCGTCCCTA TACCGGTGAA GAGGGCAAAC TGGCGGAGGT CAAAGAGTAC
CGCCTGGAGA CCATCATCCC CACCGGCCGG CTGCCGGAGG TCCTGCGGGC CCTGCTGAAA
GCCCACCCCT ACGAGGAAGT GGCCTATGAC GTGTACCCCC TGGCCAACGA AGGACCGGCC
CAGGGCATCG GCCGCACCGG CGTGCTGCCC CAGGCCGTCA CCCTGGAGGA ATTCGCCCTG
CGGGTGAAGG AGTCCCTGGG GGCCGGCCGG GTCAACCTGG TGGGCGACCG GGAGCGTAAG
GTCAAAAGGG TGGCCGTCTG CGGCGGCGCC GGCAGCGACG TTATGGCCGC CGCCCGGGAT
GCGGGGGCGG AAGTCCTGGT CACCGGGGAC CTCAAGTACC ACGAAGCCCG CACGGCCCAG
GCCATGGGCC TGGCCGTCGT CGACGCCGGC CATTTCGCCA CCGAAAGGCT GATTGTCCCG
GCCCTGGTGA CCTATCTCCA GGAACAGTTG CAGGAGCGCG AGGTGATGGT CCTGGCCTCC
CAACAGGAAC AGGAACCCTG GTACGCATTA TAA
 
Protein sequence
MAAKCGEIIA IMEALAPPEL AAGWDNVGLM LGSPEAEVRR VLVCLDVTPS VAAEAAARAV 
NLIISHHPLF FRPVKNLRFD EPVGELVRRL LQDNIMVYSA HTNMDSADLG VSYHLASRLE
LEDIRVLVPT HREKYYKLVT FVPEDHEKVV REALTRAGAG WIGNYSDCTF RVAGTGTFMP
LAGTRPYTGE EGKLAEVKEY RLETIIPTGR LPEVLRALLK AHPYEEVAYD VYPLANEGPA
QGIGRTGVLP QAVTLEEFAL RVKESLGAGR VNLVGDRERK VKRVAVCGGA GSDVMAAARD
AGAEVLVTGD LKYHEARTAQ AMGLAVVDAG HFATERLIVP ALVTYLQEQL QEREVMVLAS
QQEQEPWYAL