Gene Moth_1452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1452 
Symbol 
ID3831338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1495326 
End bp1496564 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content57% 
IMG OID637829385 
Producthypothetical protein 
Protein accessionYP_430305 
Protein GI83590296 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.275657 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTTT CACCCGCCAT TGAAGAGAGT CTTTTTCCAC CCCTTTATCA AAATTTACAT 
GAAAAAATAA CCGGTGCCCT GGGAGAAGCT ACCCTACGAT CTTGCCTTAC CTGCGGCGTC
TGCAGCGGCG GCTGTCCCAC CGGCGATATA GGCGCACCTG TTGATCCCCG TAAAATCGTC
CGCCTGCTCC TATGGGGGAT GGAGGACAAA GTCCTGGCAT CGGACATGAT CTGGCTCTGC
ACCATGTGTG GCCGCTGTAC GGTTTACTGC CCGGTAGGTG TGAATATGGG CGACCTGGTC
CGGGCCCTGC GCAGCCACCT GGCGGAGGAA GGCCGGGTCC CCGAAAATTT ACAAAAAGTT
GTTGATCTAG CAGTTACTTC TGGTAATAAT ATGGGGATCA GCCGGGAGGA TTATCTCGAT
ACCCTGGACT GGATGCAAGA AGAACTCCAG GCTGAGTTCG GCCCCCGGGC GGAAATACCG
GTAGATAAAA AAGGCGCCAG GGTAATGTAC GTTATCAACC CGCGGGAAGC AAAGTTTTTC
CCTCTATCCA TCCTGGCGGC CGCCAAGGTG TTTTATGCCG CCAGCGAGAG CTGGACCCTC
TCCAGCCGCT CCTGGGACGC GACCAACTAC GCCCTTTTCT CCGGGGACGA CAAAGCCGGC
GCTATCCTGG TGCAGCGCCT GGCGGATGAG GTGGAGCGCC TGGGCTGCCA GGAGTTGATC
ATGACCGAGT GCGGCCATGC CTTCCGCGCC ATCCGCTGGG GGCCCGAACG CTGGCTGGGG
CATAAACTCC CCTTCCCCGT ACGCAGTATT GTCCAGCTAA TGGCCGAATA CCTGGATGCA
GGCCGTATCC GCCTGGACCC CTCCCGCAAC AGCGAGCCGG TAACCTATCA TGACCCCTGC
AATCTAGGCC GCAAGGAAGG TATCTTTGAA GAACCGCGGC GGGTACTGCA GGCAGCGGTC
ACCGATTTTC GCGAAATGAC GCCGAACCGT GAGAATAACT ACTGCTGCGG CGGCGGTGGC
GGCATGCTCT CTTTGAGCGA GTTCGGCCAG GAACGTCTGG CCAAAGGCAA GGTTAAAATA
GAGCAAATTC AGCGCACCGG GGCCGGGATA GTGGCTACTC CCTGCCACAA CTGTGTTGAT
CAATTAAATG ACCTTTGCCG TCATTATCAT CTCAATGTTA AAGTTAAGAA CCTGGTCGAA
TTGGTAGCCG ATGCCCTGGT AATCGCTGGT AAGGAGTGA
 
Protein sequence
MPFSPAIEES LFPPLYQNLH EKITGALGEA TLRSCLTCGV CSGGCPTGDI GAPVDPRKIV 
RLLLWGMEDK VLASDMIWLC TMCGRCTVYC PVGVNMGDLV RALRSHLAEE GRVPENLQKV
VDLAVTSGNN MGISREDYLD TLDWMQEELQ AEFGPRAEIP VDKKGARVMY VINPREAKFF
PLSILAAAKV FYAASESWTL SSRSWDATNY ALFSGDDKAG AILVQRLADE VERLGCQELI
MTECGHAFRA IRWGPERWLG HKLPFPVRSI VQLMAEYLDA GRIRLDPSRN SEPVTYHDPC
NLGRKEGIFE EPRRVLQAAV TDFREMTPNR ENNYCCGGGG GMLSLSEFGQ ERLAKGKVKI
EQIQRTGAGI VATPCHNCVD QLNDLCRHYH LNVKVKNLVE LVADALVIAG KE