Gene Moth_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0039 
Symbol 
ID3830905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp39307 
End bp40602 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content57% 
IMG OID637827971 
Producthypothetical protein 
Protein accessionYP_428921 
Protein GI83588912 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000157521 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTTAAAA CCTTTCGCCA GCGGGAAGAA GATATTGTCC GTTGTAATAG ATGCGGTTTT 
TGTGAGGAAG TTTGTCCCAC CTACAAGGCG ACGGGGGAAG AGTTTTCCCT GGCCCGGGGA
CGTAACCGTT TAATGCGCCA GTCCATGGAG GGCAAACTGG ATTTAACGAA AGAGCCCGAG
ATCAACCAGC ATATCTATTC CTGCCTTCTT TGCGGCGCCT GTGTAGCGGC CTGCCCCTCG
TCCGTCATCA CCGACACCCT GATCAAGACC GCCCGGGCCG AAATTACCCG GGCCAAGGGC
CAGCCCTTCC CCATCCGCAT GGCTTTGCGG GGGGTCCTGG CCAACCAGCG GCGCCTGACC
CTGGGGGCCA AAGTCCTGCG CTTCTACCAG CGCAGCGGCG CACGCTGGCT GGCCCGGCAT
ATCGGTTTTC TTAACTTGAT GGGTTCCCTG GGCAAGGCCG AGGGGCTGCT GCCGGCCATC
CCCGAGAAAA CCCTGCGCGT CCAGTTACCC CAACTCTTAA AGAAGCCGAT GAAGCCCCGG
CATAAAGTCG CCTACTTTGC CGGCTGCATG ATTAACAACT TTTTTACTGC TGTTGGCGAG
GCCACCCTGC GGGTTTACCA GGAAAACGAT ATCGAAGTAG TAGTGCCGAC CAGCAACTGC
TGCGGCATCC CCCATGAGGC CTATGGCGAT ATAGAGATGC AAATAAAACT GGCCAAAGAA
AATCTGGACG CCTTCAGCCG CTATGAGGTT GAAGCAATTG TCACCGATTG CGCCAGCTGT
GCCCACGGCC TTCACAGTTA CGCCGAACTC CTTCAGGACG ATCCCCATTA TGGTCCCCTG
GCGGCGCAGC TAGCGGCTAA AGTAAAGGAT GCCTCTCAGT ACCTGGTCGA GATTGGCTTT
AAAAAGGAGA TGGGGCCGGT CAACGCTACC GTAACTTACC ACGATCCCTG CCATGCAGCC
CGGGGCCTGA AGGTCAAGGA GCAACCGCGG GAGATCTTGA AGAGTATCCC GGGGGTTAAA
TTCGTCGAGA TGAATGAATC CGACTGGTGC TGTGGCGGTG CCGGTTCCTA TAACGTAACC
CACTACGAAC TATCACGTAA GATCCTCGCC CGCAAGATGG ATAACTTTAA GAAGACCGGA
GCCGAATACC TGGCAACCTC CTGCCCGGCC TGCCTCATGC AACTGGCCCA CGGCCTGGAT
GTCTACCGCT TGTCTGGCAA AGCAATCCAT GTTATGCAAA TATTGGACCA GGCCTACCAG
AACCGGGCCG TCCGGAGCAA GGCCAAGGCC GGCTGA
 
Protein sequence
MVKTFRQREE DIVRCNRCGF CEEVCPTYKA TGEEFSLARG RNRLMRQSME GKLDLTKEPE 
INQHIYSCLL CGACVAACPS SVITDTLIKT ARAEITRAKG QPFPIRMALR GVLANQRRLT
LGAKVLRFYQ RSGARWLARH IGFLNLMGSL GKAEGLLPAI PEKTLRVQLP QLLKKPMKPR
HKVAYFAGCM INNFFTAVGE ATLRVYQEND IEVVVPTSNC CGIPHEAYGD IEMQIKLAKE
NLDAFSRYEV EAIVTDCASC AHGLHSYAEL LQDDPHYGPL AAQLAAKVKD ASQYLVEIGF
KKEMGPVNAT VTYHDPCHAA RGLKVKEQPR EILKSIPGVK FVEMNESDWC CGGAGSYNVT
HYELSRKILA RKMDNFKKTG AEYLATSCPA CLMQLAHGLD VYRLSGKAIH VMQILDQAYQ
NRAVRSKAKA G