Gene Moth_1200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1200 
Symbol 
ID3832967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1234450 
End bp1236357 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content59% 
IMG OID637829133 
Productferredoxin 
Protein accessionYP_430057 
Protein GI83590048 
COG category[R] General function prediction only 
COG ID[COG3894] Uncharacterized metal-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCAGT TTGCTGTCAC CTTTTTACCT GATAATATCA CCGTCCGGGT GGCGGCTGGT 
ACCAGCATTA TGGAGGCGGC CAACCAGGCC GGCCTGCCCC TGAAAAGCAC CTGCGGCGGG
GCCGGCACCT GTGGTCGCTG TGCCATCAAG GTCCAGGAGG GGAAAGTTGA GGTCAGGGGA
GGCCATCTTC CGGCCCGTTT AAGGGAAGAG GGCTATTCCC TGGCGTGCCA GACCATGGTT
ATGGGAGATG CTATAATAGC CATCCCACCG GAATCCCGCC TGGGCCGGCA CCAGGTATTA
CTGCAGGATA AAGGACGCCT GCAGGACAGC CTGACTGATC TACTGGGCAG TTATCCCCTT
GACCCGGCGT GTAGCGTTGC CTCCCTGGTC TTGCCGGAAC CGACGCTGAC TGAAAACACC
AGTGATGCCA GCCGCCTGCT GGCCACCCTG CGCAAGGAAA AGGGCCTGGA GGGGCAACTC
GACCTGGTCT TCTTGCAGGA GTTACCGGAT AGCCTGCGCC AGGCCAACTG GCAAGTAGAT
GTTACCCTGG CCGCGGGAAG TAATCGTCTT ATCCGGATCA CCCCCCTGGG GGAAACAAGA
GCCTTCGGCT TGGCTATTGA CCTGGGTACA ACCACGGTAG TGGTATCCCT GCTGGACCTG
CGCAGTGGCG AACGGGTGAT GCGCAAGGGC AGTTATAACC GCCAGGCAGT TTATGGTGAT
GACGTTATCT CCCGGATTAT TCACGCCACC AGTAATGAGG GGGGCCTGGA GGAATTACGC
CAGGCGGCCC TGGCTACCAT TAACGATTTA ATAACAGCGG TCCTGACTTC CGGGGGAATT
GATCCGGCAG AGGTAACGGC AGCAACTATA GCCGGCAATA CCACCATGAC TCACCTGTTG
CTGGGAATAA ACCCCAGGTA CCTCCGCCTG CAGCCCTATA TACCGGCAGC AGCCGAATTG
CCGGTACTGA AGGCCGCCGA GGTGGGGTTA AAGATTAATC CGCTGGCTCC GGTCCAAATC
TTCCCGGCGG TAGCCAGCTA TGTAGGCGGG GACATCGTCT CCGGCGCCCT CTTTACCCGG
ATTGCCAGTA GTGAAGAATT GACCCTCTTC ATTGACATCG GCACCAACGG GGAAATGGTC
CTGGGCAACA GCGACTGGCT GATCTCCTGT GCCTGTTCGG CCGGGCCGGC TTTTGAGGGC
AGCGGCATTA CCTGCGGTAT GCGGGCCATG GAAGGAGCAA TCGAAGGGGT GAGCATCGAC
CCGGATACCC TGGAAGTCGA GTTGGAGGTA ATCGGCGGTG GTCGCCCGTC CGGGATATGT
GGTAGCGGTT TAATTGACTG CCTGGCAAAG CTCCGGCGCG CGGGTATTAT TGACCGCACC
GGCAACTTCC AGGAAGTAGC CACCCCACGG CTGCGTACCA CTGACGAAGG CCCGGAATTT
GTCCTGGCCT GGGCTACCCA GAGCAGTACC CAGCGGGATA TTGTCCTGAC CGCCGCCGAC
ATTAAAAACC TTATTCGTTC CAAGGGAGCC GTTTTCGCCG GCATCCAGAG CCTGCTCAAA
ACAGTCTCCC TTGAAATAGA CGCCATCGAA AGGATCATCA TCGCCGGTGG GTTCGGCAAT
TACCTCCATA TCCCCAATGC CGTAGAGATC GGCCTGTTAC CCGACCTACC GCCGGAGAAA
TATATTTTTG CCGGCAATAC TTCCCTGAAG GGAGCCGAGC TGGCCCTCCT TTCCCAGCCG
GCCTGGCAGG AAACCCTGGA ACTGGCCCGC AGGATGACCT ACCTGGAGCT TTCAGCCGGC
AACCTCTTCA TGGAAGAGTT TGTATCGGCC CTGTTCCTGC CCCATACCAG GCTGGAACTC
TTCCCATCCG TGGGGAATGG GTCCGGCGAC GAAAGGAGGT CGGGTTAA
 
Protein sequence
MDQFAVTFLP DNITVRVAAG TSIMEAANQA GLPLKSTCGG AGTCGRCAIK VQEGKVEVRG 
GHLPARLREE GYSLACQTMV MGDAIIAIPP ESRLGRHQVL LQDKGRLQDS LTDLLGSYPL
DPACSVASLV LPEPTLTENT SDASRLLATL RKEKGLEGQL DLVFLQELPD SLRQANWQVD
VTLAAGSNRL IRITPLGETR AFGLAIDLGT TTVVVSLLDL RSGERVMRKG SYNRQAVYGD
DVISRIIHAT SNEGGLEELR QAALATINDL ITAVLTSGGI DPAEVTAATI AGNTTMTHLL
LGINPRYLRL QPYIPAAAEL PVLKAAEVGL KINPLAPVQI FPAVASYVGG DIVSGALFTR
IASSEELTLF IDIGTNGEMV LGNSDWLISC ACSAGPAFEG SGITCGMRAM EGAIEGVSID
PDTLEVELEV IGGGRPSGIC GSGLIDCLAK LRRAGIIDRT GNFQEVATPR LRTTDEGPEF
VLAWATQSST QRDIVLTAAD IKNLIRSKGA VFAGIQSLLK TVSLEIDAIE RIIIAGGFGN
YLHIPNAVEI GLLPDLPPEK YIFAGNTSLK GAELALLSQP AWQETLELAR RMTYLELSAG
NLFMEEFVSA LFLPHTRLEL FPSVGNGSGD ERRSG