Gene Moth_2112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2112 
Symbol 
ID3833263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2205599 
End bp2207437 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content62% 
IMG OID637830037 
Productferredoxin 
Protein accessionYP_430947 
Protein GI83590938 
COG category[R] General function prediction only 
COG ID[COG3894] Uncharacterized metal-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCAAGAG TACTAGTTGA TTTCCAGCCA GTGGGCCGGC GGGTAGAGGT TGACGCCGGC 
CAGACCATTT TGTCTGCTAT CCAGCAACTC GGTCTCTCCC TGGGGGCGGG GGGGCTGACC
GCTCCCTGCG GTGGCCGGGG CCTGTGCGGT CGCTGCCGGG TGCGGATTGC GTCCGGGGAG
GTAGGGGAAG TAAACCCGGC GGAGCGGCGC TTCCTCACAC CGGCTCAACT GGAAAAGGGT
TATCGCCTGG CCTGCCAGGC AACGGTCATC GGCCCCGTCA AGGTAGAGAT CCCGCCGGAA
TCTATGCTCG GAGTTCAGAA ACTTCAGGTC GAGGGGCTGG ATATTGCAGT TACTCCCGAG
CCCCCTGTTA AAAGATACAA CCTGCCGCTA AGTAAGACGA CCATTGAAAA CCCCCTCCCC
CTCTGGCAGC AGGTGACCGG CGAGCTGGAA GCCACCTACG GCCTGGACCG GCCCGGCGTG
GACTTTGGCC TGGCCCGGGA ATGGAAGCCA ATGGCTACGG AGGGAGAATG CGTGGTAACG
GTGCGCGGGC CGGAGGTTAT CAATATTTAT ACCGGCCGCC TGGCCCCGCC GCCGGTGGGT
CTGGCCGTCG ACCTGGGGAC CACCAAGGTG GCCGGCTTTC TCATAAACCT GGAGACAGGT
GCTACCCTGG CGGCCGACGG TATTATGAAT CCCCAGATTA CCTACGGCGA GGATGTCATG
GCCCGGCTGG GCTATGCCCT GGAGGGCGAA GAAGAGTACC GGCGCATCCA GGAGGTGGAG
ATCGAGGGAT TGAACCGCCT GGCCGCTACC CTGGCGGCGA AGGCCGGGGT GGCGACGACG
GATATTGAGG AAGCCGTCAT TGTCGGTAAC ACCGCCATGC ATCACCTGCT TCTGCACCTG
CCGGTAGAGC AGCTCGCCCG GGCGCCCTAT GTACCCGCTT TGACTACACC GGTAGAAATA
AAGACCCGGA ACCTTGGCTT AAACTTAAGC CCGGGGGCCT TTGTTTACCT GCAACCGGTA
ATCGCCGGTT TTGTTGGCGG CGATCATGTG GCCATGATCC TGGGCAGCCG GATTGATGAA
GCCCGTAAGG TCACCCTGGG ACTGGATATC GGCACCAATA CGGAGATTGT CTTAAGTTAT
GGCGGTAAAA TGCTCTCCTG TTCCTGTGCC TCCGGGCCGG CTTTTGAAGG CGCCCATATC
GCCCAGGGGA TGCGGGCCGT TACCGGGGCT ATTGCCGCCG TCCGCTTAAG CGACGACGGC
CGGGATGTTT TCTGGGAAAG TATTGGTGGC GTACCACCCC TGGGTATCTG CGGTTCCGGC
ATCCTGGACG CCGTGGCCGA GCTTTACCGT ACCGGCATCC TCAACGCCAG CGGTCGGCTG
GACCTCAACC ACCCGCGGGT GAGGCGACCT GCCGGAGGTG GACCGCCGGA ATTCCTCCTG
GTACCGGCGG CGGAAACCGG CATCGACGGC GACCTGGTGG TGACCCAGAA GGACATCAAT
GAGATTCAGT TGGCCAAAGC CGCCATAGCC ACCGGGACCC TCCTGCTCCT GGAGGCAGCC
GGGTTAACAG TAAAGGACCT GGAGGAGGTC GTCGTGGCCG GCGCCTTTGG CACCCATCTT
AAGCTGGAGA GCGCCATCAC CATCGGTATG TTTCCCAACC TGCCTTTAAC CGCCTTCCGC
CAGGTTGGCA ATGCCGCCGG CACCGGAGCG CGGCTGGCCC TCCTTTCCCT GACGGAGAGG
AAGCGGGGCG AAGCCATTGC CAGGCAGGTC GGCTATCTTG AGCTCATGAC CCGGCCGTCC
TTCCACGAAG TATATGTTAA ATCCCTTCTC TTGCCCTGA
 
Protein sequence
MARVLVDFQP VGRRVEVDAG QTILSAIQQL GLSLGAGGLT APCGGRGLCG RCRVRIASGE 
VGEVNPAERR FLTPAQLEKG YRLACQATVI GPVKVEIPPE SMLGVQKLQV EGLDIAVTPE
PPVKRYNLPL SKTTIENPLP LWQQVTGELE ATYGLDRPGV DFGLAREWKP MATEGECVVT
VRGPEVINIY TGRLAPPPVG LAVDLGTTKV AGFLINLETG ATLAADGIMN PQITYGEDVM
ARLGYALEGE EEYRRIQEVE IEGLNRLAAT LAAKAGVATT DIEEAVIVGN TAMHHLLLHL
PVEQLARAPY VPALTTPVEI KTRNLGLNLS PGAFVYLQPV IAGFVGGDHV AMILGSRIDE
ARKVTLGLDI GTNTEIVLSY GGKMLSCSCA SGPAFEGAHI AQGMRAVTGA IAAVRLSDDG
RDVFWESIGG VPPLGICGSG ILDAVAELYR TGILNASGRL DLNHPRVRRP AGGGPPEFLL
VPAAETGIDG DLVVTQKDIN EIQLAKAAIA TGTLLLLEAA GLTVKDLEEV VVAGAFGTHL
KLESAITIGM FPNLPLTAFR QVGNAAGTGA RLALLSLTER KRGEAIARQV GYLELMTRPS
FHEVYVKSLL LP