Gene Moth_2305 details


Gene Information

Locus tag: Moth_2305
Symbol:
ID: 3831419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2422717
End bp: 2424909
Gene Length: 2193 bp
Protein Length: 730 aa
Translation table: 11
GC content: 60%
IMG OID: 637830229
Product: hypothetical protein
Protein accession: YP_431135
Protein GI: 83591126
COG category: [C] Energy production and conversion
COG ID: [COG1139] Uncharacterized conserved protein containing a ferredoxin-like domain
TIGRFAM ID: [TIGR00273] iron-sulfur cluster-binding protein
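
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this) and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2422717
end_bp = 2424909

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the final codon is assumed to be a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 2193
print(protein_length)  # 730
```

Under those assumptions the computed values agree with the listed Gene Length (2193 bp) and Protein Length (730 aa).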


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCGGTA AAGAATTCAA GCAACGTATC CGGCAGGCCC TGAATAACGC CAGCCTGCGG 
GGAGCCCTTG GTCGCTTTGC CGATTCCTAT GTAGTTTCCC GGGAAGAGGT TTATGCCGGC
CGGGATTTTG AATCCTTAAG GCAGAGGATT GCCGCTATCA AGGCTGATGC CGCCGGCCGT
TACGAGGAAC TGGCCGACCG GTTCAGCCGG GCGGTGGAGG CCCGAGGCGG CAAGGTGTTC
CGAGCTAAGG ACGCAGCGGC CGCCAGGGAA TATATCTACC AGGTAGCTAA AGAACACGGC
GTTACAGAAA TCGTCAAGTC CAAGTCCTTT GCTTCGGAAG AGATCCACCT GAACGAATTC
CTCCAGGAAC GGGGTATCAA TCCCTACGAA ACCGACCTGG CCGAGTGGAT CCTCCAGCTC
ATGCCCGGGG AGAGGCCTTC CCATATGGTC ATGCCGGCCA TTCACCTCCC CAAGGAGGAG
GTCGCCCGGG TCTTCAGCCG TTACCTGGGT GAACCGGTAG AACCTGATAT TAAAAATATG
GTCCGTATCG CCCGCCGGGA GTTGAGGAAA AAATTTCTGA CTGCCGGCAT GGGCATCAGC
GGCGCCAACA TTGCCGTGGC TGAAACGGGG ACCATTGTCC TCTGCACCAA TGAAGGCAAC
GCCCGCTTGA CTACTACTGT ACCACCGGTT CACGTGGCCA TTGTCGGCTA CGAGAAGCTG
GTGCCCAGCA TTAAAGATAT TGTTCCCATC CTTGAGGCCC TGCCCCGCAG CGGTACGGCT
CAGCCCATTA CCAGTTACGT AACCATGATC ACCGGTCCGG TGCCGGCCTG GCGGGGCGAA
GGCGAGGGTA TTAAGGAACT GCACGTCGTT CTTCTGGACA ACGGGCGCAC CAGGATGGCC
GCCGACCCGG TCTTCAAGGA GGCCCTGCAG TGTATCCGCT GCGCCTCCTG CACCAACGTC
TGCCCGGTCT TCCAGCTGGT CAGCGGCCAG GTCTATGGTT ATATTTACAA CGGCGGTATC
GGCAGTGTCC TCACGGCCTT CTTCAATTCC CTGGAAGACG CCGTCGACCC CCAGAGCCTG
TGTATCGGCT GCCGGCGCTG TGCCGAGGTC TGCCCGGCAA AGATTAATAT CCCCGATTTG
GTGTTGAAGC TTCGGGAGCG GGTCGTCACG AAGCAGGGGC TTTCCAGCGG CTACCGGATC
GCCCTCCACG GCATAGTGGC TAAACCGAAG CTGATGCACA CCCTCCTGCG GGCGGCCTCC
CGCCTCCAGG GTCCGGTAAC CCACGGCCAG CCCCTGATCC GGCACCTGCC TCTCTTCTTC
AGCAACTTGA CCTCCGGCCG CAGCCTGCCG GCCATTGCTA AGGAGCCCCT GCGGGACCGG
GTCAAACGCC TGGAGGCCCG TACCGGCAGG CCGCGTCTCA AGGCCGCCTT TTACAGCGGT
TGCGTAATTG ACTTTGCCTA CCCGGAGATC GGTGAGGCTG TTTATAAAGT GCTGGGGCGG
GAAGGGGTGC AGGTAACATT TCCCCAGGGC CAGGCCTGCT GCGGTGCCCC GGCAGTTTAT
GCCGGCGATC GGGAGACGGC GGTGAAGTTG GCCAAACAAA ATATTACCGC GCTGGAAGAG
GCCCGGGCCG ATGTTGTGGT CACCGCCTGC CCTACCTGCG CCGTGGCCCT GAAAAAGGAC
TTTCCCGAGC TCCTGGCCGG CGAACCGGCC TGGGAGGAGC GGGCCAGGGC CCTGGCGAAG
AAGGTAAAAG ACTTTACCGA GCTGGTCCAT GAATTAACTG GCGGGCAGGG GAAAAAGGTC
AAGCAGGCTA AAAAATCCGG CAGCGGGGCA GTAAAGGTTA CCTATCATGA CTCTTGTCAT
TTCAAGCGGC ACCTGGGCCT GGACCAGGTG GCCAGGCAGG TTTTAAAGGA ACAGCCGGGG
GTGGAACTGG TAGAGATGCA GGAAAGCGAC CGCTGCTGCG GTTTTGGTGG TTCTTATAGC
ATCAAGTACC CGGAGATCAG CGCCCCTATC CTGGAACGCA AGCTGAAAAA TATTACCGAA
AGTGGTGCGC AAGTGGTGGC TGCGGACTGC CCGGGCTGCG TCCTGCAGCT ACGCGGCGGC
CTGGATCAGA AGGGCAGCTC TATCAAGGTC AAGCATACGG CTGAGGTGCT GGCGGCCTTG
GAGAACTTGC AGGGTGGCGG GAACAGTAAA TAA
 
Protein sequence
MAGKEFKQRI RQALNNASLR GALGRFADSY VVSREEVYAG RDFESLRQRI AAIKADAAGR 
YEELADRFSR AVEARGGKVF RAKDAAAARE YIYQVAKEHG VTEIVKSKSF ASEEIHLNEF
LQERGINPYE TDLAEWILQL MPGERPSHMV MPAIHLPKEE VARVFSRYLG EPVEPDIKNM
VRIARRELRK KFLTAGMGIS GANIAVAETG TIVLCTNEGN ARLTTTVPPV HVAIVGYEKL
VPSIKDIVPI LEALPRSGTA QPITSYVTMI TGPVPAWRGE GEGIKELHVV LLDNGRTRMA
ADPVFKEALQ CIRCASCTNV CPVFQLVSGQ VYGYIYNGGI GSVLTAFFNS LEDAVDPQSL
CIGCRRCAEV CPAKINIPDL VLKLRERVVT KQGLSSGYRI ALHGIVAKPK LMHTLLRAAS
RLQGPVTHGQ PLIRHLPLFF SNLTSGRSLP AIAKEPLRDR VKRLEARTGR PRLKAAFYSG
CVIDFAYPEI GEAVYKVLGR EGVQVTFPQG QACCGAPAVY AGDRETAVKL AKQNITALEE
ARADVVVTAC PTCAVALKKD FPELLAGEPA WEERARALAK KVKDFTELVH ELTGGQGKKV
KQAKKSGSGA VKVTYHDSCH FKRHLGLDQV ARQVLKEQPG VELVEMQESD RCCGFGGSYS
IKYPEISAPI LERKLKNITE SGAQVVAADC PGCVLQLRGG LDQKGSSIKV KHTAEVLAAL
ENLQGGGNSK
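
The gene sequence starts with GTG, yet the protein sequence starts with M. This is consistent with translation table 11 (listed above), in which GTG is one of the alternative initiation codons and is read as methionine at the start of a coding sequence. The sketch below illustrates this on the first ten codons and the last five codons of the gene. It assumes the standard NCBI codon ordering (bases in the order T, C, A, G) and the table 11 start-codon set; the function name `translate_table11` is hypothetical and not part of any library.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); table 11 uses
# the same codon-to-amino-acid assignments for non-initiating codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

# Initiation codons of translation table 11 (assumed read as M at position 1).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna, is_cds_start=True):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence above.
head = translate_table11("GTGGCCGGTA AAGAATTCAA GCAACGTATC")
# Last five codons (not a CDS start, so no initiator override).
tail = translate_table11("GGGAACAGTAAATAA", is_cds_start=False)

print(head)  # MAGKEFKQRI
print(tail)  # GNSK
```

Both fragments match the corresponding ends of the protein sequence listed above.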