Gene Moth_1807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1807 
Symbol 
ID3830725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1860063 
End bp1862135 
Gene Length2073 bp 
Protein Length690 aa 
Translation table11 
GC content53% 
IMG OID637829734 
Producthypothetical protein 
Protein accessionYP_430650 
Protein GI83590641 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000440945 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAATA TAGTTACGCT GGTGAAAAAA GATAACGCTG GCAAATTCGA CTGGGCTCCG 
TTGTACGAGG CTTACGTCAG CGGCGCGAAC GGCGACCCGC TAAACGAACG CGATGATGGT
TGGATGGATG GGTGCTGTCC TCTACATGAC GACACCAGAC CGAGCTTCTC TTTTAACCGC
TGGTCCGGGT ACTGGCTCTG CCGGGCAGGG TGCGGCGCGG GTTGGCCGCT GGACTTTTTG
GAGGCGGTCG CTGGCCTCGA CGGTGACGAT GCACTAGAGG AAATCCGGGA ATTGTGTGGG
TCCTTGCCGC CGGAGATAAT CCCCGGCCCG CTTACCTTGA CGGCCTACGC TGTCTACTGT
AACCTACCGG TATCCTTCCT TCAGAAGTTG GGACTGGCGG AATGTGCTAA AGGCGTCAAA
ATACCCTACT GCACCGTGGA CGGCCAGGTT TTCCGCTACC GTTACCGCCT GAGTTTGAAT
AAACAGGGCA GCCGGTTCGC TTGGGGTAGC GGCAAGGGTA TCCTGCCATA CGGTTTGGAG
AACTTGCATT TGGCTAAGAA GGCAGGTTAT TTACTCCTGG TCGAGGGAGA ATCTGATGTG
CAAACTCTGT TGTATGCTGG TATTCCCGCT CTAGGAAGTC CCGGAGTTGC TGCGTGGCAG
AAGGAATGGT CAGCTTTAAT CCCTGAAGGA GTGCAAATTT ACATCTGGCA GGAACCGGAT
GAGGGTCACG TCCTTGTGGA AAAGGTTTTA CGTGCCTTCC CGGACGCGAA AATTATTAAA
ACCACTGTGG AGGAAAAAGA CCCCCGGCAG GTATGGTTGA ACTCCAGGGA TAAGGCAGAT
TTCGTGCAGC TTATTAACGA GTTATTACAA ACAGCAACCA CGGCAGATGA CCTGCAGGAA
GCCGCCAGGA GGACAGAGCG GGAAGCGGTC TGGCAGAAAT GTAAAGACTT AGCAATGGAG
CCTGATATCT TAACACCTGT TTTAAATACC CTTAGTCAAG TCGTTGCTGG CGAACGCGAG
GCCTTGGCCA TCCTTTACTT AGCGTTGACC TCACGGTTAT TACCTCGACC CATTAACCTC
CTGCTTCAGG CCCCACCGGG GGCAGGCAAA AGTTATTTGG TAGACTGTGT TTTACAGATG
TTTCCAGAAA GTGCATATTA TAAATTGACC GCTTCCAGTG AGCGGGCTTT TATTTATTCG
GATGAAAATT TCGCTCACCG GACAGTTGTC GTAGCCGAAG CAGCAGGCTT GCATTCGGAT
GGTGTCGCGG GAACAATTAT ACGTGAGTTA GTTTGGAGTA GCCAGTTGGC TTATGAAGTT
GTAGAGAAAA CCCCTGACGG CCTGCGGCCA CGTAAAATTA TCAAAGATGG CCCAGTCGGC
CTAATTACCA CGACTGTTAA AAATGTTGAA GGCGAGCTAG CCACACGCCT ACTGGTCGTG
GAACTGAAAG ACACCCCGGA GCAAACCAGG CTAATTCTGG AAGCGGAAGC ACGGGAAGCT
GCTGGACAGG CTACCATGCC AGATTTAAGC CATTTCGCGG CCCTACAGAA GTGGTTGGAG
CTAAATGGGC CAGCGAATGT AATCGTACCC TACGCTGAAA CCCTAGCCAG GCTATTGAAG
CCAAGCAGCG TACGGTTGCG CCGTGATTTC CGGCAGTTGT TGACGTTGAT AATGGCTAAC
GCCGTGCTAC ACCGGGCGAG CCGCCAAACT TCTAGCAGTG GAGCGATAAT CGCCTCCATC
GACGATTACG CGGCCATTTA CCCCCTGGCG GTTGCTCTGT TCGCCAGCAC TGGCGAAGCC
ACCCTAACGC CGCAGCAACG TGAGGCGGTA GAAGCCGTTC GCCGGTATTA CGAGCAGTAC
CATACGTCGG TCACTGTCAA GGCTTTAAGT AAATTACTGG GGATCGATAG GACTTCCACT
CAACGCCGGG TAGCTGCGGC TATCAAGAAA GGTTTCCTGG TAAATCTGGA AGATAAACCC
CACAGACCAG CGATGTTAGT GCCAGGCGAT ATGGCGGCTG AGGAAGACAA CTCTTTACCA
GAGCCAGAGA TGGTAGCCAG GATGTCCAGC TAA
 
Protein sequence
MGNIVTLVKK DNAGKFDWAP LYEAYVSGAN GDPLNERDDG WMDGCCPLHD DTRPSFSFNR 
WSGYWLCRAG CGAGWPLDFL EAVAGLDGDD ALEEIRELCG SLPPEIIPGP LTLTAYAVYC
NLPVSFLQKL GLAECAKGVK IPYCTVDGQV FRYRYRLSLN KQGSRFAWGS GKGILPYGLE
NLHLAKKAGY LLLVEGESDV QTLLYAGIPA LGSPGVAAWQ KEWSALIPEG VQIYIWQEPD
EGHVLVEKVL RAFPDAKIIK TTVEEKDPRQ VWLNSRDKAD FVQLINELLQ TATTADDLQE
AARRTEREAV WQKCKDLAME PDILTPVLNT LSQVVAGERE ALAILYLALT SRLLPRPINL
LLQAPPGAGK SYLVDCVLQM FPESAYYKLT ASSERAFIYS DENFAHRTVV VAEAAGLHSD
GVAGTIIREL VWSSQLAYEV VEKTPDGLRP RKIIKDGPVG LITTTVKNVE GELATRLLVV
ELKDTPEQTR LILEAEAREA AGQATMPDLS HFAALQKWLE LNGPANVIVP YAETLARLLK
PSSVRLRRDF RQLLTLIMAN AVLHRASRQT SSSGAIIASI DDYAAIYPLA VALFASTGEA
TLTPQQREAV EAVRRYYEQY HTSVTVKALS KLLGIDRTST QRRVAAAIKK GFLVNLEDKP
HRPAMLVPGD MAAEEDNSLP EPEMVARMSS