Gene Moth_0501 details


Gene Information

Locus tag: Moth_0501
Symbol:
ID: 3832824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 518159
End bp: 519379
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 44%
IMG OID: 637828435
Product: hypothetical protein
Protein accession: YP_429374
Protein GI: 83589365
COG category:
COG ID:
TIGRFAM ID:
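
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that contributes no residue. Both are assumptions about how the record was generated, not something the record states. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 518159
end_bp = 519379
gene_length_bp = 1221
protein_length_aa = 406

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not counted as a residue.
residues = codons - 1

print(span, remainder, residues)  # 1221 0 406
```

Under these assumptions the reported gene length (1221 bp) and protein length (406 aa) agree with the coordinates.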


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000000761016
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTGCC AGTATTGTGG TAAAAAAATA GAAGCGGGGT CCCGCTTTTG TAGTGGTTGT 
GGACATGAAC TTAGGTCAAT TAATAATGAA GATACAGTTG TTCTGGCCCG GCCGTCCCTG
GATCAAGTCA AACAGGAGGA AGGATCGGAG CATACAACGA CCCAACAGCA GGTACCGGCC
AAGGGTAAAA GCATCTGGCT ATTACCTTTA GCTACCGCGG TTTTAGTAGC GGTGGTATTG
GGTGGATATT ATGCCTATGA ACAGTATATT AACCGGCTGG TGGAGCAAGA TCGGGTGCAA
GCAGAAAATT TGGCCCTGCA GGGAGATCTG GATAAGGCGG AAAAACTGAT TAGCAATGCT
TTAAACAAGC GGCCCCGGCA CAAAACACTG CAGGCTGACC TTGCATATGT TAGAGAAGGT
CAGGAGGTGC AAAGTGAGTT AAATGAAGCC TGGGAACATA CCAAGCAGCA GCAATTTAAT
CAAGCCCTGT CCTTAATTGA ACAGGCGGAG AAAAAAGTGT CCGGTAAGGA AGGAGATTTT
TACAAATACC TTAGTCAATT AATTAATGAC AAAAAAGCCG CGGTCAATAT TATGCAGGTT
AAACAGGAGA TGAATAATAA GAATTCTATC GAGGAACTGG CTGCTTTATT GACTAAAATA
TCTTCCTTTA AAGTTAAGGA GGCCCAGGAA GTAGCGAAAG TTATAAAGAC CAAAATTTGC
CAATTAATTT ATACCAAAGC CAATGAATTG TTAAAGAAAA AGGATTTCGC CGGTGCACTG
GCTTTAGTGC AGCAGGGACT GGGCTACGAT AGCGAGAACC AGCAGTTGTT ATCATTCCAA
AAAACCATTG AACAACAGAA GGCGGCCTTT GAGCAGAATG AGCAAATGAT TCTGGAACAA
GCCCAACTGG CTGCCGCCCG GGAAAACGCC ATAAATCATA CCCAGGCTGT AGAAGTGTTA
AAATGCGACG GCAGTGTTAC TGCCCAAGGT GATTTCCGGG TATGGGGAAC CGTACGCAAT
GTAGCCACAC GTCCCATCTA CATGGTGGAG ATTTACTATA CCGTCTATGA TGCTGCCGGT
AATGCCCTGA CCACTGATAG TACCTATGTT TATCCTAATT ATTTAAATCC CCGGGATCAG
GGTAGCTTTG ACAACACTTC TTACGGGTTG TGGCAGGGTA ATCGAGTAAA AATTAACAGG
ATCACCTGGT ATTTAAGGTA G
 
Protein sequence
MFCQYCGKKI EAGSRFCSGC GHELRSINNE DTVVLARPSL DQVKQEEGSE HTTTQQQVPA 
KGKSIWLLPL ATAVLVAVVL GGYYAYEQYI NRLVEQDRVQ AENLALQGDL DKAEKLISNA
LNKRPRHKTL QADLAYVREG QEVQSELNEA WEHTKQQQFN QALSLIEQAE KKVSGKEGDF
YKYLSQLIND KKAAVNIMQV KQEMNNKNSI EELAALLTKI SSFKVKEAQE VAKVIKTKIC
QLIYTKANEL LKKKDFAGAL ALVQQGLGYD SENQQLLSFQ KTIEQQKAAF EQNEQMILEQ
AQLAAARENA INHTQAVEVL KCDGSVTAQG DFRVWGTVRN VATRPIYMVE IYYTVYDAAG
NALTTDSTYV YPNYLNPRDQ GSFDNTSYGL WQGNRVKINR ITWYLR
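
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes the amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). The `translate` helper and `CODON_TABLE` name are hypothetical and written for this example; they are not part of any database tooling.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) with their amino acids.
# Assumption: amino-acid assignments of table 11 equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTTTTGCC AGTATTGTGG TAAAAAAATA"))  # MFCQYCGKKI
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would allow the whole listing to be compared in the same way.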