Gene Moth_2295 details


Gene Information

Locus tag: Moth_2295
Symbol:
ID: 3831327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2408846
End bp: 2410225
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 64%
IMG OID: 637830215
Product: hypothetical protein
Protein accession: YP_431125
Protein GI: 83591116
COG category: [S] Function unknown
COG ID: [COG2078] Uncharacterized conserved protein
        [COG3885] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00296] uncharacterized protein, PH0010 family
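
The start, end, gene length, and protein length fields above are tied together by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2408846, 2410225)
print(length, protein_length(length))  # 1380 459
```

Under these assumptions the computed values agree with the Gene Length (1380 bp) and Protein Length (459 aa) fields.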


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACGGT TGCTGGATGT AGGCTTTATG CCCCACCCGC CAATTATGGT CCCGGAGGTC 
GGGCGCGGCG AGGTGGCAAG GATTAAGGCT ACGGTGGCAG CGGCCAGGGA GCTGGCAGCC
AGGGTGGCTG CCCATCAGCC GGAGGTAATA ATTATCATTT CCCCCCACGG CCCGGTCTTC
CGCGACGCCG TGGGTATCTG GGCGACGCCG GAACTGGCAG GCGACCTGGC GGCCTTCCGG
GCGGGGGAAG TCCGGTTTAA GTACAGCCTG GACCTGGATT TGAGCCGCGC TATTGCAGCA
AAGGCTAGGG AGGAGGGGAT AGCAGTTGCC TGGCTTGATG CCCGGGCCAG CAGTAGTTAC
GGCCTGACCC CGGAACTGGA CCACGGCATG ATGGTTCCTC TGTATTTTTT ACGCCAGGCC
GGCCTGGAGG TACCCCTGGT GGCTATGGGC ATGGCCTTTA TGGAACGGGA AAAACTCTAT
GCCTTTGGCG CCGCCCTGGC GAGGGCCGTC AAAGATAGCC CGCGCCGGGC CCTCCTGGTG
GCCAGCGGCG ATATGTCCCA CCGCCTGCTG CCCGGAGCCC CGGCAGGTTA CGACCCCCGG
GGCAAAGTCT TTGATGCCAG AATAAGGAAT TTGCTCGCAG CCCTGGATGT CGAAGGGATC
CTGGCCATAC CGGAAGACCT GGCCGAGGGA GCCGGCGAGT GCGGCCTGCG TTCCTTTATT
ATGGGGCTGG GGGCCCTGGA CGGCTACCGG GTTAAGGGAG AAGTCCTTTC CTATGAAGGA
CCCTTCGGCG TCGGCTACCT GGTGGCCCAC CTGGAACCCG GCGAGGAGGC CCCGGAGCGC
AGCCTGCTGG CCCGGGAGAC GGCTGCTGCC AGGGAAGAAT CCCTGCCGGT GCGCCTGGCC
CGGCAGAGCC TGGAACACTA CCTGCGCACG GGCAAAGTGT TACCGGTTCC CGCCCCGCTG
CCCCCGGAAC TTGCCGGCCG GGCCGGCGTC TTCGTATCTC TAAAGAAAAA CGGCCAGTTG
CGGGGCTGCA TTGGCACCAT CAGCCCCACC CGGGAGAACC TGGCCGGGGA GATTATCTAC
AACGCCCTGG CGGCAGGCCT GGAAGACCCC CGTTTCCCCC CGGTAACGGT CGACGAATTA
CCCGAGCTCC AGTATTCGGT GGATGTCTTA AGTGAACCAG AACCGGCCAC CGTAGCGGAC
CTGGACCCGA AGGTTTACGG GGTTATTGTC AGCTGTGGTC ACAGGCGGGG GTTGCTCCTT
CCCGACCTGG AGGGCGTGGA TACCGTGGCG GAACAGGTGG CCATTGCCCG TCAGAAGGGC
GGCATCGGCC CCGATGAGCC CTACCGGCTG GAAAGGTTCA AGGTAACGCG TTACCATTAA
 
Protein sequence
MGRLLDVGFM PHPPIMVPEV GRGEVARIKA TVAAARELAA RVAAHQPEVI IIISPHGPVF 
RDAVGIWATP ELAGDLAAFR AGEVRFKYSL DLDLSRAIAA KAREEGIAVA WLDARASSSY
GLTPELDHGM MVPLYFLRQA GLEVPLVAMG MAFMEREKLY AFGAALARAV KDSPRRALLV
ASGDMSHRLL PGAPAGYDPR GKVFDARIRN LLAALDVEGI LAIPEDLAEG AGECGLRSFI
MGLGALDGYR VKGEVLSYEG PFGVGYLVAH LEPGEEAPER SLLARETAAA REESLPVRLA
RQSLEHYLRT GKVLPVPAPL PPELAGRAGV FVSLKKNGQL RGCIGTISPT RENLAGEIIY
NALAAGLEDP RFPPVTVDEL PELQYSVDVL SEPEPATVAD LDPKVYGVIV SCGHRRGLLL
PDLEGVDTVA EQVAIARQKG GIGPDEPYRL ERFKVTRYH
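
The gene and protein sequences above are printed in blocks of 10 letters, so they need to be cleaned before any computation. The sketch below shows one way to strip the blocking, compute GC content, and translate codon by codon. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons). All names here (`clean`, `gc_percent`, `translate`, `CODON_TABLE`) are hypothetical helpers, not part of any database tool.

```python
# Codon table built in the conventional T, C, A, G order.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from position 0 in frame, stopping at the first stop codon.

    Alternative start codons are not special-cased; this gene starts with ATG.
    """
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGGGACGGT TGCTGGATGT AGGCTTTATG CCCCACCCGC CAATTATGGT CCCGGAGGTC"
print(translate(clean(first_line)))  # MGRLLDVGFMPHPPIMVPEV
```

The printed peptide matches the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` would give a value to compare with the GC content field.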