Gene Moth_2238 details

Gene Information

Locus tag: Moth_2238
Symbol:
ID: 3831284
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2336397
End bp: 2337566
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 50%
IMG OID: 637830158
Product: hypothetical protein
Protein accession: YP_431068
Protein GI: 83591059
COG category:
COG ID:
TIGRFAM ID:
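
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; both assumptions match the listed values but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2336397, 2337566)
print(length, protein_length_aa(length))  # prints: 1170 389
```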


Plasmid Coverage information

Num covering plasmid clones: 58
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0102712
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGATC TGCCAGCCAC CTTATATACA GGGGAACTGT TCTCTAATAC TGCTCCAGTA 
ATCATTCCGA GAAAGTATGA ATATCTCCCT GGAATTTGTG CTTTCTGTAT GTCCGGCGAA
TTCACCAAAG CTTTGCGAGC GATTAATCCT AATGTTGCAG TTGAAAATGG TTATGTTAGT
AAAATCCATT GGGATCCAGG CAAGTGGCTA TATTCCGATA CAACAATACC TTTGCCTAAA
GCTTATGCAG ATGCACCAAC CCAATGGATC TTTAAGGGGA CAATAACATC GTCCATTGCT
CCACTCCATG TCGCTATAGC CCGACTTTTA GGTTATCAAT GGCCCGAACA TGAACTGGAT
GAATTAGATG AATTGAGCAA TTCAGACGGC ATAGTGTGCA TCCCTGCCGT TCGCGGAGAA
GACCCGGCTG CCGAACGGCT GCGGGCGTTG CTCGCCGCCG CATATGGGAA TGACTGGTCG
CCAACAAAAG AGCAGGAACT TATCGCAGCA ACGGGTTCAG AAGCAAAGGA TCTTGACGAG
TGGCTTCGCA ACGATTTCTT TGAACAGCAC TGTAAGCTAT TCCACCACCG GCCCTTCATC
TGGCATATCT GGGACGGGCG CCGGCGCGAC GGTTTCCATG CCTTGGTCAA CTACCATAAG
CTTGCCGAGG GCAATGGCAA AGGCAGGCAG GTTTTAGAAA GCCTTACTTA TAGTTATCTC
GGTGAGTGGA TTACCCGGCA GAAAGATGGG GTGAAGCGGG GCGAAGGTGG TGCTGAGGAT
CGACTGGCAG CGGCATTGGA ATTGCAAAAA CGCCTTATTG CCATCCTCGA AGGTGAACCT
CCATTCGACA TTTTCGTTCG CTGGAAGCCT ATCGAAAAGC AGCCCATCGG CTGGGAACCG
GATATCAACG ACGGCGTGCG CATCAACATC CGGCCGTTTA TGGCTTCTGA CATTCCCGGC
GGCCGGAAAG GTGCCGGCAT CCTCCGGTGG AAGCCCAATA TTAGCTGGGG CAAAGACCGT
GGCAAAGAGC CTGTGCGCCC ACAAGAACAG TTTCCCTGGT TATGGAAGAA TGGTAAGTTC
ACCGGCGATC GGGTCAATGA TGTGCATTTA ACCAGTGAAG AAAAAAGAAA AGCACGTGAG
ACAATGAACA GGAAGACGAG GAGTAAATAG
 
Protein sequence
MRDLPATLYT GELFSNTAPV IIPRKYEYLP GICAFCMSGE FTKALRAINP NVAVENGYVS 
KIHWDPGKWL YSDTTIPLPK AYADAPTQWI FKGTITSSIA PLHVAIARLL GYQWPEHELD
ELDELSNSDG IVCIPAVRGE DPAAERLRAL LAAAYGNDWS PTKEQELIAA TGSEAKDLDE
WLRNDFFEQH CKLFHHRPFI WHIWDGRRRD GFHALVNYHK LAEGNGKGRQ VLESLTYSYL
GEWITRQKDG VKRGEGGAED RLAAALELQK RLIAILEGEP PFDIFVRWKP IEKQPIGWEP
DINDGVRINI RPFMASDIPG GRKGAGILRW KPNISWGKDR GKEPVRPQEQ FPWLWKNGKF
TGDRVNDVHL TSEEKRKARE TMNRKTRSK
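
Both sequences above are printed in space-separated groups of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-percentage calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups across lines."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, in the page's grouped layout.
first_line = "ATGCGTGATC TGCCAGCCAC CTTATATACA GGGGAACTGT TCTCTAATAC TGCTCCAGTA"
dna = clean_sequence(first_line)
print(len(dna))  # prints: 60
```

Applying `gc_percent` to the full cleaned gene sequence would give a value to compare against the GC content field in the record.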