Gene Moth_2341 details

Gene Information

Locus tag: Moth_2341
Symbol:
ID: 3832059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2458296
End bp: 2459960
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 45%
IMG OID: 637830264
Product: hypothetical protein
Protein accession: YP_431170
Protein GI: 83591161
COG category:
COG ID:
TIGRFAM ID:
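
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon; neither is stated on the page, although both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2458296
END_BP = 2459960
GENE_LENGTH_BP = 1665
PROTEIN_LENGTH_AA = 554


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions, 2459960 - 2458296 + 1 gives 1665 bp, and 1665 / 3 - 1 gives 554 aa, matching the record.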


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.920789
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGGAAG TCTATCTGGT TCTATGTGGA ATAATCGATA AAAAATGTAG CTGGTTACCG 
GCTACTTGGG GAGGGTGTGG ATTGGAAAGA AGGTTAATTA GACGCGGACC TCTAAGCAAG
AGATTACTAA TGCTCCTCAT CTTGACTATA CTTGCCGGCA GCTTTGCCTT TTCAGGAAAT
GCCTTAGCTA ACGACAATGA AGAAAATGAA CCTGAGAAAA TTAATAATGA GGTAGCGGCG
GAGGTCAGGA ATATCAGTTA TCAACTTGAT AAAGGTGCGA TGGATCCGGG TACGGCAATC
AGCCTTTTAA TTAGCGAGAT AGATGCACTT GAGGAAATTG ACAAAGTCGA TATTGAAAGT
GCCACCCTGG ACGAGGTAAA AAAAGCGGTG TCACTAATCG TTCAAGAAAT GCCTTTACTT
CCGAATTCTC TTCTAAAGGT TTGGTTTACA GGCAGCAAGG CTGTGGTAGA TATAGCTGAA
AACGCTATTC CCTACCACCT AGGAAGAGTG ATGGATGACG CTAAAAAGCT GGAAGAACAT
CTAGATAAGC TCGGGTTAAC GGAACAGGCG CAAGACCTAG AAAACGCCAG GCCCTATAAC
AATTTACTCT TAAAAATACC TGAAGTGGTC GCTAAAAAGG AAATCCAAGT AAATGTGCCT
GTAAGTGTTG CCAAAGACCT TATTGCCAGG AACTTGAATT TAAAAATTCA AGGTGATGGT
ATTTCTTTTT TAATGCCTGC AAAAGATTTT GATTTTAGTG ACGGAACTGC TAGAGTAGCT
ATTATCTACA AAGAAATGGA GAGCAAGGAC CTCCCTAACC TTGACGCGAA CTTGACTCTC
CATTTTGCCA GCAGAATATA TGCTGTCAAG GCTTATCGGC TGGCAGTTAA CGGGGCTTCG
TCACCGGCAG ATTTTAGCCA GCGCGTTCGG ATGGCCCTGT CATTTGATCT CGGTAAAGAC
AGCGATCCTG CGCGCTTAGG TGCTTTCCGA TATGACGAAA CGAAAAAAAG ATGGATATAT
ATAAAGGGTG TTAATGCTAC TGCTGGCATT ATTGAATTAC ATCTTGAAAA TGAAGGTATA
TATGCCGTCA TCGATTATCA AAAGACCTTT GAAGACATTC AAAAACACTG GGCTAAAGCG
GATATTGAAG CCATGGCCGC CAGGCAGATA GCAATAGGGG TTACTACCAG TCTATTTATG
CCTGATGAAC CCCTTAGCAG GGCCCAATTT ACCGCTTTAT TATTAAGATC TCTTAACAAA
AAAGAAATAA CTCCGACTAC GCCTACCTTT AAAGACGTGA ACTCAGGCGA CTGGTTTTAT
GGTGCCGTTG AAGCCGCTTA CCGGGCGGGA CTGGTTTCGG GTAAGGAAAA AGATATCTTT
GCTCCGCGAG ATAATATCAC CAGGGAGGAA ATGGTCGTTA TGCTGGCCCG GGCTCTAAAG
GCTGCGGGCT ACCATGGTAC GGGTAATGTG GGCAGCCTTG AGTGCTTCAG AGACTATGCA
GTTATTTCCG ATTGGGCCAA GGGTGCCTTT GCTTCCTGCC AGGACGCAGG GATCAATTTG
ACTCTACCAG ATGGCAACTT GCACCCCCAT GACGATGCTA CCCGGGCCCA GGCCATGGTA
ATGTTAAAGG GATTTTCTGA TGCGATCCAA ACTCTGGACA GATAG
 
Protein sequence
MGEVYLVLCG IIDKKCSWLP ATWGGCGLER RLIRRGPLSK RLLMLLILTI LAGSFAFSGN 
ALANDNEENE PEKINNEVAA EVRNISYQLD KGAMDPGTAI SLLISEIDAL EEIDKVDIES
ATLDEVKKAV SLIVQEMPLL PNSLLKVWFT GSKAVVDIAE NAIPYHLGRV MDDAKKLEEH
LDKLGLTEQA QDLENARPYN NLLLKIPEVV AKKEIQVNVP VSVAKDLIAR NLNLKIQGDG
ISFLMPAKDF DFSDGTARVA IIYKEMESKD LPNLDANLTL HFASRIYAVK AYRLAVNGAS
SPADFSQRVR MALSFDLGKD SDPARLGAFR YDETKKRWIY IKGVNATAGI IELHLENEGI
YAVIDYQKTF EDIQKHWAKA DIEAMAARQI AIGVTTSLFM PDEPLSRAQF TALLLRSLNK
KEITPTTPTF KDVNSGDWFY GAVEAAYRAG LVSGKEKDIF APRDNITREE MVVMLARALK
AAGYHGTGNV GSLECFRDYA VISDWAKGAF ASCQDAGINL TLPDGNLHPH DDATRAQAMV
MLKGFSDAIQ TLDR
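
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It is not the tool used to produce this record. It assumes the gene sequence is given in the coding orientation and uses the amino-acid assignments of translation table 11, which match the standard code for elongation. It does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene starts with ATG. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence and drop a trailing stop codon."""
    cds = clean(cds)
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First and last display lines of the gene sequence above (both start in frame).
first_line = "ATGGGGGAAG TCTATCTGGT TCTATGTGGA ATAATCGATA AAAAATGTAG CTGGTTACCG"
last_line = "ATGTTAAAGG GATTTTCTGA TGCGATCCAA ACTCTGGACA GATAG"

print(translate(first_line))  # MGEVYLVLCGIIDKKCSWLP
print(translate(last_line))   # MLKGFSDAIQTLDR
```

The two printed strings correspond to the start and the end of the protein sequence listed above.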