Gene Moth_0511 details

Gene Information

Locus tag: Moth_0511
Symbol:
ID: 3831813
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 530025
End bp: 531188
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 59%
IMG OID: 637828445
Product: hypothetical protein
Protein accession: YP_429384
Protein GI: 83589375
COG category: [S] Function unknown
COG ID: [COG2718] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02877] sporulation protein YhbH
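
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 530025
END_BP = 531188
GENE_LENGTH_BP = 1164
PROTEIN_LENGTH_AA = 387


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced, complete CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1164
print(expected_protein_length(GENE_LENGTH_BP))  # 387
```

Both results agree with the Gene Length and Protein Length fields reported in the record.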


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGTGG AGTACAACCT TTCCCGGGAA GACTGGTCCC TGCACCGCAA GGGTTACCTT 
GATCAGCAGC GGCACCAGGA AAAGGTCCGG GAAGCCATTA AAAAAAACCT GCCTCACATC
ATAGCTGAAG AGAGTATCAT CATGGGCCGG GGTAAAAAGG TGGTCCGGGT GCCCATCCGC
AGCCTGGAGG AGTATCACTT CCGCTTTAAC TACAACCAGG GGCAGCATGC CGGCCAGGGC
AGCGGCGGTA CTCGGAAGGG GACCGTTATT GGCCGTGAGG TCATCGAAGG CGCTGGCGGG
GGGGCCGGGG CCGGCGACGA ACCGGGGATG GATTACTATG AGGCCGAGGT CACCCTGGAA
GAGGTCCAGG AGATGCTCTT CCGGGACCTG GAGCTTCCCA ACCTCCGGGA GAAAAAGAAA
CCGGTGATGG CTTCCCCCGC TTATGAGTTT CGCGACGTGC GCCGCAAGGG CCTCATGGGC
AACCTGGATA AAAAACGTAC CCTCCTGGAA AACCTCAAGC GTAACGCCAT GAAGGGCAAG
CTGGCCATCG GCGGTATTAC GCCGGAGGAC CTGCGCTTTA AAACCTGGGA GGAGAAGATC
CGCTACGAGA CCAGCGCCGT AGTCCTGGCC ATGATGGATA CCTCCGGCTC CATGGGGACA
TATGAGAAGT ATATCGCCCG CACCTTCTTC TTCTGGATGG TGCGTTTCCT GCGCAGCAGG
TACCAGCAGG TGGAACTGGT CTTTATCGCC CACCATACCC AGGCCCGCGA GGTAACGGAG
GAAGAGTTCT TCGCCAAGGG TGAGAGCGGC GGTACCCGCT GCTCCTCGGC CTACCGCCTG
GCCCTGGAGA TCATCGACCG GCGCTATCCA CCGGCGGATT ACAATATCTA CCCCTTCCAT
TTTACCGACG GCGACAACCT CCCCAGCGAC AATGAGGCCT GCCTGGAGGC TGTCCAGGAA
CTCCTGCCCA GGGTCAACCT CCTGGGTTAC GGGGAGATTG TCAATCCCTA TTACCGTACC
AGCACCCTGA TGAATGTCCT CAAGCGCATC AAGGACGACC GCCTGGTGAC GGTGGCGGTC
AAGGATAAGA GCGAGGTCTA CCAGGCTCTG CGGCAATTCT TTGCCGGGTC AAAAGGGGGA
GAAGCTGGTG GAACAAGAAT TTAA
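
The gene sequence is laid out in blocks of 10 bases, so the whitespace has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the GC content field of the record. The helper names `clean_sequence` and `gc_content` are hypothetical, and the sketch assumes the sequence contains only the letters A, C, G and T.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Toy input in the same 10-base block layout (not the gene itself).
print(clean_sequence("GGCC AATT\nGC"))  # GGCCAATTGC
print(gc_content("GGCC AATT\nGC"))      # 60.0
```

Pasting the full gene sequence into `gc_content` gives a value to compare with the reported figure; the record rounds to a whole percent.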
 
Protein sequence
MPVEYNLSRE DWSLHRKGYL DQQRHQEKVR EAIKKNLPHI IAEESIIMGR GKKVVRVPIR 
SLEEYHFRFN YNQGQHAGQG SGGTRKGTVI GREVIEGAGG GAGAGDEPGM DYYEAEVTLE
EVQEMLFRDL ELPNLREKKK PVMASPAYEF RDVRRKGLMG NLDKKRTLLE NLKRNAMKGK
LAIGGITPED LRFKTWEEKI RYETSAVVLA MMDTSGSMGT YEKYIARTFF FWMVRFLRSR
YQQVELVFIA HHTQAREVTE EEFFAKGESG GTRCSSAYRL ALEIIDRRYP PADYNIYPFH
FTDGDNLPSD NEACLEAVQE LLPRVNLLGY GEIVNPYYRT STLMNVLKRI KDDRLVTVAV
KDKSEVYQAL RQFFAGSKGG EAGGTRI
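
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons. Because this gene begins with ATG, no special handling of alternative start codons is included. The name `translate` and the use of `X` for unrecognized codons are choices made for this example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
print(translate("ATGCCGGTGG AGTACAACCT TTCCCGGGAA"))  # MPVEYNLSRE
print(translate("GAAGCTGGTG GAACAAGAAT TTAA"))        # EAGGTRI
```

The two fragments reproduce the first ten and last seven residues of the protein sequence listed above.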