Gene Moth_1842 details

Gene Information

Locus tag: Moth_1842
Symbol:
ID: 3831702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1900313
End bp: 1901323
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 60%
IMG OID: 637829773
Product: hypothetical protein
Protein accession: YP_430685
Protein GI: 83590676
COG category: [S] Function unknown
COG ID: [COG5660] Predicted integral membrane protein
TIGRFAM ID:
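
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1900313, 1901323)
print(length)                     # 1011
print(protein_length_aa(length))  # 336
```

Both values agree with the Gene Length and Protein Length fields in the record.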


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAATTGCA GTCAGTGCCG GGAACTCATA TCACCATACC TGGATGGAGT CTTGAGCGAA 
ACAATACAAC GGGCCCTGGA GAACCACCTT AACTCCTGCC CGGCCTGCCG GGAAGAACTG
GAGGCTATGG GGCAGACAAT CGAGATTATC CGTGCCTGGT CCGAAGAAGA ACTCGACCTG
CCACCCGGTT TTGAGGAACG CCTGCGCTCA CGCCTGGAGG AGTGCCGGCA GCCGTGGTAC
CGACGCCTCT CCCGGAACTG GCTTTCCCTG GCGGCAGCGG CGGCCACTAT CATGGTGGTA
GCAATTACGG CCCGGGCGGA TTACCTCCAC CTGGGTTCCT CCAGGCAAAT CGCTGTCCCC
CATGAGAAAC AGGTGCAGGA ATTGGCCATG ACCCGGGGAG ACCAGCAGGT GACTCCCCTC
AAGGCTCTAC CACCGGTTAC CTCAACGGAT GCCCCGCAGC AATCAGCACC GAAAGTAAAG
GTAAAAGCAG CTACTACCTC GGTACGATCT GCAGTGAGGA ATCTGGAGAG TTCCCACCCC
GATCCGGAAC AACAGCAAAG GAAAATAGTC CCTGGAGGGA CCTTCAACCT CAATTCCCGG
GGCAGAGCAG AGCGAGCAGC TCCGGAGCAG CAGACCGGGG GACAATCAGG AAAGGGTCAG
CCGGACCAGG ATAAAGATAA GGGAAAGGAG AAGGGGCCGG GTCAATCCCG TACTGTACTG
GAAGCAGGGA AGAAAGAAGT TACCCCGAGG GCTGGCGAGG GGGTGGCAGG GGGAACGTCA
ACCATTGCTG GCGATGGGCC GGGAACCGTC AAGACCCCGG CCGGTGATGG GAAAGAGGTC
CCACCTTTAC CACCGGCCGG TGGGAAGGCA ACGCTCCAGG ACCTGACGCC AGGGGTTGGG
CGGCAAAACT CGGCAGCGTC TCCGGATAGC GACCTGCAAA ACCGGACTCT TACCCAGCCA
CCCCCCGCAC CTGTTGCCCC GGCAACTATC CCTAAACCGC CCTCGCCTTG A
 
Protein sequence
MNCSQCRELI SPYLDGVLSE TIQRALENHL NSCPACREEL EAMGQTIEII RAWSEEELDL 
PPGFEERLRS RLEECRQPWY RRLSRNWLSL AAAAATIMVV AITARADYLH LGSSRQIAVP
HEKQVQELAM TRGDQQVTPL KALPPVTSTD APQQSAPKVK VKAATTSVRS AVRNLESSHP
DPEQQQRKIV PGGTFNLNSR GRAERAAPEQ QTGGQSGKGQ PDQDKDKGKE KGPGQSRTVL
EAGKKEVTPR AGEGVAGGTS TIAGDGPGTV KTPAGDGKEV PPLPPAGGKA TLQDLTPGVG
RQNSAASPDS DLQNRTLTQP PPAPVAPATI PKPPSP
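
The relationship between the two sequences above can be illustrated with a short script. This is a sketch only, assuming NCBI translation table 11 (bacterial, archaeal and plant plastid code), which shares the standard codon assignments but accepts alternative start codons such as TTG, each read as methionine at the first position. The helper names (`clean`, `translate_cds`, `gc_content`) are hypothetical; the script uses no external library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons accepted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # any valid start codon initiates with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 nt of the gene sequence listed above.
print(translate_cds("TTGAATTGCA GTCAGTGCCG GGAACTCATA"))  # MNCSQCRELI
```

Passing the full gene sequence to `translate_cds` and `gc_content` would allow a comparison with the protein sequence and the GC content field in the record.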