Gene Moth_0833 details

Gene Information

Locus tag: Moth_0833
Symbol:
ID: 3831530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 863966
End bp: 865033
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 61%
IMG OID: 637828763
Product: hypothetical protein
Protein accession: YP_429693
Protein GI: 83589684
COG category:
COG ID:
TIGRFAM ID:
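The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(863966, 865033)
print(length, protein_length_aa(length))  # prints: 1068 355
```

Both results agree with the Gene Length and Protein Length fields of the record.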


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.637029
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTTAC CCGGCAACTT CCAGGCGGCA GGCATCGGCA GCCTGCCCTA CCTCGAGCCT 
GGACCGGCCC TGGACCTTAT CTTTAAAACC ACCCCTGTCA TCCCCCACTG GCCTCAACTA
CCCAAACGGG GCCACCAGGA ACACTTTGTT TACCAGAGCC TGGCCCCCCT GGTGCGCCTT
GGCTTGATTA AAGAAAATCC CGGTGGGATG CCAGCCTTTA CCGATGCTGA CGCCGGGTGG
ACCGACAGGT TAACGGATTT CTATAGCCTA TACCTGGAGG CCGAGGCCGG CGATGGGGAG
GCCCTGGCAG CCTTTGCCAT TCCGCGGGAA GCCGGGATTG GTTTTTATGC CCTGCTGGAA
TACCTGGAGC AAAAGGGCCC CGGCGAAGCC CGCTTCTTAA AGGGGCAGGT TGCCGGACCT
ATTACCGCCG GCCTATATTT GACCGATAGT GCTGGCAGGA GTTCCTTTTA CGACCCCCAG
CTGCGGGACC TCATTGTCAA AACGACGGCC ATGCAGGCCT GCTGGCAGGC CCGTGAATTG
GGTCGCTTTA ACCTTCCAGT CCTGGTGTTT GTCGACGACC CTGCCCTGGC GGCCTATGGT
ACCTCCACCC ATGTAGCCCT AAAACGGGAT GACCTCCTGG CGGCCCTGGC GGGTGTCGTA
GCCGGTATTG AAGCCGGGGG CGGACTGCCC GGGGCCCATT CCTGCAGCGG GGTGGAGTGG
CCCGTCTTTT TTGAAGCAGG TTACCGGATC TTAAGTTTTG ACGCCTATAA TTATTTTACT
TCCCTCCAGG TTTTCGCCTC TGATGTGGCT GCCTTCATAG CGCAGGGCGG GGTCCTGGCC
TGGGGGATTG TGCCCACCTA TGAACAGGCC TGGCAGGAGA CTCCTGCCAC CCTGGCTGCG
AAACTCCAGG AGCAGGTCGG AGAACTGGCC CGGCGGGGTG TGGACCGGGA GCGCCTCTGC
CGCCAGGCCC TGGTCACCCC CTCCTGCGGC ACCGGCGTCC TGGAAGAAGA CCTGGCAGAA
CATATCTACG GCTTGATGGC AGCTGTCGCC GAAATAATGG GCAGGTGA
 
Protein sequence
MFLPGNFQAA GIGSLPYLEP GPALDLIFKT TPVIPHWPQL PKRGHQEHFV YQSLAPLVRL 
GLIKENPGGM PAFTDADAGW TDRLTDFYSL YLEAEAGDGE ALAAFAIPRE AGIGFYALLE
YLEQKGPGEA RFLKGQVAGP ITAGLYLTDS AGRSSFYDPQ LRDLIVKTTA MQACWQAREL
GRFNLPVLVF VDDPALAAYG TSTHVALKRD DLLAALAGVV AGIEAGGGLP GAHSCSGVEW
PVFFEAGYRI LSFDAYNYFT SLQVFASDVA AFIAQGGVLA WGIVPTYEQA WQETPATLAA
KLQEQVGELA RRGVDRERLC RQALVTPSCG TGVLEEDLAE HIYGLMAAVA EIMGR
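
The gene sequence is shown in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that, compute the GC fraction, and translate the coding sequence. It assumes the sequence is given on the coding strand, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons). The helper names `join_blocks`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene sequence above.
start = join_blocks("ATGTTTTTAC CCGGCAACTT C")
print(translate(start))  # prints: MFLPGNF
```

Applied to the full gene sequence, `gc_fraction` can be compared with the GC content field, and `translate` with the protein sequence listed above.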