Gene Moth_0210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0210 
Symbol 
ID3831361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp205740 
End bp206831 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content64% 
IMG OID637828146 
Producthypothetical protein 
Protein accessionYP_429088 
Protein GI83589079 
COG category[S] Function unknown 
COG ID[COG3535] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTCAA GGATAATCCT TGATAATGAG GTCGTGGAGG CGGCCGTCCT GGGAGGGGCC 
GTCCTGGGCG GAGGCGGGGG CGGCTCCATG GAAATGGGGC GCCAGGCCGC CCGCCTGGCG
GTAGAACTCG GCAGTCCCGA ATTGATAACC CTGGATTCCC TCCCTGAGGA CGCCGTCCTC
CTTACCGTTT CGGCTGTAGG CGCGCCGGCG GCCAGGACGG TCTATGTCAA GCCGGTGCAT
TATATCCGTA CCGTGGAGTT ATTTCAAAAG TACACGGGCC AGGAGATCCG GGGCTTAATC
ACCAATGAGT GCGGCGGCCT GGCTGCTGTC AACGGCTGGC TGCAGGCCGC CGCCCTGGGG
ATACCGGTTG TCGATGCCCC CTGCAACGGC AGGGCCCACC CCACCGGGGT CATGGGCTCC
ATGGGCCTGC ACCGGCTGTC CGGTTATGTT TCCCGGCAGG TGGCCGTTGG CGGCAACCCG
CAAACCAATA GCTATGTGGA GGTTTTCGCC TCCGGGTCCC TGGAGACTGC CGCCGCCCTG
GTCCGGCAGG CCTCGGTGCA GGCCGGCGGC ATGGTGGCCG TGGCCCGGAA CCCGGTTACC
GCCGGTTATG CCAGGGAGAA TGCCGCCCCC GGAGCCATCG GCAGGTGTAT TGCCGTGGGC
CGGACCATTA TCGAGAACCG GTCCAGGGGG CCCCTGCCGG TCATCGAAGG GGTGGCCGGG
GTTTTACAGG GAGAGATTGC CTTTACAGGC CGGGTAGCCG CCGTCGACCT GGAAACGACC
GGAGGTTTCG ACGTCGGCAG GGTGGTTGTC CGGGATGGTG ACAGGCTGGC AGAACTCACC
TTCTGGAACG AGTATATGAC CCTGGAAATC GGCAGTGTGC GGAAAGGGAC CTTTCCTGAC
CTGCTGGCCA CCATGGACCT GACCACCGGC CTGCCCTTAT CTTCGGCGGA GATCAAGGCC
GGCCAGGAGA TAGCCATTCT ACACGTTCAC CGCGACCGGC TGATCCTGGG GCGGGGCATG
AAGGCCCCCG AACTCTTCCA GGTGGTCGAA AAGGCCACCG GCAAAGAAGT AATCAAGTAT
ATCTTCTCAT AG
 
Protein sequence
MGSRIILDNE VVEAAVLGGA VLGGGGGGSM EMGRQAARLA VELGSPELIT LDSLPEDAVL 
LTVSAVGAPA ARTVYVKPVH YIRTVELFQK YTGQEIRGLI TNECGGLAAV NGWLQAAALG
IPVVDAPCNG RAHPTGVMGS MGLHRLSGYV SRQVAVGGNP QTNSYVEVFA SGSLETAAAL
VRQASVQAGG MVAVARNPVT AGYARENAAP GAIGRCIAVG RTIIENRSRG PLPVIEGVAG
VLQGEIAFTG RVAAVDLETT GGFDVGRVVV RDGDRLAELT FWNEYMTLEI GSVRKGTFPD
LLATMDLTTG LPLSSAEIKA GQEIAILHVH RDRLILGRGM KAPELFQVVE KATGKEVIKY
IFS