Gene Moth_0578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0578 
Symbol 
ID3832491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp602425 
End bp603462 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content62% 
IMG OID637828519 
Producthypothetical protein 
Protein accessionYP_429451 
Protein GI83589442 
COG category 
COG ID 
TIGRFAM ID[TIGR02867] stage II sporulation protein P 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTATCC AGGTCATCAC CAGCCGGCAG GCCCGGGCCC TGGGAGCTTT CCTCAAAGGG 
GTGCTCCTGG CCGGGTTAAT TATTATGGGA CTGGTCACCG GGGCCTGGCA ATTGATCCGG
ACCGGTAGCC AGGTGCTGTC CGGCTCCCCC GCCAGGCTAC CTATGCCCCT GCTGGAGGGC
ATCCTGCGTG AGGGTTTGCC GGCCCTGGGC CTGCAGGCAG GAGAAGCCCC CCGGAATACT
TCCGGTGAGG CCCTGACAAC GGCCCTGCAG GTCCTGGCCT CGCCCCTGGT AGTTTTACCT
GGACAGGAGA AGACCCCGAG TCTCCTGACA ACTGAAGAGG AGTATGTCCA GCCGGAACCG
CCGGTAGAAG AAAGCCCGCC CCCGGCGCCA TCAGAGGTCG AAACCACCAG CTCCAAAAAC
CCCCTGGTGG CTATATACAA CACCCATAAC GCGGAATCCT ACCAGCCCAG TGAGGGTAGC
GCCAAGTTTC CCGGCAAAAA CGGCGGCGTC AGCCAGGTGG CTGCCACCCT GGCCGACAGC
CTGAGTAAAG ATTATGGAAT CCCGGTGGTT CGTTCAACCA CCATCCATGA TTATCCCGAT
TTTACCCTGG CTTATGCCAA CTCCGAGAAA ACCTTGAAGC GAATGCTGGC CGCCAACCCC
TCGGTCCTGG TGGCCCTGGA CGTCCATCGG GACGCCGGCC TGCCGTCACC GCCTGTTGTC
GAAATTGACG GCCAGAAAGT CGCCCAGGTG CTCATCATCG TCGGCAGCAA CGCCCGGTTG
GAACACCCCA ACTGGCGCCA GAATGAGGCC TTTGCCAGGC AACTGGCCAA AAAAATGGAT
GAACTCTATC CCGGGCTGTG CCTGGGAGTC CGGGTCCAGG AGGGGCGCTA CAACCAGCAT
CTCCTGCCCC GGGCTCTGCT CCTGGAATTC GGCAGCGACA ATAATACCCT CCAGGAGGCT
GAAGGTTCCG CCCGCCTGGT AGCCAGGGTC CTGGCGGCGG TAATCAAAGA CCTGCGGCAG
GAAAACCCGG CCAGTTGA
 
Protein sequence
MRIQVITSRQ ARALGAFLKG VLLAGLIIMG LVTGAWQLIR TGSQVLSGSP ARLPMPLLEG 
ILREGLPALG LQAGEAPRNT SGEALTTALQ VLASPLVVLP GQEKTPSLLT TEEEYVQPEP
PVEESPPPAP SEVETTSSKN PLVAIYNTHN AESYQPSEGS AKFPGKNGGV SQVAATLADS
LSKDYGIPVV RSTTIHDYPD FTLAYANSEK TLKRMLAANP SVLVALDVHR DAGLPSPPVV
EIDGQKVAQV LIIVGSNARL EHPNWRQNEA FARQLAKKMD ELYPGLCLGV RVQEGRYNQH
LLPRALLLEF GSDNNTLQEA EGSARLVARV LAAVIKDLRQ ENPAS