Gene Moth_1461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1461 
Symbol 
ID3831347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1510215 
End bp1511891 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content58% 
IMG OID637829394 
ProductGerA spore germination protein 
Protein accessionYP_430314 
Protein GI83590305 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTCCGCT TGTGGCGCTG GCTGGGACGT AAAAAAACAT CCGGCGGGCA ACTTCCGGGG 
GACGACTCCC GCGCGGTAAA TCACCGTCTG GAGGTTAACA TTGCGTATAT CAAAAAAGCC
TTCGGCCGGA GCGAAGACCT GGTGATCAGG GAGATAGAGC TTCCGGGCAG GAAACTGGCG
CTGGTCTACG TGGAAACCCT CATCGACCGT GATGTGGTCC AGCGGGATAT CCTGCGTTCC
CTCCTGGCGC TACAGACAAT ACCCCTGCCG GAGGACGAGG CAGGGTTCAA CCGGCTGCTC
CGCGCCCGGT TGACCATCGG CGATCTCCAG GAAGAGCAAC TCTGGTCAAA AATAATCACC
GGCCTGCTGG ACGGCAAGGC GGTTTTAATT GGAGAAGGCT TCAGCCGCTG CCTGCTGTTA
AGTGTCGAGG GCTGGGAAAA GAGGCCGGTC GAGGAACCTG TTAATGAAGT TTCCATCCGC
GGCCCCCGCG AGGGATTTGC AGAAAATTTA CCCACCAACA TCTCCCTCAT CAGGCGACGG
CTGCGCGCGC CCGAGCTCCG TTTTGAAACC ATGAACCTGG GCCGGCGCAC CCACACGAAA
GTAGTCATCT GTTACCTGGA AGGCCTGGCT TTACCCGGCG TCCTCGAAGA ACTCCGCCGC
CGGTTGGAGC GCATCGACAT TGACGGCGTC CTGGAGAGCG GCTATATAGA GGAACTTATT
GAAGACGCTC CCTTTTCCCC CGTACCGCAG CTTAACCGCA CGGAAAGGCC GGACAAACTG
GTTGCCGATT TATTGGAAGG CAGGATAGGT GTTCTGACCG ACGGAACGCC CTTCGCCCTG
GTCCTGCCAG GCAGCCTGGT ATCCCAGTTG CATGCCCCCG ACGATTATTA TGAGCGCTGG
CCTTTAAGCA TGGGGATCCG CCTTTTTCGC TTTTTCGGCC TGTTTATCGC CCTGCTGCTG
CCGTCCGTCT ATGTGGCCTG GACTTCCTAC CACCAGGAAA TGATACCCAC GCCCCTGGCG
ATATCCATCG CCGCCCAGCG TGAGCTGGTG CCCTACCCGG CCGTTGTTGA AGCTTTAATC
ATGCAGGTGC TTTTTGAAAT TCTCATCGAA GCCGGCATCA GGCTCCCCCG GGCCATAGGT
ACGGCCATCA GCATCGTCGG GGCCCTGGTT ATCGGCGAAG CGGCCGTCCG GGCGGGACTG
ATGTCTGCGG CGATGGTTAT TGTAATTTCA GCTACGGCCA TCGCCTCCTT CACCATACCC
ACTTTCGGTT TGAGCCAGGC TGTGAGGATG CTCCGCCTGC CAATGATCTT CCTGGCCGGT
GTCTTAGGCC TGCTGGGTAT TTTTGCCGGC CTAATGGCGC TCTTAATCCA CCTGGTGAGC
CTGAGAAATT TCGGCGAGCC CTATCTAAGC CCCCTGGCGC CCTTTATCTG GGAAGGGCAT
AAAGACCTGG TAGAACGGGT GCCCTGGTGG GCCATGCACT GGCGGCCTGT ACTACCCGGC
CGGCAAGATT TGCGACGCAT CAAACCGGGC CTGCGACCCT CTACCACGGC CCGGGAGAGA
AAGCCCGGCG AAGAACTGGA GACGTACCTG GGCGAAGAAG CCGGGAAAAG TATTACTACC
GTTACAACCC CGAAAAAAGG GCGAAAAAAA AGAAAAAAAG GGTGGATAAA GATTTAG
 
Protein sequence
MLRLWRWLGR KKTSGGQLPG DDSRAVNHRL EVNIAYIKKA FGRSEDLVIR EIELPGRKLA 
LVYVETLIDR DVVQRDILRS LLALQTIPLP EDEAGFNRLL RARLTIGDLQ EEQLWSKIIT
GLLDGKAVLI GEGFSRCLLL SVEGWEKRPV EEPVNEVSIR GPREGFAENL PTNISLIRRR
LRAPELRFET MNLGRRTHTK VVICYLEGLA LPGVLEELRR RLERIDIDGV LESGYIEELI
EDAPFSPVPQ LNRTERPDKL VADLLEGRIG VLTDGTPFAL VLPGSLVSQL HAPDDYYERW
PLSMGIRLFR FFGLFIALLL PSVYVAWTSY HQEMIPTPLA ISIAAQRELV PYPAVVEALI
MQVLFEILIE AGIRLPRAIG TAISIVGALV IGEAAVRAGL MSAAMVIVIS ATAIASFTIP
TFGLSQAVRM LRLPMIFLAG VLGLLGIFAG LMALLIHLVS LRNFGEPYLS PLAPFIWEGH
KDLVERVPWW AMHWRPVLPG RQDLRRIKPG LRPSTTARER KPGEELETYL GEEAGKSITT
VTTPKKGRKK RKKGWIKI