Gene Moth_1012 details

Gene Information

Locus tag: Moth_1012
Symbol:
ID: 3833315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1040373
End bp: 1041452
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 48%
IMG OID: 637828940
Product: spore germination protein
Protein accession: YP_429869
Protein GI: 83589860
COG category:
COG ID:
TIGRFAM ID: [TIGR00912] spore germination protein (amino acid permease)
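
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon. The record does not state either convention, although the listed values are consistent with both. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1040373
end_bp = 1041452

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1080
print(protein_length_aa)  # 359
```

Both results match the Gene Length (1080 bp) and Protein Length (359 aa) fields listed in the record.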


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000935312
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGCGGGAAA GAATTGGAAC GAACGAAGCC ACCTTTTTAA TTGTTGCGGC AATGATTGAA 
GTGGGGACGT TAAAAGGGGC CAGGAATATT GTTGAAAAAG TCGGGGTGGA CACCTGGCTG
GTATCTCCCC TGGAAACTAT ATTCAGCCTC GGAGCTATTT ACCTCCTGAC CGCGCTTGTT
ATGAAATTTC CTGACCTGGA CCTAGTTGGT TTTAGCCGCC GCCTGGTAGG CAAGTGGCTG
GCCTGGCTTT TAGGTTTAAT AGTTTTAGTT TACTGGGTTG GTTTAACGGC CGAGGTCGGC
CGGGTAACTG CTGATACTAT TAAGAGTTCT TTGTTGTCGC ACACACCCGA TGCAGTAGTG
TTATCTTCTT ACCTGCTGGT AGCGGCCTAC CTGGCTGGAA AGGGTTTAGA GCCCCTGGCA
CGTGCTTCTA TCATCATTGT TATTTTTACT TTACCCATCA CCTTGCTTCT CTTTGCCCTG
GTTATACCGC GAATCCATCT GGACAACTTC TTACCCATCT TGCCCCATGG ACCCTGGCCG
GTGATAAAGC TGGCTTTATG GCGGATTAGT AACGCTGAAG AGATGAGTCT TTTCCTTATC
CTGGTTCCTT TTTTAAAGGA ACCCCGGCGA GCCTGGCGTG CCGCCAGCTA TGGCTTTTTA
ACAGTCATGG CAGTAGTGAT CACCATTATA ACTACCTGTC AGGGGGTTCT GGGGGTAGAT
CAACTGAAGT ATACGCTGAT TCCAGGTTTA ACTGTGACGC AACTGGCAGA ATTCGCTGGA
GCCTTTATCG AACGGATTAG CCTGATATTC ATTTCTGTAT GGATTATTTT AGTCTTTCCT
ACCGCCTCGG CTCTCCTCTG GGCAAGCTCA TATTTGCTAG GCCGCCTCTT AAACTTAAAA
GACTATAAAA TGTTGGCTTT TTACCAGTTA CCCGTAGTTT ATTACTTAGC CTGGCGGCCC
GGTAATCCCT TTGAAGTTAA AAGCTTCTTT TTCTTTCTTC AACCCCTGGG CCTGGTGGTA
TTGGTAGGCA TCCCCTCCTT ACTTTATCTT GTTGCCCTTT TTCGTTGTCG CGCTCGGTGA
 
Protein sequence
MRERIGTNEA TFLIVAAMIE VGTLKGARNI VEKVGVDTWL VSPLETIFSL GAIYLLTALV 
MKFPDLDLVG FSRRLVGKWL AWLLGLIVLV YWVGLTAEVG RVTADTIKSS LLSHTPDAVV
LSSYLLVAAY LAGKGLEPLA RASIIIVIFT LPITLLLFAL VIPRIHLDNF LPILPHGPWP
VIKLALWRIS NAEEMSLFLI LVPFLKEPRR AWRAASYGFL TVMAVVITII TTCQGVLGVD
QLKYTLIPGL TVTQLAEFAG AFIERISLIF ISVWIILVFP TASALLWASS YLLGRLLNLK
DYKMLAFYQL PVVYYLAWRP GNPFEVKSFF FFLQPLGLVV LVGIPSLLYL VALFRCRAR
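
The protein sequence corresponds to the gene sequence translated with translation table 11, in which TTG can act as a start codon and is then read as methionine. The following is a minimal Python sketch of that translation, not the tool used to produce this record. It assumes the input is a complete coding sequence on the coding strand whose first codon is a valid start codon, and it simplifies by always emitting M for the first codon. The function name `translate_cds` and the table-building code are illustrative.

```python
# Codon assignments for elongation in translation table 11 are the same as
# in the standard genetic code; they are built here in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in the record.
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")

    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]

    # Assumption: the first codon is a start codon (here TTG), which is
    # translated as methionine regardless of its usual meaning.
    protein = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCGGGAAA GAATTGGAAC GAACGAAGCC"))  # MRERIGTNEA
```

The printed fragment matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same function should reproduce the rest of the protein under the stated assumptions.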