Gene Moth_1460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1460 
Symbol 
ID3831346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1508967 
End bp1510058 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content52% 
IMG OID637829393 
Productspore germination protein 
Protein accessionYP_430313 
Protein GI83590304 
COG category 
COG ID 
TIGRFAM ID[TIGR00912] spore germination protein (amino acid permease) 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGAAA AAGGTAGGAT TTCTCTCTGG CAGTTTTTTG TGCTGGTTAC CGGGTACCTT 
ATTGGTACCT CTACCTTGAT CGTCCCTGTC GGGCCGGCCA AACAGGACGC CTGGATATCT
TACCTGCTGG CGGGAACCCT GGGACTGGGC GCGGCCTATT GCTATACAGC CTTAGCCCGG
CGCTTCCCGC GGGAAACCCC GGTGCAGTAT GCCACCAGGG TCCTGGGGCG CTGGCCGGGG
ACCTTTTGCA ATCTTATTTT TCTCTGGTAC GCCCTGTACT TGGCCGCCCT GGTGCTGCGT
AACGTTATTG AATTATATAA GATGGCGATC TTGCCCCAGA CGCCAATGGT TTTAGTCGCC
GGAATTTTTG CCGGCCTGGC TGCCTACGCC ATTCGCATAG GGATAGAAAT CCCGGCGCGA
TTAAGCGAGC TCTTAATTCC TTTTGTTATT GTCGCCATTT TGGTTCTTAC CGCCCTGGCG
GAAGCTGTCC CGGGATTAGC CCACTGGGAA GCCCTCCTCC CGGTGATGGA AAGGGGGCCC
CTGCCGGTCT TACGAGGGGT CTACCCGGCC TTTGTTTTTC CCTTCGGCGA AGCCGTCTTT
TTTTTGGTCA TTTTACCTTT TTTAACCGAA CCCAGGAGAA ACTTTCCCCC CTTCGCCCTG
GCGGTAACCG TGGCAACATT GCTCACCACC CTGGTCCTGG TGCGTAACCT CATCGTCCTG
GGTCCTTCCG AAACGGCGCG GATAAATTTC CCCAGCCTGA TAGCCATTCA AATGATAAAT
ATCGGTGACT TTTTGCAGCG GCTGGAACCG GTCATTATTT TCGTCTGGAG TTTTACTATA
TTGCTGAAAC TGACTGTCGT CTACTATGTT TTTACCCTCG GCACAGCCCA GGTTTTCGGC
CTCCGGGATT ACCGTCCCCT GGTGCTGCCA GCCGGACTGT TAATAACCTT TTTAGCTATG
AGCCTCTATG AGAATTTTTC CCAGATGTTA ATTTTTGCCG GACGGGCCTT CCCCTTTTAT
TTTCTCCCTG CCTACCTGTT CTACCCCGCT TTATTGCTCC TGGTGGCTAA AATAAGAAAA
ATTAAAGGGT AA
 
Protein sequence
MIEKGRISLW QFFVLVTGYL IGTSTLIVPV GPAKQDAWIS YLLAGTLGLG AAYCYTALAR 
RFPRETPVQY ATRVLGRWPG TFCNLIFLWY ALYLAALVLR NVIELYKMAI LPQTPMVLVA
GIFAGLAAYA IRIGIEIPAR LSELLIPFVI VAILVLTALA EAVPGLAHWE ALLPVMERGP
LPVLRGVYPA FVFPFGEAVF FLVILPFLTE PRRNFPPFAL AVTVATLLTT LVLVRNLIVL
GPSETARINF PSLIAIQMIN IGDFLQRLEP VIIFVWSFTI LLKLTVVYYV FTLGTAQVFG
LRDYRPLVLP AGLLITFLAM SLYENFSQML IFAGRAFPFY FLPAYLFYPA LLLLVAKIRK
IKG