Gene Moth_1484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1484 
Symbol 
ID3832365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1531169 
End bp1532503 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content53% 
IMG OID637829417 
Productextracellular solute-binding protein 
Protein accessionYP_430337 
Protein GI83590328 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000980441 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAACT GTAAGTGGCG TAAACCCCTG GGAATTGCCC TGCTAGTGGC AACCATGGCG 
AGTAGCCTGG TGACAGGTTG CGGTCCGGGA AACCCGGCGA AAGGTACCGG CCAGGAGAAG
ACAGGCGGTA AGGTAACAAC AATTGAATAC TGGCATGTTA ACTCGGAAAA CTTTGGCGGG
CAGACCGTCC GCGATCTGGT GCAGAAGTTT AACGAACAGC ACCCGGACAT TAAAGTAGTT
GAAAAGTTTC AACCCAACAT GTACCTGGGG TTAATGCAAA ATTTACAGGC CGCCCTGGCG
GGGGGCCATC CTCCGGCAGT AGCCCAGATT GGCTATAATT ACCTTGACTA CGCCACCGCC
AACTTGCCCC ACCTGCCGGT AGAAGACGCC GCTAAAAAGG ATCCCGAGGG GCAGGCTTTT
CTAAATAACT ATCTGCCCAA TATCTTAAAC CTGGGCCGGG TTAACGGCAA ACTGGAAGGT
ATGCCCTATT CCATCAGCAA CCCTGTCCTA TATTACAACG CCGACATGTT TAAAGCAGCC
GGCCTGGATC CTCAAAATCC ACCTAAGACC TGGGCCGAGG TACGGGATAT GGCCAGGATA
ATCAAAGAGA AAACCGGCAA CTACGGCCTT TATGTGCAGG AGCCTTCCGA CAACTGGGCC
CAGCAGGCCA TGATGGAGTC CAATGGCGCT CAGGTACTGA CCAGGACGGG AGGTAAGGCC
AGCGCTACCT TTGATAGCCC CGAAGCCATA GAAGCTTACC AGTTGATGGC CGACATGGTT
TTAAAGGATA AGACGGCCTT GCACGCCACC TGGGAGGAAG GTACCCAGGC CTTTATTACG
GGAAAAGTGG GAATGTATGT TACAACTATT GCCAGGAGAA ATTATATTGA AACTTCTTCT
AAGTTTAAGG TTTTAGCGGC TCCTTTTCCA ACCTTCGGTA ACAAACCGCG GCGGGTCCCG
GCCGGAGGTA ATGCCCTGTT TATCTTCGCT AAAGATCCTG ACCAGCAAAA GGCTGCCTGG
GAGTTCATCA AGTACCTGGA ATCCCCCGAG GCTTTAACAA CCTGGACCAA AGGTACTGGC
TATCTGCCTC CCCGGAAAGA TGTGACCGAA GATCCCAACT ACCTGAAACC TTTCATGGAC
CAGAATCCGT TAATGAAGCC GGCGGCAGCT CAGCTTCCCG ACGCTGTTCC CTGGGTAAGT
TTTCCCGGCA ATAACGGTCT TCAGGCCGAA CAGATCCTCC TGGACGCCAG GGATGCCATC
CTCGGCGGTC GCCAGTCGGC AGCAGAAGCC CTGAAGGAAG CCGTGGCTAA GGTAAATAAA
TTAATCGGCA ATTAA
 
Protein sequence
MINCKWRKPL GIALLVATMA SSLVTGCGPG NPAKGTGQEK TGGKVTTIEY WHVNSENFGG 
QTVRDLVQKF NEQHPDIKVV EKFQPNMYLG LMQNLQAALA GGHPPAVAQI GYNYLDYATA
NLPHLPVEDA AKKDPEGQAF LNNYLPNILN LGRVNGKLEG MPYSISNPVL YYNADMFKAA
GLDPQNPPKT WAEVRDMARI IKEKTGNYGL YVQEPSDNWA QQAMMESNGA QVLTRTGGKA
SATFDSPEAI EAYQLMADMV LKDKTALHAT WEEGTQAFIT GKVGMYVTTI ARRNYIETSS
KFKVLAAPFP TFGNKPRRVP AGGNALFIFA KDPDQQKAAW EFIKYLESPE ALTTWTKGTG
YLPPRKDVTE DPNYLKPFMD QNPLMKPAAA QLPDAVPWVS FPGNNGLQAE QILLDARDAI
LGGRQSAAEA LKEAVAKVNK LIGN