Gene Moth_0895 details

Gene Information

Locus tag: Moth_0895
Symbol:
ID: 3831436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 928760
End bp: 930145
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 53%
IMG OID: 637828825
Product: gluconate transporter
Protein accession: YP_429755
Protein GI: 83589746
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG2610] H+/gluconate symporter and related permeases
TIGRFAM ID: [TIGR00791] gluconate transporter
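
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS is complete (it ends in a stop codon); the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 928760
END_BP = 930145


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1386 461
```

Under these assumptions the computed values agree with the listed gene length (1386 bp) and protein length (461 aa).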


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000000913428
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACCGG AACAGCTAGC TGCCGCAAAG ACGGTTATGG TACACGGTCC CATGCTGCTT 
GTGATAGTGG TCCTGGCGAT CCTGTTTATA ATTTTCGCTA CTGCCAGGCT AAAACTTCAT
CCGTTCCTCG CCCTTATCCT GGCGGCATAC GGCGTGGGTT TTCTCAGTAA GATGCCCGTC
GAGTTCGTCG GTTCCGTAGT TGCCCAGGGA TTTGGTAATT TAATGACGAA CATCGGCCTC
GTGATAGTCT TCGGGACGAT AATAGGCACC ATCATGGAAA AATCCGGAGC GGCCCTGAAA
ATGGCCGAGG TGGTGCTTAA TATAGTAGGG ATTAAAAGAT CGCCCCTGGC CATGAGCATA
ATAGGTTATA TCACCAGCAT ACCGGTTTTC TGTGATTCGG GGTACGTTAT TTTGACTCCC
CTCAATAAAG CCCTGGCCAG GAGGGCTGAG ATCCCTATGG CCGTAATGGC CGTCGCTTTG
TCTACCGGCC TTTACGCCAC CCACACGCTG GTACCGCCAA CGCCGGGGCC GATAGCTGCC
GCAGGAAACG TGGGCGCGGA CCTGGGGCTT GTCATCCTGA TAGGCGTGCT GGTATCGATA
CCGGCGGCTC TTACGGGACT CTGGTGGGCC TATAGGGTTG GTAAAAATAT CACGTCGGAA
GTAGACCAGA CCGGTCTGAG TTACGACGAA CTGAAAAAAC AGTTCCAGGA GCTACCCGGA
GCAGTAAAAT CCTTTTTGCC TATCGTGGTG CCGATCATAC TTATAGCCAT CGCTTCGGTG
GCCAAGTTTA CAAAGTATGC CGGCCCGGGG AACAATTTTA TTATATTTCT GGGCACGCCG
GTCAACGCCC TTATGATAGG CGTTCTGCTG TCATTTACGC TCCTCCCCAG GTTTGACGAA
GAGACGCTCA TGAACTGGGT GGGGCAGGGG ATTAAGGATT CAGCGATTAT TTTACTCATA
ACCGGCGCAG GCGGCTCTCT CGGGGCCGTG CTGTCAGCCA CTCCTATTTC GGATTATATT
AAGTCTTTAG CTGGAGGAAA CATTGCGGGG GGTCCCTTGG CTATTATCCT TGTATTTATC
ATAGCAGCCA TGCTCAAAAC CGCCCAGGGC TCCTCGACGG TGGCCCTTGT TACCACCTCG
AGCCTTATAG CTCCCCTGCT GCCGCAGCTA GGGTTGACGT CTCCAATGGA TCTTGTCCTG
ACGGTAATGG CAATAGGCGC CGGGGCTATG ACCGTCTCCC ATGTTAACGA CAGTTACTTC
TGGGTAGTGT CGCAGTTTTC GGGTTTGGAG GTCACCGATG CCTATAAGGC CCAGACAGCG
GCTACCCTGC TGGAAGGCCT GGTAACGATT GTGACCACGA TCGTGCTCTT CATGATATTC
CATTGA
 
Protein sequence
MTPEQLAAAK TVMVHGPMLL VIVVLAILFI IFATARLKLH PFLALILAAY GVGFLSKMPV 
EFVGSVVAQG FGNLMTNIGL VIVFGTIIGT IMEKSGAALK MAEVVLNIVG IKRSPLAMSI
IGYITSIPVF CDSGYVILTP LNKALARRAE IPMAVMAVAL STGLYATHTL VPPTPGPIAA
AGNVGADLGL VILIGVLVSI PAALTGLWWA YRVGKNITSE VDQTGLSYDE LKKQFQELPG
AVKSFLPIVV PIILIAIASV AKFTKYAGPG NNFIIFLGTP VNALMIGVLL SFTLLPRFDE
ETLMNWVGQG IKDSAIILLI TGAGGSLGAV LSATPISDYI KSLAGGNIAG GPLAIILVFI
IAAMLKTAQG SSTVALVTTS SLIAPLLPQL GLTSPMDLVL TVMAIGAGAM TVSHVNDSYF
WVVSQFSGLE VTDAYKAQTA ATLLEGLVTI VTTIVLFMIF H
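
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last blocks of the gene sequence. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in which codons may act as initiators); the `translate` helper and the table constants are illustrative names, not part of the record.

```python
from itertools import product

# Standard genetic code in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACACCGG AACAGCTAGC TGCCGCAAAG"))  # MTPEQLAAAK
# Final two codons of the gene sequence above (histidine, then stop).
print(translate("CATTGA"))  # H
```

The first result matches the opening block of the protein sequence (MTPEQLAAAK), and the final codon pair gives the terminal H followed by the TGA stop.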