Gene Moth_0728 details


Gene Information

Locus tag: Moth_0728
Symbol:
ID: 3831004
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 760006
End bp: 761391
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 42%
IMG OID: 637828659
Product: gluconate transporter
Protein accession: YP_429589
Protein GI: 83589580
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG2610] H+/gluconate symporter and related permeases
TIGRFAM ID: [TIGR00791] gluconate transporter
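
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 760006
END_BP = 761391
GENE_LENGTH_BP = 1386
PROTEIN_LENGTH_AA = 461


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 761391 - 760006 + 1 gives 1386 bp, and 1386 / 3 gives 462 codons, of which 461 encode amino acids.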


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000000179796
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.654044
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAGGCA TTTCCGGAAT TCAGATAATT ATTGGTCTGA TTGTTGGTAT CTTTGTGCTT 
GTATATTTAA TACTGAGGAC CAAAATTCAT GCTTTTCCTG CTCTTATCAT TGCGGCTTCC
GTAATCGGGT TAATTGGTGG TATGTCACCA TCTACAGGTG ATATTAACCT TGCTAAATCA
ATCACTACTG GCTTTGGTAA TACTTTAGCA AGTATCGGTC TCGTAATTGG TTTTGGCGTA
ATGATGGGAC GATTGCTGGA GGTATCCGGG GCTGCCGAAC GTATGGCTTA TACTTTCTTA
AAGTATCTAG GACGTGGAAA AGAAGAATGG GCGCTGGCCG CAACGGGATA TGTCATTTCT
ATTCCTATTT TCTGCGATTC GGGTTTTGTT ATTTTAACAC CCCTGGTTAA AGCGTTATCA
CGTAGAACCA AAAAATCCGT ACTTGCTCTT GGTGTCGCTC TGGCAGCCGG TCTAGTGGCT
ACCCATAGTG CAGTACCACC AACACCGGGA CCTCTGGCAG TAGCAGGCAT TTTTAAAGTT
GATGTAGGTA TGGTAATTAT TTCCGGGCTT ATATTTACCA TACCAATAAC TATAGCTGGG
GTTTTGTACG GCAAATGGTT GGGTAAAAAA ATATATCAAT TACCCAGTGA AGATGGTCAG
AGCTGGATAC GACCACCTTA CCAATCTTCT CAGATTGCCG AAGAAGCATT GCCTGAAAAT
GGCAACCTGC CTTCCGCTTT TATATCCTTT GCCCCTGTTG TTATTCCTTT AATTTTAATC
TTTGTCAATA CGTTGCTGAC AGCTATGAAA ATAAACCAAC TATGGGCGCG CTACCTAGTT
TTCCTGGGTA ATCCGGTAAT AGCGGTTGGT ATTGGTCTTA TTATTGCTAT TTATGGTCTG
GCTCCGAAAC TTTCTCGGTC TGAAGTACTA AAAAAGATGG AAGAAGGGGT TTCTTCAGCT
GGTATAATTA TCTTAATTAC AGGTGCTGGT GGCGCATTAG GCCAGGTGTT AAGGGACAGT
GGTGTCGGTA ATTATGTAGC TCAACTTATT GCTTCTAGCC CTCTGCCTCC ATTTTTGTTA
CCCTTTTTTG TTGCTACTTT TGTACGATTA GTCCAGGGAA GCGGTACAGT AGCCATGATT
ACCTCTGCTT CCATTACTGC ACCAATTTTG GCCAATCTTT CCGTTAATCC AATCATTGCT
GTTCAAGCAG CCAATTTAGG TTCATTGATA TATTCTTATT TCAATGACAG TTTTTTCTGG
GTAGTCAATA GATTTTTAGG TGTCGATGAC ATTAAGGAAC AAACACTGAC GTGGTCGGTT
CCGACTACAA TTGCTTGGGG CGTTTCTTTG ATTATGTTAT ACATTGCCAA CGCCATTTTA
AGCTAA
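
A GC fraction can be computed directly from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch with a hypothetical helper name. The record does not say how its GC content field was derived, so the result for the full gene sequence is not assumed to match it.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence: 10 of 20 bases are G or C.
print(gc_content("ATGGGAGGCA TTTCCGGAAT"))  # 0.5
```

Passing the whole gene sequence as a single string, whitespace included, works the same way.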
 
Protein sequence
MGGISGIQII IGLIVGIFVL VYLILRTKIH AFPALIIAAS VIGLIGGMSP STGDINLAKS 
ITTGFGNTLA SIGLVIGFGV MMGRLLEVSG AAERMAYTFL KYLGRGKEEW ALAATGYVIS
IPIFCDSGFV ILTPLVKALS RRTKKSVLAL GVALAAGLVA THSAVPPTPG PLAVAGIFKV
DVGMVIISGL IFTIPITIAG VLYGKWLGKK IYQLPSEDGQ SWIRPPYQSS QIAEEALPEN
GNLPSAFISF APVVIPLILI FVNTLLTAMK INQLWARYLV FLGNPVIAVG IGLIIAIYGL
APKLSRSEVL KKMEEGVSSA GIIILITGAG GALGQVLRDS GVGNYVAQLI ASSPLPPFLL
PFFVATFVRL VQGSGTVAMI TSASITAPIL ANLSVNPIIA VQAANLGSLI YSYFNDSFFW
VVNRFLGVDD IKEQTLTWSV PTTIAWGVSL IMLYIANAIL S