Gene Moth_1257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1257 
Symbol 
ID3833052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1299608 
End bp1300996 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content49% 
IMG OID637829193 
Productmajor facilitator transporter 
Protein accessionYP_430114 
Protein GI83590105 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.375741 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00856521 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCAGTAG AAACATTGGG ACGCAGGGTT GAGAAAGAAG GCGACCTTAA GAAACACGTC 
CAGAGTTCCT GGATACCAAG ATCTGTGCAC CCATACTCCT GGGTTTCCCT GCTTGTCTGC
TGGGGTATCT GGGTAATCAA CGCCTTTGAT CGCGAGATTA TCTTACGCCT CGGGCCCAGT
ATTACCGAAG AGTTTCACCT TTCCCCGGAA CAATGGGGCA ATATGGTGGC CCTCATTATG
CTGGCCCTGG CTGTGTTGGA TATACCGGGC AGTATTATGA GCGATCGCTA CGGTTCCGGC
TGGAAACGGG CCCGTTTTCA GGTGCCAATT GTAATCGGGT ATACAGTTTT GTCGTTCCTA
TCAGGTTTAC GTGCCCTTAG TGCCCAGTTG AGCCATTTTA TTGCCTGGCG GGTTGGAGTC
AACCTGGGTG CCGGCTGGGG CGAACCGGTT GGCGTCAGTA ATACGGCCGA GTGGTGGCCG
GTAGAAAACC GCGGTTTTGC GCTGGGTGTC CATCATACAG GTTATCCCAT TGGTGCTTTA
TTAAGTGGCG TTGTCGCCAG CTATGTACTG AGTACCTTTG GAGCCGAAAA TTGGCGTTAT
AGCTTCTTTT TCGCCATTAT TGCCGTCCCA ATTATGCTCT TTTGGCTGTG GTATTCAACC
CCGGAACGGG TAGATACCCT ATATAAGGAT ATTGAAGCTA AAGGTCTAAC TAAACCGGAA
CTCGATGTAG GGGTCAACGT AGGTAAAGGA CAGGGTATGA ATGTTTTTAT TAAAACCCTT
AAAAATAAAA ATGTCTCTTT AACTGCCGGG AATACCCTTC TAACCCAGAT TGTTTATATG
GGCATTAATG TAGTCTTAAC TCCTTATCTC CATTATGTAG TCGGGTTCTC CGTAGCCGCT
TCGGCAGGAT TGAGTATTAT CTTTACCCTG ACTGGCGCCT TCGGGCAGAT CCTCTGGCCC
TGGCTGTCAG ACTACCTGGG TCGAAAATGG ACCCTGGTTG TCTGCGGCTT ATGGATGAGC
GCCGGTATCG CTGCCTTCTA TTTTGCCACC AATATGAGTA AACTCGTATT AATCCAGTTA
CTTTTCGGTG TTGTTTCCAA TGCTGTTTGG CCAATTTACT ATGCCATGGC CTCCGACTCG
GCCGAAAAGG CTGCTACCTC TACTGCCAAT GGCATTATCA CTACGGCCAT GTTTATTGGT
GGCGGCATCT CCCCGGTATT AATGGGTTGG TTGATTGGCC TCGGCGGTGG TTGGAATAGC
CCGACAGGTT ATATCTATAC CTTCTTTGCC ATGGCCGCCT GCGCCCTTAT GGGAGTAGTA
TTACAATTGT TTACAGTTGA AAAAGCAGGT ATTTTTGCCA AATCGGATGA ATCTATCTTC
GTTAAGTAA
 
Protein sequence
MAVETLGRRV EKEGDLKKHV QSSWIPRSVH PYSWVSLLVC WGIWVINAFD REIILRLGPS 
ITEEFHLSPE QWGNMVALIM LALAVLDIPG SIMSDRYGSG WKRARFQVPI VIGYTVLSFL
SGLRALSAQL SHFIAWRVGV NLGAGWGEPV GVSNTAEWWP VENRGFALGV HHTGYPIGAL
LSGVVASYVL STFGAENWRY SFFFAIIAVP IMLFWLWYST PERVDTLYKD IEAKGLTKPE
LDVGVNVGKG QGMNVFIKTL KNKNVSLTAG NTLLTQIVYM GINVVLTPYL HYVVGFSVAA
SAGLSIIFTL TGAFGQILWP WLSDYLGRKW TLVVCGLWMS AGIAAFYFAT NMSKLVLIQL
LFGVVSNAVW PIYYAMASDS AEKAATSTAN GIITTAMFIG GGISPVLMGW LIGLGGGWNS
PTGYIYTFFA MAACALMGVV LQLFTVEKAG IFAKSDESIF VK