Gene Moth_1590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1590 
Symbol 
ID3832736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1624610 
End bp1625905 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content50% 
IMG OID637829519 
Productmajor facilitator transporter 
Protein accessionYP_430439 
Protein GI83590430 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0171149 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.72938 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAATCC AGAACCCTGA GACCCCGTTC GGGCCCCGGG AAGTACCCAA ACCTTTTTTC 
GAGAATAAAT GGGTGCAATT GATTATTGCC ATGATCGGCA TGATCATGAT TGCCAACCTC
CAGTATGCCT GGACTCTCTT TGTGCCGGAA GTAGTAAAAG GTCTTAACTC AACGAAAGCT
GCAGTCCAGA TGGGGTTTTC CCTCTTTATC GCCTTTGAGA GCTGGGGCCA GCCTATTGCC
GGATACTTCA TGGATCGCTA CAGCCCCCGG GCTTTGCTGA CTGTAGCCGC CTTGATGATC
GGTATTGGAT GGGCCGGTAT GGGATTAGTC AAATCCCTTG GTGCTTTGTA TTTCTTATAT
AGTATGGCGG GTGTGGGTGC AGCCCTGATC TATAGCGGCT CCATTGCCTC GGGCGTGCGC
TGGTTTGAGG CCTCCAAGAG GGGAATGGCC TCCGGACTGG TAGCTGCCGC TTTCGGTTCC
GGGGCCGCCC TCTTTATCCC CTTTATCGCT ATCATATTGA AAAAGCAGGG TTATAATTCA
GCATTCGTAA CTACAGGCGT TATCCAGGGA ATTATCGCCT TGATTGCCGC CCAGTTCATG
CGTTTCCCTG CTAAACCTAA GACGAGTAGC TCCAGTACGG CGAAAGCCCA GGTTTCCGCC
AGCCAGCGTG ATTTTACTAC AGTGGAAATG TTTAAAACAG CTCATTTCTG GATTATTTAT
TTAATGTTCT TGTTTATTTG TACCGGCGGC ATGATTGTAA CGGCCCAGAC AAAACCCTTT
GGTACGGAGG CGGGTATAGC TGCCAGCATC ATTGTGACAG CAGCCACAAT CAATACCATT
GCCAATGGTG CCGGGCGGAT AATCTGGGGT ATGATTTCCG ACAAACTGGG GCGTTACCAG
ACAATGTTTG TGGCCTTTAC CATTAACGCC ATCGCCATGG CCCTGGTTCC TTTTATCGGC
CATAACGCCT TTATGTTTGT CTTTATCTTT GCCCTGATTA TGTTCACCTG GGGTGAACTA
TATTCCTTGT TCCCGGCCGT CAACGCCGAT ATTTTCGGAA CTACCTACGC CGCAACAAAT
TATGGTTTCA TTTACAGTGC CAAGGGTTTG AGCGGTATTG TTGGCGGCTT TGTGGCCGCC
CTGGTGGCCC AGATGAGCGG CTGGACACCG GTATTTTTAA CTGGTGCCGT AATGTCCCTG
CTGGCCGGTT TGGGTGCCCT TTTATTGCGG AGTATTCCTA AACCAGTCCC CTCAGGGATT
AAGAGCGGAA GCAGTGGTTT TTCCAGCCAG GCTTAA
 
Protein sequence
MSIQNPETPF GPREVPKPFF ENKWVQLIIA MIGMIMIANL QYAWTLFVPE VVKGLNSTKA 
AVQMGFSLFI AFESWGQPIA GYFMDRYSPR ALLTVAALMI GIGWAGMGLV KSLGALYFLY
SMAGVGAALI YSGSIASGVR WFEASKRGMA SGLVAAAFGS GAALFIPFIA IILKKQGYNS
AFVTTGVIQG IIALIAAQFM RFPAKPKTSS SSTAKAQVSA SQRDFTTVEM FKTAHFWIIY
LMFLFICTGG MIVTAQTKPF GTEAGIAASI IVTAATINTI ANGAGRIIWG MISDKLGRYQ
TMFVAFTINA IAMALVPFIG HNAFMFVFIF ALIMFTWGEL YSLFPAVNAD IFGTTYAATN
YGFIYSAKGL SGIVGGFVAA LVAQMSGWTP VFLTGAVMSL LAGLGALLLR SIPKPVPSGI
KSGSSGFSSQ A