Gene Moth_0013 details


Gene Information

Locus tag: Moth_0013
Symbol:
ID: 3831885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 13155
End bp: 14543
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 45%
IMG OID: 637827940
Product: PTS fructose IIC component
Protein accession: YP_428896
Protein GI: 83588887
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1299] Phosphotransferase system, fructose-specific IIC component
        [COG1445] Phosphotransferase system fructose-specific component IIB
TIGRFAM ID: [TIGR00829] PTS system, fructose-specific, IIB component
            [TIGR01427] PTS system, fructose subfamily, IIC component
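
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default, that the
    final codon is a stop codon that does not contribute a residue.
    """
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length_bp = gene_length(13155, 14543)
print(length_bp, "bp")                   # 1389 bp
print(protein_length(length_bp), "aa")   # 462 aa
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.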


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.991255
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000838432
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAAAAAAA TAGTTGCTGT TACTTCCTGC CCGACAGGAA TAGCCCACAC CTATATGGCA 
GCGGAAGCTT TGCAAAAGGC AGCTAAAGAA CAGGGAGTGG CGATAAAAGT AGAAACCCGT
GGGTCTATTG GGGTTGAAAA TGAACTGACG CCAGCAGATA TCGAAGAGGC CGTGGCAGTT
ATTATAGCTG CAGATGCTAA AGTAGATGAG GAAAAATTTC AAGGGAAGCC CATAGTTAGG
TCTTCAACGG GAGAAGCTAT TAAAAATGCT AAAACTTTGA TAGACAAAAC TTTAAGCTTG
GAGGGTAAAG CGTCCTCAAG GCAAATTACT GATTTTATTC GTGAGGTCGA GCAGCGCAAA
AGTGAGCGGA GGTCCCAGGC TACTGGTTTT TATAAACACT TAATGACAGG CGTATCGTAT
ATGATACCCT TTGTGGTAGC AGGTGGAATA ATCATTGCTC TTTCCTTCAT ATTTGGTATC
GAGGCCTTTA AGCAAGAGGG TACTTTAGCC GCCAACCTTA TGCGTATTGG TGGCGGTTCA
GCTTTTGCCT TAATGGTGCC AATACTGGCA GGGTATATTG CATTTTCCAT TGCTGATCGG
CCGGGCCTGG CTCCAGGAAT TATAGGAGGT ATGCTGGCTA CTCAAATGGG GGCAGGTTTT
CTGGGAGGAA TTGTTGCTGG ATTTCTGGCT GGTTATGTGG CCAAATTTTT ACGAGATAAT
ATTAAGCTCC CTGCGGGTCT GGAAGGATTA AAACCGGTAT TAATAATTCC ACTAATTTCT
ACTTTAATAG TGGGTTTGTT GTTAATCTAC GTTATAGGAA CTCCCGTAAA AGTTATAATG
GATGGTCTTG AACATTGGTT GACGTCAATG AGCAGAGGCA ATGCTGTTAT TTTAGGCTTT
ATTCTAGGTG CTATGATGGC CTTAGATATG GGAGGACCGG TAAACAAAGC GGCTTATACC
TTTGCGGTTG GGTTGCTTGG TAGCAATATT TATGAGCCCC AGGCTGCGGT TATGGCTGCC
GGCATGACGC CTCCTTTAGG TTTAGCTCTA GCTACTTTGC TGTTCCCTAA AAAGTTTACT
AGCGAAGAAA GAGAGGCTGG TAAGGCAGCA GCAGTTTTAG GGATTTCTTT TATTACCGAA
GGCGCTATTC CCTTTGCTGC ATCTGATCCT TTCAGGGTGA TTCCATCTAT TGTGGCAGGT
TCAGCCGTAG CTGGTGCTCT CTCCATGGCT TTTAATGCAA CCCTGAGGGC GCCACATGGA
GGCATTTTCG TCCTTGCCAT CCCTAATGCA GTAGGGCACT TAGGATTATA TAGCCTGTCT
ATTGCCATTG GTACCCTCGT CACGGCTTTA ATGGTGTCAC TTTTGAAACC CAACAAAGAA
ATAAAATAA
 
Protein sequence
MKKIVAVTSC PTGIAHTYMA AEALQKAAKE QGVAIKVETR GSIGVENELT PADIEEAVAV 
IIAADAKVDE EKFQGKPIVR SSTGEAIKNA KTLIDKTLSL EGKASSRQIT DFIREVEQRK
SERRSQATGF YKHLMTGVSY MIPFVVAGGI IIALSFIFGI EAFKQEGTLA ANLMRIGGGS
AFALMVPILA GYIAFSIADR PGLAPGIIGG MLATQMGAGF LGGIVAGFLA GYVAKFLRDN
IKLPAGLEGL KPVLIIPLIS TLIVGLLLIY VIGTPVKVIM DGLEHWLTSM SRGNAVILGF
ILGAMMALDM GGPVNKAAYT FAVGLLGSNI YEPQAAVMAA GMTPPLGLAL ATLLFPKKFT
SEEREAGKAA AVLGISFITE GAIPFAASDP FRVIPSIVAG SAVAGALSMA FNATLRAPHG
GIFVLAIPNA VGHLGLYSLS IAIGTLVTAL MVSLLKPNKE IK
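
The relationship between the two sequences above can be illustrated with a short translation sketch. It is not the tool used to produce this record. It assumes the gene sequence is given 5' to 3' in the coding orientation, and it uses the standard codon assignments, which translation table 11 shares for internal codons. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Standard codon assignments, ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from the record above.
FIRST_LINE = "ATGAAAAAAA TAGTTGCTGT TACTTCCTGC CCGACAGGAA TAGCCCACAC CTATATGGCA"
print(translate(FIRST_LINE))  # MKKIVAVTSCPTGIAHTYMA
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.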