Gene Emin_1349 details


Gene Information

Locus tag: Emin_1349
Symbol:
ID: 6263637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1451200
End bp: 1452645
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 44%
IMG OID: 642611830
Product: PTS system, N-acetylglucosamine-specific IIBC subunit
Protein accession: YP_001876236
Protein GI: 187251754
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
        [COG1264] Phosphotransferase system IIB components
TIGRFAM ID: [TIGR00826] PTS system, glucose-like IIB component
            [TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component
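
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon; the function names are hypothetical and exist only for this illustration.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record: Start bp 1451200, End bp 1452645.
length = gene_length_bp(1451200, 1452645)
print(length, protein_length_aa(length))  # 1446 481
```

Both results agree with the listed Gene Length (1446 bp) and Protein Length (481 aa).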


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.00702227
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGAAAT ACTTTTCCGC CGTTACAAAA GCGTTTGTTA CTTTAGGCAA AGCCTTAATG 
CTCCCCATAG CCGTGCTTCC TATAGCGGGT TTATTATTAA GGCTGGGCCA GCCTGATTTG
CTTAATATAG CGTTCCTGGC TGATTCGGGG GACGCTATTT TTAAAAACCT GCCCACAATA
TTCGCGTTGG GCGTTTCCAT AGGTTTTGCC AAAGACAACC ACGGTGCCGC TCCGTTATCT
GCTTATGTGG GTTATGTTGT TTTAACAGCA GGGCTTAAAG TTTTAAATCC GGAAACTAAC
ATGGGCGTAT TCGGCGGTAT CATAATAGGT GTTATGGCGG GTTATTTTTA TAACAGATTT
CATGATATAC AGCTTCCTTC ATATTTAGCC TTTTTCGGCG GTAAAAGATT TGTGCCCATT
ATTACCGGCA CAAGCGCTGT TTTCCTGGCT TTAGCGGCCA GCGTTATTTG GCCGCCTGTT
GAACACGCTA TTAATTCTTT GGGCACTTGG ATTATAGGAT CTAAAGGCGT AGGGTTATTC
ATTTTCGGTT TCGCGAACAG GCTTTTAATA CCTTTAGGTT TACACCATGT TCTTAATAAT
TTGGTTTGGT TTTTATTCGG TAATTTTGAA GTTGTTAAAG AAGGCGCGGT TGTGCTTTAC
CAGGGTGATA TCGCCAGGTT CTTTGCGGGC GACCCTGCGG CGGGCAGTTT TATGGCGGGG
TTTTTCCCCG TTATGATGTT CGGCCTTCCG GCGGCTTGTT TCGCAATGAT GCTTACCGCT
AAAACCGCTA AAAGAAAAGC TACCGCCGGC ATTTTGCTTT CCATGGCTTT AACAAGCTTC
TTAACAGGTA TTACAGAGCC TATTGAATTT AGTTTTATGT TTCTTGCATT CCCGTTATAT
GTTTTACACG CTTTGCTCAC AGGGATATCA ATGGTTGTTA TGGATTTGAT GAACATTAAG
CTGGGTTTTA CTTTTTCAGC TGGTGCGTTT GACTTTGTTC TTAATTGGGG TAAGGCCACC
AATCCCGCCC GTTTCTTTAT AGTGGGCGGT GTGTATGCGG TACTTTATTT TGTCGTATTT
TATTTTGCTA TTAAGTTTTG GGATTTAAAA ACCCCGGGCA GGGAAGATGA CGTTGTTGAA
TCGGAAGAAA CCTGTGAACC GGGGCAGAAT TCTAAAGCTG TTGAGGCGGC CCCAACAAGC
GCGCCCGCCA GAGATACAAG GGGATATAAA TATATGACTG CTCTTGGCGG TAAAGAAAAC
CTTAAAGTTG TTGACGCGTG CGCCACAAGA CTGAGGCTTG AGGTTGTTGA TTCTTCCAAA
GTAAAAGACG TGGATTTAAA AGCTTTGGGC GCCAGGGGAG TTTTAAGACC GGGCGATGGG
CTTGTGCAGG TTATTATAGG GCCTGAGGCT GATTTAATAG CGGGCGAAAT AAGGAAGGAA
TTTTAA
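
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how one might strip that layout and compute a GC percentage; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAGAAAT ACTTTTCCGC CGTTACAAAA GCGTTTGTTA CTTTAGGCAA AGCCTTAATG"
fragment = clean_sequence(first_line)
print(len(fragment), round(gc_percent(fragment), 1))
```

Running the same two functions over the full gene sequence gives a figure that can be compared against the GC content listed in the record.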
 
Protein sequence
MKKYFSAVTK AFVTLGKALM LPIAVLPIAG LLLRLGQPDL LNIAFLADSG DAIFKNLPTI 
FALGVSIGFA KDNHGAAPLS AYVGYVVLTA GLKVLNPETN MGVFGGIIIG VMAGYFYNRF
HDIQLPSYLA FFGGKRFVPI ITGTSAVFLA LAASVIWPPV EHAINSLGTW IIGSKGVGLF
IFGFANRLLI PLGLHHVLNN LVWFLFGNFE VVKEGAVVLY QGDIARFFAG DPAAGSFMAG
FFPVMMFGLP AACFAMMLTA KTAKRKATAG ILLSMALTSF LTGITEPIEF SFMFLAFPLY
VLHALLTGIS MVVMDLMNIK LGFTFSAGAF DFVLNWGKAT NPARFFIVGG VYAVLYFVVF
YFAIKFWDLK TPGREDDVVE SEETCEPGQN SKAVEAAPTS APARDTRGYK YMTALGGKEN
LKVVDACATR LRLEVVDSSK VKDVDLKALG ARGVLRPGDG LVQVIIGPEA DLIAGEIRKE
F
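
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard NCBI ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. The names `CODON_TABLE` and `translate` are hypothetical, and unrecognised codons are rendered as `X` as an assumption of this example.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over BASES (T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop the block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base blocks of the gene sequence from the record.
print(translate("ATGAAGAAAT ACTTTTCCGC CGTTACAAAA"))  # MKKYFSAVTK
```

The output matches the first block of the protein sequence, and the final six bases of the gene (`TTTTAA`) translate to `F` followed by a stop codon, consistent with the last residue listed.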