Gene Ent638_3964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3964 
Symbol 
ID5114684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4294762 
End bp4296573 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content56% 
IMG OID640494178 
ProductPTS system, beta-glucoside-specific IIABC subunit 
Protein accessionYP_001178670 
Protein GI146313596 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG2190] Phosphotransferase system IIA components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR00830] PTS system, glucose subfamily, IIA component
[TIGR01995] PTS system, beta-glucoside-specific IIABC component 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.168346 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCATT TGCAGACAGC ACTGGACGTC ATCGCCCATG TGGGTGGCGC TGAGAATATC 
GAACATATCG AACATTGTTC TACCCGTTTG CGGTTGAGCC TGTATGACAA CAGCAAAGTG
AATGAAACCG CGCTGGCGAA AATTGACGGC GTGCTGGGCG TGCGCGTCAA CGTACAGTGC
CAGGTGATTA TCGGACATGA AGTGGTTCAG GTTTTTGAGG CGGTGCGTTC TCTGGTCGGC
ACGACGCAGG GAAGTGCCCA CCCACACGCC AATAAACCCA ACCGCTGGGC GCAAGCCGTT
GATTTTGTGA TCAGCGTTTT CCAGCCTTTG GTTCCCGCCA TTGCTGGCGG CGGCGTACTG
AAATCACTGC TGCTGTTGCT GGATGTCGTG GGCTGGCTCA GCCGCGACAG CTCCACGTAC
AAAGTGCTGG ATAATATCGG CTCTGCCCCA CTCTATTTCC TGCCGATCCT GGTGGCTATC
ACCACCGCAA CCAAGCTCAA GGTCAACGTG CTGGTTGCCG TCTCTGCCGT TGCGGTCATG
GTGTTACCGG CGATGACCAA ACAGCTGGCC GATGGCGCAG AGTTTATGTC CTTCGATCTG
CGCAACGTGG CCTACGCCTC GCAGGTTTTC CCGGCCATTC TGTGCGTGCT GTTTTATGCC
CAGACGGAAA AGTTCTTCAA CCGTTACTCA CCCGGCGCGC TGCGCATTTT CCTTTCGCCA
ATGCTGTCGT TGCTGGTCAC CGTTCCCGTC ACGCTGCTGA TTTTGGGGCC TTTGGGCTAT
GAACTCGGCG CGGGCCTGGC AAAGGTGATT CTGTGGCTTT ACGGGAAATT AGGGTTTGTG
GCGACAGGAT TACTCGCCGC CGCCCTGCCG TTTATGGTCG CATCCGGAAT GCACAAACCG
ATGCTGCCCT ACGCCGTGGC ATCCATGAGC CAGTTTGGCC GCGAATTGCT GTATCTCCCC
GCGTCGCTGG CGCACAACAT TGCGGAATCG GGCGCGTGCC TGGCCATTGC GCTGAAAAGC
AAAGACAAAG TGCTGAAATC GACCGCGTTT TCGGCGGGCA TTTCCGCGCT GTTTGGCATT
ACGGAACCCG CGCTTTACGG CGTCACGCTG CTGAATAAAA AAGCGCTCTA CAGCGTGATC
CTCGGCAGCA TCGTAGGGGG CGCTTTTATC GGCTGGATGG CGGTTGAAGC CTTTGCGATT
GTAGGGCCAG GCCTCGCCAG TATCTCCATG TTCGTCTCGC CAGATAACCC ACTGAATATC
CTGTGGGCGT TTGCCGGTGC GGGTCTGTCA TTCGCCATCG CCTTTATCAG CGCCCTGCTG
TTGTGGCGCG ACAAAGTGAC TGAGCAAACC GAAGAATTGA CGTTCACCCG TCCGATAGAA
GGGCAAATCA TTGCGCTGGA AAACGTGAAT GACGATGTTT TCTCGCGCAA AATCATGGGT
GACGGTATCG CGATTGTCCC TTCTCAGGGC GTGCTGCGTG CCCCGGCGGA CGGCACCATC
ATTAACGTGT TTGAGAGCGG CCACGCGCTG AGTCTTCTCA CCGACGCGGG TGTGGAACTG
ATTTTCCATA TCGGGATCGA CACCATCAAG CTGCAAGGCG AAGGGTTTTC CCCAAAAGTG
CAGGAAGGCC AACACGTCAA AAGCGGCGAA ACGCTGATTG AATTTTCTCT TGATACCCTC
ACCGCAGCAG GTCTCGATCC GGTGGTGATC ATGGTGGTCA CTAACGGCGA ACGTTTCTCG
CTCACGCCAC AAAGCCACAA CGACAACAAT CCAAATCCTC ACATCATCAT GACGCTAAAG
GAGTCCGTGT AA
 
Protein sequence
MNHLQTALDV IAHVGGAENI EHIEHCSTRL RLSLYDNSKV NETALAKIDG VLGVRVNVQC 
QVIIGHEVVQ VFEAVRSLVG TTQGSAHPHA NKPNRWAQAV DFVISVFQPL VPAIAGGGVL
KSLLLLLDVV GWLSRDSSTY KVLDNIGSAP LYFLPILVAI TTATKLKVNV LVAVSAVAVM
VLPAMTKQLA DGAEFMSFDL RNVAYASQVF PAILCVLFYA QTEKFFNRYS PGALRIFLSP
MLSLLVTVPV TLLILGPLGY ELGAGLAKVI LWLYGKLGFV ATGLLAAALP FMVASGMHKP
MLPYAVASMS QFGRELLYLP ASLAHNIAES GACLAIALKS KDKVLKSTAF SAGISALFGI
TEPALYGVTL LNKKALYSVI LGSIVGGAFI GWMAVEAFAI VGPGLASISM FVSPDNPLNI
LWAFAGAGLS FAIAFISALL LWRDKVTEQT EELTFTRPIE GQIIALENVN DDVFSRKIMG
DGIAIVPSQG VLRAPADGTI INVFESGHAL SLLTDAGVEL IFHIGIDTIK LQGEGFSPKV
QEGQHVKSGE TLIEFSLDTL TAAGLDPVVI MVVTNGERFS LTPQSHNDNN PNPHIIMTLK
ESV