Gene Ent638_0037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_0037 
Symbol 
ID5110640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp40986 
End bp42311 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content56% 
IMG OID640490193 
ProductPTS system lactose/cellobiose family IIC subunit 
Protein accessionYP_001174778 
Protein GI146309704 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1455] Phosphotransferase system cellobiose-specific component IIC 
TIGRFAM ID[TIGR00359] phosphotransferase system, cellobiose specific, IIC component
[TIGR00410] PTS system, lactose/cellobiose family IIC component 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.908472 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.420759 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCGT TATATCAATC CATGGTTGCG GTGATTGAGC AGTCCATTAC CCCGCTGGCG 
GCCAAGCTCG GTCAGCAAAA GTACGTGATT GCGATCCGTG ACGGCTTTAC CGCCGCCCTG
CCGTTTATGA TCATCGGCTC GTTTATGCTG GTGTTCATCT TCCCGCCGTT CTCTGCGGAC
ACGACCAACA GTTTTGCGCG CGGCTGGCTT GATTTCTCCC AGACCTACCG CGAACAGCTG
ATGCTGCCGT TTAACCTCAG CATGGGCGTG ATGACCTTCT TTATTTCGGT GGGCATTGGT
GCAAGCCTGG GCCGCCAGTT TAATCTCGAT CCGGTGATGT CAGGCCTGCT GGCGTTTATG
GCATTCCTGC TGGTCGCCGC GCCGTATGCC GACGGTAAAA TCTCCACGCA GTACATGTCC
GGTCAGGGCA TTTTCACCGC GCTGATTACC GCTATTTACT CCACCCGCGT TTATGCGTGG
CTGAAGGAAA ACAAAGTGAC GATCCGTCTG CCGAAAGAAG TCCCAACCGG CGTGGCGCGT
TCCTTTGAAA TCCTGATCCC TGTGATGGTC GTTATCGGTA CGCTGCACCC GCTGAACCTG
TTCATCGAAG CGCAGACCGG CATGATTATC CCACAGGCGA TTATGCACCT GCTGGAGCCG
CTGGTTTCTG CATCGGATTC CCTGCCTGCC ATTCTGCTTT CCGTCCTGCT GTGCCAGATC
TTCTGGTTCG CGGGTATCCA CGGCTCGCTG ATTGTCACCG GCATTATGAA CCCGTTCTGG
ATGGCGAACC TGTCGGCAAA CCAGGCTGCA CTGGCGGCTG GCGCGGCGCT TCCACACGTT
TATCTGCAAG GTTTCTGGGA TCACTACCTG CTGATTGGCG GCGTGGGCTC AACTCTGCCG
CTGGCGTTCC TCCTGCTGCG TAGCCGTGTG GCGCACCTGC GCACTATCGG CAAAATGGGC
GTGGTGCCAA GCTTCTTTAA CATCAACGAA CCGATTCTGT TCGGCGCACC GATCATCATG
AACCCAATGT TGTTCCTCCC GTTCGTGTTC GTGCCGTTGA TTAACGCCTG CCTGGCGTAT
GGCGCAACCA AACTCGGTTG GATCGCACAA GTTGTCTCTC TGACGCCATG GACTACGCCT
GCCCCAATCG GTGCATCGTG GGCCGCCAAC TGGGCGTTTA GTCCGGTCGT GATGTGCGTT
ATTTGTATGG TGATGTCAGC AATCATGTAT CTGCCGTTCC TGCGTGCTTA CGAGCGTTCT
TTGATGAAAA ACGAAGAGCA AAAAGCCCAG GCGACCGTGG GTGCAGTTGA GACAGCAAGT
CAATAA
 
Protein sequence
MSSLYQSMVA VIEQSITPLA AKLGQQKYVI AIRDGFTAAL PFMIIGSFML VFIFPPFSAD 
TTNSFARGWL DFSQTYREQL MLPFNLSMGV MTFFISVGIG ASLGRQFNLD PVMSGLLAFM
AFLLVAAPYA DGKISTQYMS GQGIFTALIT AIYSTRVYAW LKENKVTIRL PKEVPTGVAR
SFEILIPVMV VIGTLHPLNL FIEAQTGMII PQAIMHLLEP LVSASDSLPA ILLSVLLCQI
FWFAGIHGSL IVTGIMNPFW MANLSANQAA LAAGAALPHV YLQGFWDHYL LIGGVGSTLP
LAFLLLRSRV AHLRTIGKMG VVPSFFNINE PILFGAPIIM NPMLFLPFVF VPLINACLAY
GATKLGWIAQ VVSLTPWTTP APIGASWAAN WAFSPVVMCV ICMVMSAIMY LPFLRAYERS
LMKNEEQKAQ ATVGAVETAS Q