Gene Ent638_1707 details

Gene Information

Locus tag: Ent638_1707
Symbol:
ID: 5112446
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 1856838
End bp: 1858196
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 51%
IMG OID: 640491896
Product: PTS system N,N'-diacetylchitobiose-specific transporter subunit IIC
Protein accession: YP_001176437
Protein GI: 146311363
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1455] Phosphotransferase system cellobiose-specific component IIC
TIGRFAM ID: [TIGR00359] phosphotransferase system, cellobiose specific, IIC component
            [TIGR00410] PTS system, lactose/cellobiose family IIC component
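
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes 1-based inclusive coordinates and a gene length that includes one terminal stop codon; the record states neither convention, and the function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the gene length counts one terminal stop codon.
    """
    span = end_bp - start_bp + 1                   # bases covered by the coordinates
    codons, remainder = divmod(gene_length_bp, 3)  # whole codons and leftover bases
    return (
        span == gene_length_bp
        and remainder == 0
        and codons - 1 == protein_length_aa        # drop the stop codon
    )


# Values taken from the record above.
print(check_cds_lengths(1856838, 1858196, 1359, 452))  # prints True
```

With the values listed, the coordinates span 1359 bases, which is 453 codons: 452 amino acids plus one stop codon.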


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAGAG TCATCAATTC ACTTGAAAAG GTTCTCCTTC CTTTTGCTGT TAAAATAGGA 
AAGCAGCCGC ATGTAAATGC CATCAAAAAC GGTTTTATTA AAGTTATGCC GTTAACGTTG
GCTGGGGCCA TGTTCGTTTT AATTAATAAC GTTTTTCTGA GTTTCGGTGA AGGTTCTTTC
TTCTATTCAT TGGGTATTCG CTTAGACGCT TCAACCATTG AAACCCTCAA TGGCTTTAAA
GCCATCGGCG GCAACGTATA CAACGGTACG TTGGGTATTA TGTCGCTGAT GGCACCTTTT
TTCATTGGGA TGGCGCTGGC TGAAGAACGA AAAGTCGATC CTCTGGCGGC AGGTTTATTA
TCCGTCGCTG CCTTTATGAC CGTCACACCT TATAGCGTGG GCGAAGCCTA TGCCGTGGGT
GCAAATTGGC TAGGGGGCGC AAATATTATT TCCGGTATTG TTATCGGATT GGTCGTGGCG
GAAATGTTTA CTTTTATCGT TCGCCGAAAC TGGGTGATTC GTTTACCGGA TAGCGTGCCA
GAATCCGTAT CGCGTTCATT CTCCGCGTTA ATTCCTGGCT TTATTATTCT CTCTATCATG
GGGATTATCG CCTGGGCTCT GACTCACTGG GGCACCAACT TCCACCAGAT TATTATGGAC
ACCATTTCTA CGCCGCTGGC GTCGATGGGG GGCGTGGTAG GCTGGGCGTA CGTGATTTTC
ACCTCGCTGC TGTGGTTCTT TGGGGTACAC GGTTCTCTGG CGCTGGCTGC GCTGGACAGC
GGAATCATGA CGCCGTGGGC GCTGGAAAAC GTGCAGTTGT ATCAGCAGTA CGGTTCGGTC
GATGCGGCTC TGGCAGCAGG TAAAACCTTC CACGTGTGGG CGAAACCAAT GCTGGATTCG
TACATCTTCC TGGGCGGTAC GGGTGCAACG CTGGGTCTGA TCATTGCCAT CTTTATCACT
TCTCGTCGCG CCGACCATCG TCAAGTGGCA AAGCTGGCGC TGCCGTCAGG CATCTTCCAG
ATTAACGAGC CGATCCTGTT TGGTTTGCCA ATCATCATGA ACCCGGTGCT GTTTATCCCT
TTCATCCTGA TTCAGCCCCT GCTGGCCGCG ATTACGCTAA CCGCGTATTA CCTGGGCATT
ATCCCGCCAA TCACCAACAT TGCACCCTGG ACGATGCCGG CGGGTCTGGG CGCATTCTTC
AACACCAACG GTAGCGTGGC CGCTTTCTTA CTGGCGATAT TCAACTTAGG TGTCGCAACG
CTGCTCTACA TGCCTTTCGT GGCGATCGCC AACAAAGCGC AAACCACGAT AGATGAAGAA
GAGAGCGAAG AAGATATCGC CCTCGCACTG AAATTCTAA
 
Protein sequence
MSRVINSLEK VLLPFAVKIG KQPHVNAIKN GFIKVMPLTL AGAMFVLINN VFLSFGEGSF 
FYSLGIRLDA STIETLNGFK AIGGNVYNGT LGIMSLMAPF FIGMALAEER KVDPLAAGLL
SVAAFMTVTP YSVGEAYAVG ANWLGGANII SGIVIGLVVA EMFTFIVRRN WVIRLPDSVP
ESVSRSFSAL IPGFIILSIM GIIAWALTHW GTNFHQIIMD TISTPLASMG GVVGWAYVIF
TSLLWFFGVH GSLALAALDS GIMTPWALEN VQLYQQYGSV DAALAAGKTF HVWAKPMLDS
YIFLGGTGAT LGLIIAIFIT SRRADHRQVA KLALPSGIFQ INEPILFGLP IIMNPVLFIP
FILIQPLLAA ITLTAYYLGI IPPITNIAPW TMPAGLGAFF NTNGSVAAFL LAIFNLGVAT
LLYMPFVAIA NKAQTTIDEE ESEEDIALAL KF
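
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any computation. The following is a minimal Python sketch, not part of the original record, that removes the block formatting and computes a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the last line of the gene sequence is used as sample input.

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Last line of the gene sequence as listed in the record.
tail = clean_sequence("GAGAGCGAAG AAGATATCGC CCTCGCACTG AAATTCTAA")
print(len(tail), tail[-3:])  # prints: 39 TAA
```

The final codon, TAA, is a stop codon under translation table 11, which is consistent with the protein sequence ending at the preceding two codons (AAA TTC, read as K F). Passing the full gene sequence through the same two helpers gives a value that can be compared with the GC content field of the record.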