Gene Ent638_4115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_4115 
Symbol 
ID5110686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4477758 
End bp4479263 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content54% 
IMG OID640494342 
ProductD-ribose transporter ATP binding protein 
Protein accessionYP_001178819 
Protein GI146313745 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.684938 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0182067 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCAT TATTACAACT CAAAGGGATC GATAAATCGT TCCCGGGCGT AAAAGCCCTC 
TCTGGCGCGG CGCTGAATGT TTACGCCGGT CGCGTGATGG CGCTGGTAGG TGAAAACGGC
GCTGGCAAAT CCACCATGAT GAAAGTGCTG ACCGGCATCT ATCAACGCGA TGCGGGTTCG
CTGGTGTGGC TAGGGAAAGA AACCACCTTC ACCGGAGCGA AATCCTCACA GGAAGCCGGA
ATCGGCATTA TCCACCAGGA ATTAAACCTG ATCCCGCAGC TCACCATTGC CGAAAATATC
TTTCTTGGCC GCGAATTCGT GGGCCGCTTT GGCAACATCG ACTGGAAGTT GATGTATGCC
GAAGCCGACA AACTACTGGC GAAACTGAAT CTGCGCTTCA AGAGCGACCG TCTGGTGGGC
GATCTTTCGA TCGGCGACCA GCAGATGGTG GAAATCGCGA AAGTCTTGAG CTTCGAGTCA
AAAGTCATCG TCATGGATGA ACCGACTGAT GCACTCACCG ATACCGAAAC CGAATCTTTA
TTCCGCGTCA TCCGCGAGTT GAAATCGCAG GGACGTGGCA TTGTTTATAT CTCTCACCGC
ATGAAAGAGA TTTTCGAGAT TTGCGATGAC GTTACCGTGT TCCGCGACGG GCAGTTTATT
GCCGAGCGCG AGGTGGCCTC TCTGACCGAA GATTCGCTGA TCGAAATGAT GGTGGGGCGC
AAGCTCGAAG ATCAGTATCC GCGACTGGAT AAAGCGCCGG GTGACGTCCG CCTGAAGGTG
GATAATCTAT GCGGTCCTGG CGTCGACAAT GTCTCCTTTA CGCTGCGCCA GGGTGAAATC
CTGGGTGTGG CGGGCCTGAT GGGCGCGGGT CGTACCGAAT TAATGAAAGT GCTGTACGGC
GCGCTGCCGC GTACCAGCGG TTACGTCACG CTCAACGGTC ATGAAGTTGT TACCCGTTCT
CCGCAGGATG GGCTCGCAAA CGGCATCGTT TATATCTCTG AAGACCGTAA ACGCGATGGC
CTGGTGCTGG GCATGTCGGT TAAAGAGAAC ATGTCTCTGA CCGCGCTGCG CTATTTCAGC
CGTAACGGCG GTGGCTTAAA ACATAAAGAC GAACAGCAGG CGGTCAGCGA TTTTATCCGC
CTGTTCAATG TGAAAACGCC TTCGATGGAA CAGGCTATCG GCCTGCTTTC CGGCGGTAAT
CAGCAGAAAG TGGCGATTGC GCGCGGTCTG ATGACGCGTC CAAAAGTGCT GATCCTCGAT
GAGCCCACTC GTGGCGTGGA CGTCGGCGCC AAGAAAGAGA TTTATCAGCT GATCAACCAG
TTTAAGGCTG ACGGGCTGAG CATCATTCTG GTCTCTTCCG AGATGCCAGA AGTATTAGGC
ATGAGCGATC GCATTATCGT CATGCATGAA GGGCATCTCG GCGGTGAATT CACTCGCGAG
CAGGCCACCC AGGAAGTTCT GATGGCTGCC GCTGTGGGCA AGCTTAATCG CGTGAATCAG
GAGTAA
 
Protein sequence
MDALLQLKGI DKSFPGVKAL SGAALNVYAG RVMALVGENG AGKSTMMKVL TGIYQRDAGS 
LVWLGKETTF TGAKSSQEAG IGIIHQELNL IPQLTIAENI FLGREFVGRF GNIDWKLMYA
EADKLLAKLN LRFKSDRLVG DLSIGDQQMV EIAKVLSFES KVIVMDEPTD ALTDTETESL
FRVIRELKSQ GRGIVYISHR MKEIFEICDD VTVFRDGQFI AEREVASLTE DSLIEMMVGR
KLEDQYPRLD KAPGDVRLKV DNLCGPGVDN VSFTLRQGEI LGVAGLMGAG RTELMKVLYG
ALPRTSGYVT LNGHEVVTRS PQDGLANGIV YISEDRKRDG LVLGMSVKEN MSLTALRYFS
RNGGGLKHKD EQQAVSDFIR LFNVKTPSME QAIGLLSGGN QQKVAIARGL MTRPKVLILD
EPTRGVDVGA KKEIYQLINQ FKADGLSIIL VSSEMPEVLG MSDRIIVMHE GHLGGEFTRE
QATQEVLMAA AVGKLNRVNQ E