Gene EcolC_4245 details


Gene Information

Locus tag: EcolC_4245
Symbol:
ID: 6067948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4694705
End bp: 4696210
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 51%
IMG OID: 641603682
Product: D-ribose transporter ATP binding protein
Protein accession: YP_001727168
Protein GI: 170022214
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
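
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source record.

```python
# Values copied from the Gene Information fields above.
START_BP = 4694705
END_BP = 4696210


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1506
print(protein_length(length_bp))  # 501
```

Under these assumptions, both results agree with the Gene Length (1506 bp) and Protein Length (501 aa) fields of the record.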


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0036959
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGAAGCAT TACTTCAGCT TAAAGGCATC GATAAAGCCT TCCCGGGCGT AAAAGCCCTC 
TCGGGCGCAG CGTTAAATGT CTATCCGGGC CGCGTGATGG CGCTGGTGGG CGAAAACGGC
GCGGGTAAAT CCACCATGAT GAAAGTGCTT ACTGGTATCT ATGCTCGCGA TGCCGGCACG
CTTTTATGGC TGGGGAAAGA AACGACATTT ACCGGGCCGA AATCTTCCCA GGAAGCCGGG
ATTGGGATTA TCCATCAGGA ACTGAACCTG ATCCCGCAGT TGACCATTGC CGAAAACATT
TTCCTCGGTC GTGAGTTTGT TAATCGCTTT GGCAAAATTG ACTGGAAAAC CATGTATGCC
GAAGCGGATA AATTGCTGGC TAAACTTAAC CTGCGCTTTA AAAGCGACAA GCTGGTGGGC
GATCTTTCCA TCGGTGACCA GCAAATGGTT GAAATCGCCA AAGTGCTGAG CTTTGAGTCG
AAAGTCATCA TTATGGATGA ACCGACCGAT GCGCTGACCG ATACCGAAAC CGAATCCCTG
TTCCGCGTCA TCCGCGAGCT GAAATCGCAA GGCCGCGGTA TTGTCTATAT CTCCCACCGC
ATGAAAGAAA TCTTCGAGAT TTGCGATGAC GTTACCGTTT TTCGTGATGG GCAATTTATT
GCTGAGCGCG AAGTGGCATC ACTGACCGAA GATTCGCTGA TTGAGATGAT GGTGGGTCGC
AAGCTGGAAG ATCAATATCC GCACCTGGAC AAAGCGCCGG GAGATATCCG CCTGAAAGTC
GATAATCTCT GCGGACCTGG CGTTAACGAT GTCTCTTTTA CTTTACGCAA AGGCGAAATT
CTTGGCGTCT CTGGTTTGAT GGGCGCAGGT CGTACCGAAC TGATGAAAGT GCTCTACGGC
GCACTACCGC GCACCAGCGG TTACGTCACC CTGGATGGGC ATGAAGTCGT TACCCGTTCA
CCGCAGGATG GCTTGGCAAA CGGCATTGTG TATATCTCCG AAGACCGTAA ACGTGACGGT
TTAGTGTTGG GCATGCCAGT AAAAGAGAAC ATGTCGCTGA CAGCGCTGCG CTACTTCAGC
CGCGCTGGCG GCAGTTTGAA GCATGCCGAT GAACAGCAGG CTGTGAGTGA TTTCATTCGT
CTGTTTAATG TGAAAACTCC GTCGATGGAA CAGGCAATTG GTCTGCTTTC CGGTGGCAAT
CAGCAAAAAG TGGCGATTGC CCGCGGTCTG ATGACACGCC CCAAAGTGTT GATCCTTGAT
GAACCTACCC GTGGCGTAGA TGTCGGCGCG AAAAAAGAGA TCTATCAACT GATTAACCAG
TTCAAAGCCG ATGGCTTGAG CATCATTCTG GTGTCATCGG AGATGCCAGA AGTATTAGGC
ATGAGCGATC GCATCATCGT CATGCATGAA GGGCATCTCA GCGGGGAATT TACTCGTGAG
CAGGCCACCC AGGAAGTGTT AATGGCTGCC GCTGTGGGCA AGCTTAATCG CGTGAATCAG
GAGTAA
 
Protein sequence
MEALLQLKGI DKAFPGVKAL SGAALNVYPG RVMALVGENG AGKSTMMKVL TGIYARDAGT 
LLWLGKETTF TGPKSSQEAG IGIIHQELNL IPQLTIAENI FLGREFVNRF GKIDWKTMYA
EADKLLAKLN LRFKSDKLVG DLSIGDQQMV EIAKVLSFES KVIIMDEPTD ALTDTETESL
FRVIRELKSQ GRGIVYISHR MKEIFEICDD VTVFRDGQFI AEREVASLTE DSLIEMMVGR
KLEDQYPHLD KAPGDIRLKV DNLCGPGVND VSFTLRKGEI LGVSGLMGAG RTELMKVLYG
ALPRTSGYVT LDGHEVVTRS PQDGLANGIV YISEDRKRDG LVLGMPVKEN MSLTALRYFS
RAGGSLKHAD EQQAVSDFIR LFNVKTPSME QAIGLLSGGN QQKVAIARGL MTRPKVLILD
EPTRGVDVGA KKEIYQLINQ FKADGLSIIL VSSEMPEVLG MSDRIIVMHE GHLSGEFTRE
QATQEVLMAA AVGKLNRVNQ E
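
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below is one possible way to do that, to compute a GC fraction, and to translate the gene sequence into protein. It is an illustration under stated assumptions rather than the database's own procedure: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon assignments used are those of the standard genetic code, which translation table 11 shares for elongation codons (table 11 differs in which start codons it permits; this gene starts with ATG).

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGAAGCAT TACTTCAGCT TAAAGGCATC"
print(translate(first_blocks))  # MEALLQLKGI
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence to the same functions would allow the GC content and Protein Length fields to be checked in the same way.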