Gene EcolC_2315 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2315 
Symbol 
ID6065983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2553430 
End bp2554722 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content50% 
IMG OID641601718 
Productextracellular solute-binding protein 
Protein accessionYP_001725277 
Protein GI170020323 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.62001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAAT CAAAAATCGT GCTGTTATCA GCACTGGTTT CATGCGCCCT GATTTCAGGC 
TGTAAAGAAG AAAATAAAAC GAATGTATCC ATCGAATTTA TGCATTCTTC GGTGGAGCAG
GAGCGCCAGG CCGTTATCAG TAAATTGATT GCCCGTTTTG AAAAAGAAAA CCCTGGCATC
ACCGTTAAGC AAGTGCCCGT GGAAGAAGAT GCCTATAACA CTAAAGTCAT TACTCTTTCA
CGTAGCGGTT CGCTGCCGGA AGTGATCGAA ACCAGCCATG ACTACGCCAA AGTGATGGAC
AAAGAGCAGC TTATCGATCG CAAAGCGGTT GCCACAGTCA TCAGCAACGT TGGTGAAGGC
GCGTTTTACG ATGGCGTACT GCGTATTGTG CGTACCGAAG ATGGTAGCGC ATGGACCGGT
GTTCCTGTCA GCGCCTGGAT TGGCGGTATC TGGTATCGCA AAGATGTGCT GGCAAAAGCG
GGGCTTGAGG AGCCGAAAAA CTGGCAACAG CTGCTGGACG TTGCACAGAA ACTGAATGAC
CCGGCGAATA AAAAATACGG CATTGCGCTG CCTACAGCAG AAAGCGTGTT GACGGAACAA
TCCTTCTCCC AGTTTGCGTT ATCCAACCAG GCTAACGTCT TTAACGCCGA AGGCAAAATC
ACCCTTGATA CACCAGAGAT GATGCAGGCA CTGACCTATT ACCGCGACCT TGCTGCCAAC
ACTATGCCGG GTTCTAACGA CATCATGGAA GTGAAAGACG CCTTTATGAA CGGCACCGCG
CCGATGGCGA TTTACTCCAC CTATATCCTT CCGGCTGTGA TTAAAGAAGG CGACCCGAAA
AACGTCGGTT TCGTGGTGCC AACCGAGAAA AACTCTGCGG TCTACGGCAT GTTGACCTCG
CTGACCATTA CCGCCGGGCA AAAGACCGAA GAGACGGAAG CAGCAGAAAA ATTTGTCACC
TTTATGGAGC AGGCAGACAA CATTGCCGAC TGGGTGATGA TGTCGCCAGG TGCTGCGCTG
CCGGTGAATA AAGCGGTGGT GACTACCGCC ACCTGGAAAG ACAACGACGT TATTAAGGCG
CTGGGTGAAC TACCGAATCA GCTAATCGGT GAACTGCCAA ATATTCAGGT TTTTGGCGCA
GTAGGGGATA AAAACTTTAC CCGCATGGGT GATGTGACGG GTTCTGGCGT GGTGAGTTCA
ATGGTGCATA ACGTCACCGT GGGTAAAGCC GATCTCTCTA CTACGCTGCA AGCGAGCCAG
AAAAAACTGG ATGAACTGAT CGAACAGCAC TAA
 
Protein sequence
MIKSKIVLLS ALVSCALISG CKEENKTNVS IEFMHSSVEQ ERQAVISKLI ARFEKENPGI 
TVKQVPVEED AYNTKVITLS RSGSLPEVIE TSHDYAKVMD KEQLIDRKAV ATVISNVGEG
AFYDGVLRIV RTEDGSAWTG VPVSAWIGGI WYRKDVLAKA GLEEPKNWQQ LLDVAQKLND
PANKKYGIAL PTAESVLTEQ SFSQFALSNQ ANVFNAEGKI TLDTPEMMQA LTYYRDLAAN
TMPGSNDIME VKDAFMNGTA PMAIYSTYIL PAVIKEGDPK NVGFVVPTEK NSAVYGMLTS
LTITAGQKTE ETEAAEKFVT FMEQADNIAD WVMMSPGAAL PVNKAVVTTA TWKDNDVIKA
LGELPNQLIG ELPNIQVFGA VGDKNFTRMG DVTGSGVVSS MVHNVTVGKA DLSTTLQASQ
KKLDELIEQH