Gene EcolC_0258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0258 
Symbol 
ID6067977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp295914 
End bp297023 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content54% 
IMG OID641599657 
Productextracellular ligand-binding receptor 
Protein accessionYP_001723264 
Protein GI170018310 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGGA ATGCGAAAAC TATCATCGCA GGGATGATTG CACTGGCAAT TTCACACACC 
GCTATGGCTG ACGATATTAA AGTCGCCGTT GTCGGCGCGA TGTCCGGCCC GATTGCCCAG
TGGGGCGATA TGGAATTTAA CGGCGCGCGT CAGGCGATTA AAGACATTAA TGCCAAAGGG
GGAATTAAGG GCGACAAGCT GGTTGGCGTG GAATATGACG ACGCCTGCGA CCCGAAACAA
GCCGTTGCGG TCGCCAACAA AATCGTTAAT GACGGCATTA AATACGTTAT TGGTCATCTG
TGTTCTTCTT CTACCCAACC TGCATCAGAT ATCTACGAAG ACGAAGGTAT TCTGATGATC
TCGCCGGGAG CGACCAACCC GGAGCTGACC CAACGCGGTT ATCAACACAT TATGCGTACT
GCCGGGCTGG ACTCTTCCCA GGGGCCAACG GCGGCAAAAT ACATTCTTGA GACGGTGAAG
CCCCAGCGCA TCGCCATCAT TCACGACAAA CAACAGTATG GCGAAGGGCT GGCGCGTTCG
GTGCAGGACG GGCTGAAAGC GGCTAACGCC AACGTCGTCT TCTTCGACGG TATTACCGCC
GGGGAGAAAG ATTTCTCCGC GCTGATCGCC CGCCTGAAAA AAGAAAACAT CGACTTCGTT
TACTACGGCG GTTACTACCC GGAAATGGGG CAGATGCTGC GCCAGGCCCG TTCCGTTGGC
CTGAAAACTC AGTTTATGGG GCCGGAAGGT GTGGGTAACG CATCATTGTC GAATATTGCC
GGTGATGCTG CCGAAGGCAT GTTGGTCACT ATGCCAAAAC GCTATGACCA GGATCCGGCA
AACCAGGGCA TCGTTGATGC GCTGAAAGCA GACAAGAAAG ATCCGTCCGG GCCTTATGTC
TGGATCACCT ACGCGGCGGT GCAATCTCTG GCGACTGCCC TTGAGCGTAC TGGCAGCGAT
GAGCCGCTGG CGCTGGTGAA AGATTTAAAA GCTAACGGTG CAAACACCGT GATTGGGCCG
CTGAACTGGG ATGAAAAAGG CGATCTTAAG GGATTTGATT TTGGTGTCTT CCAGTGGCAC
GCCGACGGTT CATCCACGGC AGCCAAGTGA
 
Protein sequence
MKRNAKTIIA GMIALAISHT AMADDIKVAV VGAMSGPIAQ WGDMEFNGAR QAIKDINAKG 
GIKGDKLVGV EYDDACDPKQ AVAVANKIVN DGIKYVIGHL CSSSTQPASD IYEDEGILMI
SPGATNPELT QRGYQHIMRT AGLDSSQGPT AAKYILETVK PQRIAIIHDK QQYGEGLARS
VQDGLKAANA NVVFFDGITA GEKDFSALIA RLKKENIDFV YYGGYYPEMG QMLRQARSVG
LKTQFMGPEG VGNASLSNIA GDAAEGMLVT MPKRYDQDPA NQGIVDALKA DKKDPSGPYV
WITYAAVQSL ATALERTGSD EPLALVKDLK ANGANTVIGP LNWDEKGDLK GFDFGVFQWH
ADGSSTAAK