Gene EcolC_0256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0256 
Symbol 
ID6067961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp293805 
End bp294908 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content55% 
IMG OID641599655 
Productextracellular ligand-binding receptor 
Protein accessionYP_001723262 
Protein GI170018308 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATAA AGGGTAAAGC GTTACTGGCA GGATGTATCG CGCTGGCATT CAGCAATATG 
GCTCTGGCAG AAGATATTAA AGTCGCGGTC GTGGGCGCAA TGTCCGGTCC GGTGGCCCAG
TACGGTGACC AGGAGTTTAC CGGCGCAGAG CAGGCGGTTG CGGATATCAA CGCTAAAGGC
GGCATTAAAG GCAACAAACT GCAAATCGTA AAATATGACG ATGCCTGTGA CCCGAAACAG
GCGGTTGCGG TGGCGAACAA AGTCGTTAAC GACGGCATTA AATACGTGAT TGGTCACCTC
TGTTCTTCAT CAACGCAGCC TGCGTCTGAT ATCTACGAAG ACGAAGGCAT TTTAATGATC
ACCCCAGCGG CAACCGCGCC GGAGCTGACC GCCCGTAGCT ATCAGCTGAT CCTGCGCACC
ACCGGCCTGG ACTCCGACCA GGGGCCGACG GCGGCGAAAT ATATTCTTGA GAAAGTGAAA
CCGCAGCGTA TTGCTATCGT TCACGACAAA CAGCAATACG GCGAAGGTCT GGCGCGAGCG
GTGCAGGACG GCCTGAAGAA AGGCAATGCT AACGTGGTGT TCTTTGATGG TATCACCGCC
GGGGAAAAAG ATTTCTCAAC GCTGGTGGCG CGTCTGAAAA AAGAGAATAT CGACTTCGTT
TACTACGGCG GTTATCACCC GGAAATGGGG CAAATCCTGC GTCAGGCACG CGCGGCAGGG
CTGAAAACTC AGTTTATGGG GCCGGAAGGT GTGGCTAACG TTTCGCTGTC TAACATTGCG
GGCGAATCAG CGGAAGGGCT GCTGGTGACC AAGCCGAAGA ACTACGATCA GGTTCCGGCG
AACAAACCCA TTGTTGACGC GATCAAAGCG AAAAAACAGG ACCCAAGCGG CGCATTCGTC
TGGACCACCT ACGCCGCGCT GCAATCTTTG CAGGCGGGCC TGAATCAGTC TGACGATCCG
GCTGAAATCG CCAAATACCT GAAAGCGAAC TCCGTGGATA CCGTAATGGG ACCGCTGACG
TGGGATGAGA AAGGCGATCT GAAAGGCTTT GAGTTCGGCG TATTTGACTG GCACGCCAAC
GGCACGGCGA CCGATGCGAA GTAA
 
Protein sequence
MNIKGKALLA GCIALAFSNM ALAEDIKVAV VGAMSGPVAQ YGDQEFTGAE QAVADINAKG 
GIKGNKLQIV KYDDACDPKQ AVAVANKVVN DGIKYVIGHL CSSSTQPASD IYEDEGILMI
TPAATAPELT ARSYQLILRT TGLDSDQGPT AAKYILEKVK PQRIAIVHDK QQYGEGLARA
VQDGLKKGNA NVVFFDGITA GEKDFSTLVA RLKKENIDFV YYGGYHPEMG QILRQARAAG
LKTQFMGPEG VANVSLSNIA GESAEGLLVT KPKNYDQVPA NKPIVDAIKA KKQDPSGAFV
WTTYAALQSL QAGLNQSDDP AEIAKYLKAN SVDTVMGPLT WDEKGDLKGF EFGVFDWHAN
GTATDAK