Gene EcHS_A3656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3656 
SymbollivK 
ID5594557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3642707 
End bp3643816 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content54% 
IMG OID640922772 
Producthigh-affinity branched-chain amino acid ABC transporter, periplasmic leucine-specific-binding protein LivK 
Protein accessionYP_001460252 
Protein GI157162934 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGGA ATGCGAAAAC TATCATCGCA GGGATGATTG CACTGGCAAT TTCACACACC 
GCTATGGCTG ACGATATTAA AGTCGCCGTT GTCGGCGCGA TGTCCGGCCC GATTGCCCAG
TGGGGCGATA TGGAATTTAA CGGCGCGCGT CAGGCGATTA AAGACATTAA TGCCAAAGGG
GGAATTAAGG GCGACAAGCT GGTTGGCGTG GAATATGACG ACGCCTGCGA CCCGAAACAA
GCCGTTGCGG TCGCCAACAA AATCGTTAAT GACGGCATTA AATACGTTAT TGGTCATCTG
TGTTCTTCTT CTACCCAACC TGCATCAGAT ATCTACGAAG ACGAAGGTAT TCTGATGATC
TCGCCGGGAG CGACCAACCC GGAGCTGACC CAACGCGGTT ATCAACACAT TATGCGTACT
GCCGGGCTGG ACTCTTCCCA GGGGCCAACG GCGGCAAAAT ACATTCTTGA GACGGTGAAG
CCCCAGCGCA TCGCCATCAT TCACGACAAA CAACAGTATG GCGAAGGGCT GGCGCGTTCG
GTGCAGGACG GGCTGAAAGC GGCTAACGCC AACGTCGTCT TCTTCGACGG TATTACCGCC
GGGGAGAAAG ATTTCTCCGC GCTGATCGCC CGCCTGAAAA AAGAAAACAT CGACTTCGTT
TACTACGGCG GTTACTACCC GGAAATGGGG CAGATGCTGC GCCAGGCCCG TTCCGTTGGC
CTGAAAACTC AGTTTATGGG GCCGGAAGGT GTGGGTAACG CATCATTGTC GAATATTGCC
GGTGATGCTG CCGAAGGCAT GTTGGTCACT ATGCCAAAAC GCTATGACCA GGATCCGGCA
AACCAGGGCA TCGTTGATGC GCTGAAAGCA GACAAGAAAG ATCCGTCCGG GCCTTATGTC
TGGATCACCT ACGCGGCGGT GCAATCTCTG GCGACTGCCC TTGAGCGTAC TGGCAGCGAT
GAGCCGCTGG CGCTGGTGAA AGATTTAAAA GCTAACGGTG CAAACACCGT GATTGGGCCG
CTGAACTGGG ATGAAAAAGG CGATCTTAAG GGATTTGATT TTGGTGTCTT CCAGTGGCAC
GCCGACGGTT CATCCACGGC AGCCAAGTGA
 
Protein sequence
MKRNAKTIIA GMIALAISHT AMADDIKVAV VGAMSGPIAQ WGDMEFNGAR QAIKDINAKG 
GIKGDKLVGV EYDDACDPKQ AVAVANKIVN DGIKYVIGHL CSSSTQPASD IYEDEGILMI
SPGATNPELT QRGYQHIMRT AGLDSSQGPT AAKYILETVK PQRIAIIHDK QQYGEGLARS
VQDGLKAANA NVVFFDGITA GEKDFSALIA RLKKENIDFV YYGGYYPEMG QMLRQARSVG
LKTQFMGPEG VGNASLSNIA GDAAEGMLVT MPKRYDQDPA NQGIVDALKA DKKDPSGPYV
WITYAAVQSL ATALERTGSD EPLALVKDLK ANGANTVIGP LNWDEKGDLK GFDFGVFQWH
ADGSSTAAK