Gene EcSMS35_3740 details

Gene Information

Locus tag: EcSMS35_3740
Symbol: livK
ID: 6145055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3807699
End bp: 3808808
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 53%
IMG OID: 641618566
Product: high-affinity branched-chain amino acid ABC transporter, periplasmic leucine-specific-binding protein LivK
Protein accession: YP_001745706
Protein GI: 170683091
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
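
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3807699
end_bp = 3808808
reported_gene_length = 1110      # bp
reported_protein_length = 369    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print("gene length matches:", gene_length == reported_gene_length)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions the reported gene length and protein length agree with the coordinates.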


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGGA ATGCGAAAAC TATCATCGCA GGGATGATTG CACTGACAAT TTCACACACC 
GCTATGGCTG ACGATATTAA AGTCGCCGTT GTCGGCGCGA TGTCCGGCCC GATTGCCCAG
TGGGGCGATA TGGAATTTAA CGGCGCGCGT CAGGCAATTA AAGACATTAA TGCCAAAGGG
GGAATTAAGG GCGATAAACT GGTTGGCGTG GAATATGACG ACGCCTGCGA CCCGAAACAA
GCCGTTGCGG TCGCCAACAA AATCGTTAAC GACGGCATTA AATACGTTAT TGGTCATCTG
TGTTCTTCTT CTACCCAACC TGCATCAGAT ATCTACGAAG ACGAAGGTAT TTTGATGATC
TCGCCGGGGG CGACCAACCC GGAGCTGACC CAACGCGGTT ATCAATACAT CATGCGTACT
GCCGGGCTGG ATTCTTCCCA GGGGCCAACG GCGGCAAAAT ACATTGTTGA GACGGTGAAG
CCCCAGCGCA TCGCCATCAT TCACGACAAA CAACAGTATG GCGAAGGGCT GGCACGTTCG
GTGCAGGACG GGCTGAAAGC GGCTAACGCC AACGTTGTCT TCTTCGACGG TATTACCGCG
GGTGAGAAAG ATTTCTCCGC GCTGATCGCC CGCCTGAAAA AAGAAAACAT CGACTTCGTT
TACTACGGCG GTTACTACCC GGAAATGGGG CAGATGCTGC GCCAGGCCCG TTCCGTTGGC
CTGAAAACCC AGTTTATGGG GCCGGAAGGT GTGGGTAATG CGTCGTTGTC GAATATTGCT
GGCGATGCTG CCGAAGGCAT GTTGGTCACT ATGCCAAAAC GCTATGACCA GGATCCGGCA
AATCAGGGCA TCGTTGATGC GCTGAAAGCA GACAAGAAAG ATCCGTCCGG GCCATATGTC
TGGATCACTT ACGCGGCGGT GCAATCTCTG GCGACTGCAC TTGAGCGTAC CGGCAGCGAT
GAGCCGCTGG CGCTGGTGAA AGATTTAAAA GCTAACGGTG CAAACACCGT AATTGGGCCG
CTGAACTGGG ATGAAAAAGG CGATCTTAAG GGATTTGATT TTGGTGTCTT CCAGTGGCAC
GCCGACGGTT CATCCACGGC AGCCAAGTGA
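
A GC content figure such as the one reported in the gene information can be recomputed from a sequence laid out in the spaced, ten-base format used here. This is a minimal sketch; the function name `gc_content` is hypothetical, and the demonstration uses only the first ten bases of the gene, so its result is not the whole-gene value.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces and line breaks between ten-base groups) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence, used only as a small demonstration.
fragment = "ATGAAACGGA"
print(gc_content(fragment))
```

Passing the full gene sequence to the same function gives the figure to compare with the reported GC content.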
 
Protein sequence
MKRNAKTIIA GMIALTISHT AMADDIKVAV VGAMSGPIAQ WGDMEFNGAR QAIKDINAKG 
GIKGDKLVGV EYDDACDPKQ AVAVANKIVN DGIKYVIGHL CSSSTQPASD IYEDEGILMI
SPGATNPELT QRGYQYIMRT AGLDSSQGPT AAKYIVETVK PQRIAIIHDK QQYGEGLARS
VQDGLKAANA NVVFFDGITA GEKDFSALIA RLKKENIDFV YYGGYYPEMG QMLRQARSVG
LKTQFMGPEG VGNASLSNIA GDAAEGMLVT MPKRYDQDPA NQGIVDALKA DKKDPSGPYV
WITYAAVQSL ATALERTGSD EPLALVKDLK ANGANTVIGP LNWDEKGDLK GFDFGVFQWH
ADGSSTAAK
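
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons in frame. The sketch below builds a codon table in plain Python; it assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, and only the first and last 30 bases of the gene are used.

```python
from itertools import product

# Codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace between ten-base groups is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence shown above.
head = "ATGAAACGGA ATGCGAAAAC TATCATCGCA"
tail = "GCCGACGGTT CATCCACGGC AGCCAAGTGA"
print(translate(head))  # MKRNAKTIIA
print(translate(tail))  # ADGSSTAAK
```

The two outputs correspond to the first ten and last nine residues of the protein sequence, with the final TGA codon read as the stop.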