Gene EcSMS35_2303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2303 
SymbollysP 
ID6146570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2332774 
End bp2334243 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content53% 
IMG OID641617177 
Productlysine transporter 
Protein accessionYP_001744350 
Protein GI170679799 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0833] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0126674 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTCCG AAACTAAAAC TACAGAAGCG CCGGGCTTAC GCCGTGAATT AAAGGCGCGT 
CACCTGACGA TGATTGCCAT TGGCGGTTCC ATCGGTACAG GTCTTTTTGT TGCCTCTGGC
GCAACGATTT CTCAGGCAGG TCCGGGCGGG GCATTGCTCT CGTATATGCT GATTGGCCTG
ATGGTTTACT TCCTGATGAC CAGTCTCGGT GAACTGGCTG CATATATGCC GGTTTCCGGT
TCGTTTGCCA CTTACGGTCA GAACTATGTT GAAGAAGGCT TTGGCTTCGC GCTGGGCTGG
AACTACTGGT ACAACTGGGC GGTGACTATC GCCGTTGACC TGGTTGCAGC TCAGCTGGTC
ATGAGCTGGT GGTTCCCGGA TACACCGGGC TGGATCTGGA GTGCGCTGTT CCTCGGCGTT
ATCTTCCTGC TGAACTACAT CTCAGTTCGT GGCTTTGGTG AAGCGGAATA CTGGTTCTCA
CTGATCAAAG TCACGACAGT TATTGTCTTT ATCATCGTTG GCGTGCTGAT GATTATCGGT
ATCTTCAAAG GCGCGCAGCC TGCGGGCTGG AGCAACTGGA CAATCGGCGA AGCGCCGTTT
GCTGGTGGTT TTGCGGCGAT GATCGGCGTA GCTATGATTG TCGGCTTCTC TTTCCAGGGA
ACCGAGCTGA TCGGTATTGC TGCAGGCGAG TCCGAAGATC CGGCGAAAAA CATTCCACGC
GCGGTACGTC AGGTATTCTG GCGAATCCTG TTGTTCTATG TGTTCGCGAT CCTGATTATC
AGCCTGATTA TTCCGTATAC CGATCCGAGC CTGCTGCGTA ACGATGTTAA AGACATCAGC
GTCAGTCCGT TCACCCTGGT GTTCCAGCAC GCGGGTCTGC TCTCTGCGGC GGCGGTGATG
AACGCAGTTA TTCTGACGGC GGTGCTGTCA GCGGGTAACT CCGGTATGTA TGCGTCTACT
CGTATGCTGT ACACCCTGGC GTGTGACGGT AAAGCGCCGC GCATTTTCGC TAAACTGTCG
CGTGGTGGCG TGCCGCGTAA TGCGCTGTAT GCGACGACGG TGATTGCAGG TCTGTGCTTC
CTGACCTCCA TGTTTGGCAA CCAGACGGTA TACCTGTGGC TGCTGAACAC CTCCGGGATG
ACCGGTTTTA TCGCCTGGCT GGGGATTGCC ATTAGCCACT ATCGTTTCCG TCGCGGGTAC
GTATTGCAGG GACACGACAT TAACGATCTG CCGTACCGTT CAGGTTTCTT CCCACTGGGG
CCGATCTTCG CATTCATTCT GTGTCTGATT ATCACTTTGG GCCAGAACTA CGAAGCGTTC
CTGAAAGATA CCATTGACTG GGGCGGCGTA GCGGCAACGT ATATTGGTAT CCCGCTGTTC
CTGATTATTT GGTTCGGATA CAAGCTGATT AAAGGAACTC ACTTCGTACG CTACAGCGAA
ATGAAGTTCC CGCAGAACGA TAAGAAATAA
 
Protein sequence
MGSETKTTEA PGLRRELKAR HLTMIAIGGS IGTGLFVASG ATISQAGPGG ALLSYMLIGL 
MVYFLMTSLG ELAAYMPVSG SFATYGQNYV EEGFGFALGW NYWYNWAVTI AVDLVAAQLV
MSWWFPDTPG WIWSALFLGV IFLLNYISVR GFGEAEYWFS LIKVTTVIVF IIVGVLMIIG
IFKGAQPAGW SNWTIGEAPF AGGFAAMIGV AMIVGFSFQG TELIGIAAGE SEDPAKNIPR
AVRQVFWRIL LFYVFAILII SLIIPYTDPS LLRNDVKDIS VSPFTLVFQH AGLLSAAAVM
NAVILTAVLS AGNSGMYAST RMLYTLACDG KAPRIFAKLS RGGVPRNALY ATTVIAGLCF
LTSMFGNQTV YLWLLNTSGM TGFIAWLGIA ISHYRFRRGY VLQGHDINDL PYRSGFFPLG
PIFAFILCLI ITLGQNYEAF LKDTIDWGGV AATYIGIPLF LIIWFGYKLI KGTHFVRYSE
MKFPQNDKK