Gene EcSMS35_2327 details


Gene Information

Locus tag: EcSMS35_2327
Symbol:
ID: 6145337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2359837
End bp: 2360931
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 52%
IMG OID: 641617201
Product: ABC transporter, permease protein
Protein accession: YP_001744374
Protein GI: 170681691
COG category: [R] General function prediction only
COG ID: [COG4174] ABC-type uncharacterized transport system, permease component
TIGRFAM ID:
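
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2359837
END_BP = 2360931
GENE_LENGTH_BP = 1095
PROTEIN_LENGTH_AA = 364


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 2360931 - 2359837 + 1 gives 1095 bp, and 1095 / 3 - 1 gives 364 aa, which agrees with the listed gene and protein lengths.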


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.0047912
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGCGCTT ATCTGATTCG CCGTCTGTTG CTGGTGATCC CAACGCTATG GGCGATTATC 
ACTATCAACT TTTTCATCGT GCAAATTGCG CCTGGCGGTC CGGTCGACCA GGCCATCGCC
GCCATTGAGT TTGGCAATGC CGGAGTATTA CCCGGCGCAG GCGGTGAAGG TGTTCGCGCC
AGCCATGCGC AAACGGGCGT CGGCAATATC AGCGACAGTA ATTACCGTGG CGGACGCGGA
TTAGATCCAG AAGTGATCGC CGAGATCACT CATCGCTACG GTTTCGATAA GCCGATCCAC
GAACGTTACT TCAAAATGCT CTGGGACTAC ATCCGTTTTG ATTTTGGCGA CAGCCTTTTT
CGCAGCGCCT CGGTGCTGAC GCTGATTAAA GACAGTCTGC CGGTTTCCAT CACCCTCGGA
TTGTGGAGCA CGCTGATTAT CTATCTGGTG TCGATTCCGT TAGGCATTCG CAAAGCTGTT
TATAATGGGA GCCGCTTTGA CGTCTGGAGT AGCGCATTTA TCATCATCGG CTACGCCATT
CCGGCCTTTT TGTTTGCCAT CCTGCTGATT GTCTTCTTCG CGGGCGGCAG CTATTTCGAC
CTGTTCCCTC TGCGCGGCCT GGTTTCCGCT AACTTTGATT CGCTGCCGTG GTATCAGAAA
ATCACCGATT ATCTGTGGCA TATCACGCTG CCGGTGCTGG CGACTGTGAT TGGTGGCTTT
GCAGCGCTGA CCATGCTGAC AAAAAACTCA TTCCTTGATG AAGTGCGCAA GCAATACGTG
GTGACCGCGC GCGCGAAAGG GGTAAGTGAA AAAAATATTC TCTGGAAACA TGTGTTCCGC
AACGCCATGC TGCTGGTGAT TGCCGGTTTT CCGGCGACGT TTATCAGCAT GTTTTTTACC
GGCTCGCTGC TGATTGAGGT GATGTTTTCA CTCAATGGTC TTGGTTTACT GGGCTACGAA
GCGACCGTCT CGCGCGATTA TCCTGTAATG TTTGGTACCT TGTATATTTT CACCCTGATT
GGCCTGCTGC TGAATATTGT CAGTGATATC AGCTATACGC TGGTCGATCC GCGTATAGAT
TTTGAGGGAC GCTAA
 
Protein sequence
MGAYLIRRLL LVIPTLWAII TINFFIVQIA PGGPVDQAIA AIEFGNAGVL PGAGGEGVRA 
SHAQTGVGNI SDSNYRGGRG LDPEVIAEIT HRYGFDKPIH ERYFKMLWDY IRFDFGDSLF
RSASVLTLIK DSLPVSITLG LWSTLIIYLV SIPLGIRKAV YNGSRFDVWS SAFIIIGYAI
PAFLFAILLI VFFAGGSYFD LFPLRGLVSA NFDSLPWYQK ITDYLWHITL PVLATVIGGF
AALTMLTKNS FLDEVRKQYV VTARAKGVSE KNILWKHVFR NAMLLVIAGF PATFISMFFT
GSLLIEVMFS LNGLGLLGYE ATVSRDYPVM FGTLYIFTLI GLLLNIVSDI SYTLVDPRID
FEGR
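
The relationship between the gene sequence and the protein sequence above can be checked with a short script. This is a minimal sketch, not the method used to produce the record: it builds the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons, not handled here because this gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical. Only the first 30 and last 15 bases of the gene sequence are used as sample input.

```python
# Codon table in the conventional TCAG ordering (standard amino acid
# assignments, shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    first_30 = "ATGGGCGCTT ATCTGATTCG CCGTCTGTTG"  # start of the gene sequence
    last_15 = "TTTGAGGGAC GCTAA"                   # end of the gene sequence
    print(translate(first_30))  # MGAYLIRRLL
    print(translate(last_15))   # FEGR
```

The two sample translations match the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.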