Gene EcSMS35_4610 details


Gene Information

Locus tag: EcSMS35_4610
Symbol:
ID: 6146817
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4714123
End bp: 4715379
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 52%
IMG OID: 641619426
Product: inner membrane protein YjeH
Protein accession: YP_001746537
Protein GI: 170683031
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
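
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (neither is stated on the page; the function names are hypothetical):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(4714123, 4715379)
print(length)                     # 1257
print(protein_length_aa(length))  # 418
```

Both results agree with the Gene Length and Protein Length fields of this record.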


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.401918
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGGAC TCAAACAAGA ACTGGGGCTG GCCCAGGGCA TTGGCCTGCT ATCGACGTCA 
TTATTAGGCA CTGGCGTGTT TGCCGTTCCT GCGTTAGCTG CGCTGGTAGC GGGCAATAAC
AGCCTGTGGG CGTGGCCCGT TTTGATTATC TTAGTGTTCC CGATTGCGAT TGTGTTTGCG
ATTCTGGGTC GCCACTATCC CAGCGCAGGC GGTGTCGCGC ACTTCGTCGG TATGGCGTTT
GGTTCGCGGC TTGAGCGAGT CACCGGCTGG TTGTTTTTAT CGGTCATTCC CGTGGGTTTG
CCTGCCGCGC TACAAATTGC CGCCGGGTTC GGCCAGGCAA TGTTTGGCTG GCATAGCTGG
CAACTGTTGT TGGCAGAACT CGGTACGCTG GCGCTGGTGT GGTATATCGG TACTCGCGGT
GCCAGTTCCA GTGCTAATCT ACAAACAGTT ATTGCCGGAC TTATCGTCGC GCTGATTGTC
GCTATCTGGT GGGCGGGCGA TATCAAACCT GCGAATATCC CCTTCCCTGC ACCTGGTAAT
ATCGAACTTA CCGGGTTATT TGCTGCGTTA TCAGTGATGT TCTGGTGCTT TGTCGGTCTG
GAGGCATTTG CCCATCTTGC CTCGGAATTT AAAAATCCAG AGCGTGATTT TCCTCGTGCT
TTGATGATTG GTCTGCTGCT GGCAGGATTA GTCTACTGGG GCTGTACGGT AGTCGTCTTA
CACTTCGACG CCTATGGTGA ACAAATGGCC GCGGCAGCAT CGCTTCCAAA AATTGTAGTG
CAACTGTTCG GTGTAGGAGC GTTATGGATT GCCTGCGTGA TTGGCTATCT GGCCTGCTTT
GCCAGTCTCA ACATTTATAT ACAGAGCTTC GCCCGCCTGG TCTGGTCGCA GGCGCAACAT
AATCCTGACC ACTACCTGGC ACGCCTCTCT TCTCGTCATA TCCCGAATAA TGCCCTCAAT
GCGGTGCTCG GCTGCTGCGT GGTGAGCACG TTGGTGATTC ATGCTTTAGA GATCAATCTG
GACGCTCTTA TTATTTATGC CAATGGCATC TTTATTATGA TTTATCTGTT ATGCATGCTG
GCAGGTTGTA AATTATTGCA AGGACGTTAT CGACTACTGG CGGTGGTTGG CGGGCTGTTA
TGCGTTCTGT TACTGGCAAT GGTCGGCTGG AAAAGTCTCT ATGCGCTGAT CATGCTGGCG
GGGTTATGGC TGTTTCTGCC AAAACGAAAA ACGCCGGAAA ATGGCATAAC CACATAA
 
Protein sequence
MSGLKQELGL AQGIGLLSTS LLGTGVFAVP ALAALVAGNN SLWAWPVLII LVFPIAIVFA 
ILGRHYPSAG GVAHFVGMAF GSRLERVTGW LFLSVIPVGL PAALQIAAGF GQAMFGWHSW
QLLLAELGTL ALVWYIGTRG ASSSANLQTV IAGLIVALIV AIWWAGDIKP ANIPFPAPGN
IELTGLFAAL SVMFWCFVGL EAFAHLASEF KNPERDFPRA LMIGLLLAGL VYWGCTVVVL
HFDAYGEQMA AAASLPKIVV QLFGVGALWI ACVIGYLACF ASLNIYIQSF ARLVWSQAQH
NPDHYLARLS SRHIPNNALN AVLGCCVVST LVIHALEINL DALIIYANGI FIMIYLLCML
AGCKLLQGRY RLLAVVGGLL CVLLLAMVGW KSLYALIMLA GLWLFLPKRK TPENGITT
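
The two sequences above can be related to each other with a short script. The following is a sketch, not the database's own method: it strips the spaces between the 10-character blocks, computes a GC fraction, and translates codons until the first stop. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, which this sketch ignores), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; amino acid assignments of the
# standard code, which translation table 11 is assumed to share.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGAGTGGAC TCAAACAAGA ACTGGGGCTG"))  # MSGLKQELGL
```

Passing the full gene sequence to `translate` and `gc_fraction` would let a reader check the result against the listed protein sequence and the GC content field.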