Gene EcSMS35_3265 details


Gene Information

Locus tag: EcSMS35_3265
Symbol:
ID: 6145542
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3343940
End bp: 3345010
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 51%
IMG OID: 641618095
Product: YjgP/YjgQ permease
Protein accession: YP_001745245
Protein GI: 170683865
COG category: [R] General function prediction only
COG ID: [COG0795] Predicted permeases
TIGRFAM ID:
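
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported 1071 bp) and that the protein length excludes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3343940
END_BP = 3345010
GENE_LENGTH_BP = 1071
PROTEIN_LENGTH_AA = 356


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_bp: int) -> int:
    """Number of residues encoded, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)          # True
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```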


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.342085
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTGG TTGAACACTA CATCATGCGT GGTACGCGCC GTCTGGTGCT GATTATCGTC 
GGTTTTCTGA TCTTTATTTT CGCCAGCTAC TCCGCACAGC GTTATCTGAC CGAAGCGGCA
AATGGGACCT TAGCTCTTGA TGTTGTGCTG GATATCGTTT TCTACAAAGT GCTGATTGCA
CTGGAGATGT TGTTACCTGT TGGTCTGTAT GTGTCAGTTG GCGTGACGCT AGGGCAGATG
TATACCGACT CGGAAATTAC CGCTATCTCT GCGGCGGGGG GCAGTCCGGG ACGCTTGTAC
AAAGCCGTTC TTTATCTGGC GATACCGCTA AGTATTTTTG TCACCCTTCT GTCGATGTAT
GGTCGGCCGT GGGCTTATGC GCAGATTTAT CAACTGGAGC AACAGTCACA GTCGGAGCTG
GATGTTCGCC AGTTGCGGGC AAAGAAATTT AACACTAACG ATAACGGACG AATGATCCTT
TCGCAGACGG TTGATCAGGA TAATAATCGC CTGACTGACG CGCTGATTTA TACTTCTACT
GCCAATCGAA CCCGCATTTT CCGCGCCCGT TCGGTTGATG TGGTTGACCC ATCACCTGAG
AAACCGACCG TTATGTTGCA TAACGGGACC GCCTATCTTC TCGATCATCA GGGGCGTGAC
GACAACGAAC AGATCTACCG TAATCTGCAA TTACATCTGA ATCCGCTGGA TCAAAGCCCT
AACGTCAAAC GCAAAGCAAA ATCGGTCACG GAGCTGGCGC GCTCCGCCTT TCCTGCCGAT
CATGCCGAAC TGCAATGGCG ACAAAGCCGT GGCCTGACAG CATTGTTGAT GGCGCTGCTG
GCCATTTCAT TAAGTCGGGT AAAACCGCGG CAAGGGCGAT TTTCAACGTT ATTGCCACTG
ACGTTGCTGT TTGTTGCCAT TTTTTATGGC GGCGACGTCT GCCGTACGCT GGTGGCTAAC
GGTGCGATTC CCCTCATTCC TGGTTTGTGG TTAGTACCCG GACTCATGCT AATGGGCCTG
CTGATGCTGG TCGCACGCGA CTTCTCTTTG CTGCAGAAAT TTTCCCGATG A
 
Protein sequence
MKLVEHYIMR GTRRLVLIIV GFLIFIFASY SAQRYLTEAA NGTLALDVVL DIVFYKVLIA 
LEMLLPVGLY VSVGVTLGQM YTDSEITAIS AAGGSPGRLY KAVLYLAIPL SIFVTLLSMY
GRPWAYAQIY QLEQQSQSEL DVRQLRAKKF NTNDNGRMIL SQTVDQDNNR LTDALIYTST
ANRTRIFRAR SVDVVDPSPE KPTVMLHNGT AYLLDHQGRD DNEQIYRNLQ LHLNPLDQSP
NVKRKAKSVT ELARSAFPAD HAELQWRQSR GLTALLMALL AISLSRVKPR QGRFSTLLPL
TLLFVAIFYG GDVCRTLVAN GAIPLIPGLW LVPGLMLMGL LMLVARDFSL LQKFSR
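
The sequences above are printed in space-separated blocks of ten characters. A minimal Python sketch for working with them follows: it strips the block spacing, computes a GC fraction, and translates codons to residues. The helper names are hypothetical. The amino-acid assignments used are those shared by NCBI translation tables 1 and 11 (table 11 differs from the standard table only in its permitted start codons), and the demo uses only the first line of the gene sequence rather than the full record.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks; return upper-case sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at a stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence from the record above.
    first_line = "ATGAAGCTGG TTGAACACTA CATCATGCGT GGTACGCGCC GTCTGGTGCT GATTATCGTC"
    dna = clean_sequence(first_line)
    print(len(dna))           # 60
    print(translate(dna))     # MKLVEHYIMRGTRRLVLIIV
    print(round(gc_fraction(dna), 2))
```

The translated residues match the first two blocks of the protein sequence listed above. Running the same helpers on the full gene sequence would allow the reported GC content and protein length to be checked as well.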