Gene EcSMS35_3264 details

Gene Information

Locus tag: EcSMS35_3264
Symbol:
ID: 6144368
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3342864
End bp: 3343943
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 52%
IMG OID: 641618094
Product: YjgP/YjgQ permease
Protein accession: YP_001745244
Protein GI: 170681358
COG category: [R] General function prediction only
COG ID: [COG0795] Predicted permeases
TIGRFAM ID:
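
The start, end, gene length, and protein length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which matches the reported 1080 bp) and that the final codon of the CDS is a stop codon. The function names `span_length` and `protein_length` are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = span_length(3342864, 3343943)
print(gene_len)                  # 1080
print(protein_length(gene_len))  # 359
```

Both values agree with the Gene Length and Protein Length fields in the record.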


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.917388
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGTTT TCAGTCGCTA TTTAATCCGT CATCTCTTTC TCGGTTTTGC CGCCGCCGCA 
GGGCTATTGC TGCCGCTTTT TACCACCTTC AACCTGATTA ACGAACTGGA TGATGTCAGC
CCGGGCGGTT ATCGCTGGAC TCTGGCGGTG CTGGTGGTGC TGATGACCTT ACCGCGTACA
CTGGTCGAAC TTTCGCCATT TATCGCATTA TTGGGAGGGA TTGTTGGCCT GGGGCAGTTA
TCGAAAAACA GTGAGCTTAC CGCCATTCGC AGCACTGGGT TTTCTATCTT CCGTATTGCA
CTGGTGGCGC TGGTTGCCGG GATATTGTGG ACTGTTTCGT TAGGCGCGAT AGATGAGTGG
GTGGCGTCGC CATTGCAGCA GCAGGCGCTG CAAATCAAAT CGACTGCCAC CGCGTTGGGG
GAGGACGATG ACATTACCGG CAATATGCTT TGGGCCAGGC GCGGTAATGA ATTTGTGACG
GTGAAATCGC TGAACGAGCA GGGCCAGCCT GTGGGCGTGG AGATATTTCA TTATCGCGAC
GATCTCTCGC TCGAATCCTA CATTTTTGCA CGCAGTGCCT CCATTGAAGA CGACAAAACG
TGGATCCTGC ATGGTGTGAA TCATAAAAAA TGGTTGAATG GCAAAGAAAC GCTGGAAACA
TCAGATAATC TTGCCTGGCA ATCGGCCTTC ACCAGTATGG ATCTTGAAGA GTTATCGATG
CCGGGGAATA CTTTTTCTGT CCGTCAGCTT AATCATTACA TCCATTATTT GCAGGAAACC
GGACAACCCA GCAGCGAATA CCGCCTTGCA CTGTGGGAAA AACTGGGGCA ACCGATCCTG
ACCCTGGCGA TGATTTTGCT GGCTGTGCCG TTCACCTTTA GCGCCCCGCG CTCGCCAGGG
GTGGGTAGCC GTCTCGCTGT AGGCGTCATC GTTGGCTTAC TCACCTGGAT CAGCTATCAA
ATCATGGTCA ATCTGGGATT GTTATTTGCG TTGAGCGCAC CTGTTACCGC GCTCGGTTTA
CCGGTAGCGT TTGTGTTGGT GGCGTTGAGC CTGGTGTATT GGTATGACAG ACAACATTAA
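
The GC content field in the record can in principle be recomputed from the gene sequence. The following is a minimal sketch of that calculation; `gc_fraction` is a hypothetical helper, and the input here is only the first 60-base row of the sequence above, used to keep the example short.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record (60 bases).
first_row = "ATGAATGTTT TCAGTCGCTA TTTAATCCGT CATCTCTTTC TCGGTTTTGC CGCCGCCGCA"
print(f"{gc_fraction(first_row):.1%}")
```

A single row will not necessarily match the whole-gene figure of 52%; passing the full 1080-base sequence to the same function gives the gene-level value.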
 
Protein sequence
MNVFSRYLIR HLFLGFAAAA GLLLPLFTTF NLINELDDVS PGGYRWTLAV LVVLMTLPRT 
LVELSPFIAL LGGIVGLGQL SKNSELTAIR STGFSIFRIA LVALVAGILW TVSLGAIDEW
VASPLQQQAL QIKSTATALG EDDDITGNML WARRGNEFVT VKSLNEQGQP VGVEIFHYRD
DLSLESYIFA RSASIEDDKT WILHGVNHKK WLNGKETLET SDNLAWQSAF TSMDLEELSM
PGNTFSVRQL NHYIHYLQET GQPSSEYRLA LWEKLGQPIL TLAMILLAVP FTFSAPRSPG
VGSRLAVGVI VGLLTWISYQ IMVNLGLLFA LSAPVTALGL PVAFVLVALS LVYWYDRQH
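
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a hedged illustration, the sketch below builds a codon table from the standard amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code in its permitted start codons, not in how internal codons are read). The names `CODON_TABLE` and `translate_cds` are hypothetical, and only short fragments of the gene are translated here.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence in the record.
print(translate_cds("ATGAATGTTT TCAGTCGCTA TTTAATCCGT"))  # MNVFSRYLIR
print(translate_cds("TATGACAGACAACATTAA"))                # YDRQH
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above.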