Gene EcHS_A3159 details

Gene Information

Locus tag: EcHS_A3159
Symbol:
ID: 5593631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3172316
End bp: 3173395
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 51%
IMG OID: 640922279
Product: YjgP/YjgQ permease
Protein accession: YP_001459777
Protein GI: 157162459
COG category: [R] General function prediction only
COG ID: [COG0795] Predicted permeases
TIGRFAM ID:
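
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3172316
END_BP = 3173395


def gene_length(start_bp, end_bp):
    """Length in bp of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, "bp")
    print(protein_length(length_bp), "aa")
```

Under these assumptions the results agree with the Gene Length (1080 bp) and Protein Length (359 aa) fields.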


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGTTT TCAGTCGCTA TTTAATCCGT CATCTCTTTC TCGGTTTTGC CGCCGCCGCA 
GGGCTATTGC TGCCGCTTTT TACCACCTTC AACCTGATTA ACGAACTGGA TGATGTCAGC
CCGGGCGGTT ATCGCTGGAC TCAAGCGGTG CTGGTGGTGC TAATGACCTT ACCGCGCACA
CTGGTCGAAC TTTCGCCGTT TATCGCCTTA TTGGGAGGGA TTGTTGGCCT GGGGCAGTTA
TCGAAAAACA GTGAGCTTAC CGCCATTCGC AGCACGGGGT TTTCTATCTT CCGTATTGCA
CTGGTGGCGC TGGTTGCCGG GATATTGTGG ACTGTTTCGT TAGGCGCGAT AGATGAGTGG
GTGGCGTCGC CATTGCAGCA ACAGGCGCTG CAAATCAAAT CGACCGCCAC CGCGTTGGGG
GAGGACGATG ACATTACCGG CAATATGCTG TGGGCCAGGC GCGGCAATGA ATTTGTGACG
GTGAAATCGC TGAATGAACA GGGCCAGCCT GTGGGCGTGG AGATTTTTCA TTATCGTGAC
GATCTTTCGC TCGAATCCTA CATTTATGCA CGCAGTGCCA CCATTAAAGA CGACAAAACG
TGGATCCTGC ATGGTGTGAA TCATAAAAAA TGGCTTAACG GTAAAGAAAC GCTGGAAACA
TCAGATAATC TTGCCTGGCA ATCGGCCTTC ACCAGTATGG ATCTTGATGA GTTATCGATG
CCGGGGAATA CTTTTTCTGT CCGTCAGCTT AATCATTACA TCCATTATTT GCAGGAAACC
GGGCAACCCA GCAGCGAATA CCGCCTTGCA CTGTGGGAAA AACTGGGGCA ACCGATCCTG
ACCCTGGCGA TGATTTTGCT GGCTGTGCCG TTTACCTTTA GCGCCCCGCG CTCGCCAGGG
ATGGGTAGCC GTCTCGCTGT AGGCGTCATC GTTGGCTTAC TCACCTGGAT CAGCTATCAA
ATCATGGTCA ATTTGGGATT GCTATTTGCG TTAAGCGCAC CTGTTACCGC GCTCGGTTTA
CCGGTAGCGT TTGTGTTGGT GGCGTTGAGC CTGGTGTATT GGTATGACAG ACAACATTAA
 
Protein sequence
MNVFSRYLIR HLFLGFAAAA GLLLPLFTTF NLINELDDVS PGGYRWTQAV LVVLMTLPRT 
LVELSPFIAL LGGIVGLGQL SKNSELTAIR STGFSIFRIA LVALVAGILW TVSLGAIDEW
VASPLQQQAL QIKSTATALG EDDDITGNML WARRGNEFVT VKSLNEQGQP VGVEIFHYRD
DLSLESYIYA RSATIKDDKT WILHGVNHKK WLNGKETLET SDNLAWQSAF TSMDLDELSM
PGNTFSVRQL NHYIHYLQET GQPSSEYRLA LWEKLGQPIL TLAMILLAVP FTFSAPRSPG
MGSRLAVGVI VGLLTWISYQ IMVNLGLLFA LSAPVTALGL PVAFVLVALS LVYWYDRQH
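
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a hedged sketch rather than the method used to produce the record. It builds the codon table from the amino acid string that NCBI lists for translation table 11, whose assignments match the standard code, and it does not handle alternative start codons or ambiguous bases. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in NCBI ordering: bases T, C, A, G at each codon position.
# The amino acid string is the one listed for translation table 11
# (same residue assignments as the standard genetic code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so the space-separated 10-base blocks used in
    this record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGAATGTTT TCAGTCGCTA TTTAATCCGT"))
```

Passing the first 30 bases of the gene sequence yields the first ten residues of the protein sequence, and the final codon of the gene (TAA) is read as a stop, which is why the protein has one residue fewer than the number of codons.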