Gene ECH74115_4299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4299 
Symbol 
ID6966972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3978491 
End bp3979807 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content52% 
IMG OID643388028 
Productprobable low-affinity inorganic phosphate transporter 2 
Protein accessionYP_002272466 
Protein GI209399859 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0306] Phosphate/sulphate permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAAATT TATTTGTTGG CCTTGATATA TACACAGGGC TTTTGTTATT ACTTGCTCTG 
GCATTTGTGT TGTTCTACGA AGCAATCAAT GGTTTTCATG ACACGGCGAA TGCGGTGGCA
ACCGTTATTT ATACTCGTGC CATGCAGCCA CAGCTTGCCG TGGTGATGGC GGCATTTTTT
AACTTTTTTG GCGTGTTATT GGGCGGACTT AGCGTTGCCT ATGCCATTGT CCATATGTTG
CCAACCGATT TGTTGCTGAA TATGGGGTCA ACCCACGGCC TGGCGATGGT CTTTTCCATG
CTGCTGGCGG CGATTATCTG GAACCTGGGA ACGTGGTTCT TTGGTTTACC GGCCTCCAGT
TCGCATACCT TGATTGGCGC GATTATCGGC ATCGGTTTAA CCAACGCGCT GTTAACCGGC
TCATCGGTGA TGGATGCGTT AAACCTGCGT GAAGTGACCA AAATTTTCTC CTCGCTGATT
GTTTCCCCTA TCGTCGGCCT GGTCATTGCG GGAGGCCTGA TATTCCTGCT GCGACGCTAC
TGGAGCGGAA CGAAAAAGCG TGACCGTATT CACCGCATTC CGGAAGATCG CAAAAAGAAA
AAAGGCAAAC GTAAGCCGCC ATTCTGGACG CGTATTGCGC TGATTGTTTC CGCTGCGGGC
GTGGCGTTTT CGCACGGCGC GAACGACGGA CAAAAAGGGA TCGGCCTGGT GATGCTGGTA
CTGGTGGGGA TTGCCCCTGC TGGCTTCGTC GTCAATATGA ACGCGTCCGG CTATGAAATT
ACCCGTACCC GCGATGCCGT TACCAACTTC GAACACTACC TGCAACAGCA TCCTGAACTG
CCGCAGAAGT TGATTGCGAT GGAACCTCCA TTGCCTGCGG CATCTACTGA TGGCGCGCAA
GTGACGGAGT TTCACTGTCA TCCGGGAAAT ACCTTTGATG CGATTGCGCG CGTTAAAACG
ATGCTGCCAG GCAATATGGA AAGTTACGAG CCGTTAAGCG TGAGTCAGCG CAGCCAGCTG
CGCCGCATTA TGCTGTGCAT CTCTGATACT TCCGCGAAGC TGGCGAAACT GCCAGGCGTC
AGTAAAGAAG ACCAGAACCT GCTGAAAAAA CTGCGCAGCG ATATGTTAAG CACCATTGAG
TACGCTCCGG TGTGGATCAT CATGGCAGTA GCACTGGCGC TCGGCATTGG CACCATGATT
GGCTGGCGTC GTGTTGCGAT GACCATCGGT GAGAAGCCTT TTTTAATATC GTATTGTGTT
CCTCCAGGCG GCGAACCTGC TTTTCCAGTT CGCGGATACG TTGCTGGTCT GGAGTAA
 
Protein sequence
MLNLFVGLDI YTGLLLLLAL AFVLFYEAIN GFHDTANAVA TVIYTRAMQP QLAVVMAAFF 
NFFGVLLGGL SVAYAIVHML PTDLLLNMGS THGLAMVFSM LLAAIIWNLG TWFFGLPASS
SHTLIGAIIG IGLTNALLTG SSVMDALNLR EVTKIFSSLI VSPIVGLVIA GGLIFLLRRY
WSGTKKRDRI HRIPEDRKKK KGKRKPPFWT RIALIVSAAG VAFSHGANDG QKGIGLVMLV
LVGIAPAGFV VNMNASGYEI TRTRDAVTNF EHYLQQHPEL PQKLIAMEPP LPAASTDGAQ
VTEFHCHPGN TFDAIARVKT MLPGNMESYE PLSVSQRSQL RRIMLCISDT SAKLAKLPGV
SKEDQNLLKK LRSDMLSTIE YAPVWIIMAV ALALGIGTMI GWRRVAMTIG EKPFLISYCV
PPGGEPAFPV RGYVAGLE