Gene ECH74115_4839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4839 
SymbolpitA 
ID6967768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4475204 
End bp4476703 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content54% 
IMG OID643388530 
Productlow-affinity inorganic phosphate transporter 1 
Protein accessionYP_002272958 
Protein GI209397927 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0306] Phosphate/sulphate permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACATT TGTTTGCTGG CCTGGATTTG CATACCGGGC TGTTATTATT GCTTGCACTG 
GCTTTTGTGC TGTTCTACGA AGCCATCAAT GGTTTCCATG ACACAGCCAA CGCCGTGGCA
ACCGTTATCT ATACCCGCGC GATGCGTTCT CAGCTCGCCG TGGTTATGGC AGCGGTGTTC
AACTTTTTGG GTGTTTTGCT GGGTGGTCTG AGTGTTGCCT ATGCCATTGT GCATATGCTG
CCGACGGATC TGCTGCTTAA TATGGGATCG TCTCATGGCC TTGCCATGGT GTTCTCTATG
TTGCTGGCGG CGATTATCTG GAACCTGGGT ACCTGGTACT TTGGTTTGCC TGCATCCAGC
TCTCATACGC TGATTGGCGC GATCATCGGG ATTGGTTTAA CCAATGCGTT GATGACCGGG
ACGTCAGTGG TGGATGCACT CAATATCCCG AAAGTATTAA GTATTTTCGG TTCTCTGATC
GTTTCCCCTA TTGTCGGCCT GGTGTTTGCT GGCGGTCTGA TTTTCTTGCT GCGTCGCTAC
TGGAGCGGCA CCAAGAAACG CGCCCGTATC CACCTGACCC CAGCGGAGCG TGAAAAGAAA
GACGGCAAGA AAAAGCCGCC GTTCTGGACG CGTATCGCGC TGATCCTTTC CGCTATCGGC
GTGGCGTTTT CGCACGGCGC AAACGATGGT CAGAAAGGCA TTGGTCTGGT TATGTTGGTA
TTGATTGGTG TCGCACCAGC AGGCTTCGTG GTGAACATGA ATGCCACTGG CTACGAAATC
ACCCGTACCC GTGATGCCAT CAACAACGTC GAAGCTTACT TTGAGCAGCA TCCTGCGCTG
CTGAAACAAG CTACTGGTGC TGATCAGTTA GTACCGGCTC CGGAAGCTGG CGCAACGCAA
CCTGCGGAGT TCCACTGCCA TCCGTCGAAT ACCATTAACG CGCTCAACCG CCTGAAAGGT
ATGTTGACCA CCGATGTGGA AAGCTACGAC AAGCTGTCGC TTGATCAACG TAGCCAGATG
CGCCGCATTA TGCTGTGCGT TTCTGACACT ATCGACAAAG TGGTGAAGAT GCCTGGCGTG
AGTGCTGACG ATCAGCGCCT GTTGAAGAAA CTGAAGTCCG ACATGCTTAG CACCATCGAG
TATGCACCGG TGTGGATCAT CATGGCGGTC GCGCTGGCGT TAGGTATCGG TACGATGATT
GGCTGGCGTC GTGTGGCAAC GACTATCGGT GAGAAAATCG GTAAGAAAGG CATGACCTAC
GCTCAGGGGA TGTCTGCCCA GATGACTGCG GCAGTGTCTA TCGGCCTGGC GAGTTATACC
GGGATGCCGG TTTCCACTAC TCACGTACTC TCCTCTTCTG TCGCGGGGAC GATGGTGGTA
GATGGTGGCG GCTTACAGCG TAAAACCGTG ACCAGCATTC TGATGGCCTG GGTGTTTACC
CTTCCGGCTG CGGTACTGCT TTCCGGCGGG CTTTACTGGC TCTCCTTGCA GTTCCTGTAA
 
Protein sequence
MLHLFAGLDL HTGLLLLLAL AFVLFYEAIN GFHDTANAVA TVIYTRAMRS QLAVVMAAVF 
NFLGVLLGGL SVAYAIVHML PTDLLLNMGS SHGLAMVFSM LLAAIIWNLG TWYFGLPASS
SHTLIGAIIG IGLTNALMTG TSVVDALNIP KVLSIFGSLI VSPIVGLVFA GGLIFLLRRY
WSGTKKRARI HLTPAEREKK DGKKKPPFWT RIALILSAIG VAFSHGANDG QKGIGLVMLV
LIGVAPAGFV VNMNATGYEI TRTRDAINNV EAYFEQHPAL LKQATGADQL VPAPEAGATQ
PAEFHCHPSN TINALNRLKG MLTTDVESYD KLSLDQRSQM RRIMLCVSDT IDKVVKMPGV
SADDQRLLKK LKSDMLSTIE YAPVWIIMAV ALALGIGTMI GWRRVATTIG EKIGKKGMTY
AQGMSAQMTA AVSIGLASYT GMPVSTTHVL SSSVAGTMVV DGGGLQRKTV TSILMAWVFT
LPAAVLLSGG LYWLSLQFL