Gene ECH74115_5617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5617 
SymbolphnD 
ID6969384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5253383 
End bp5254399 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content55% 
IMG OID643389252 
Productphosphonate ABC transporter, periplasmic phosphonate-binding protein 
Protein accessionYP_002273649 
Protein GI209399090 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3221] ABC-type phosphate/phosphonate transport system, periplasmic component 
TIGRFAM ID[TIGR01098] phosphate/phosphite/phosphonate ABC transporters, periplasmic binding protein
[TIGR03431] phosphonate ABC transporter, periplasmic phosphonate binding protein 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTA AGATAATTGC CTCGCTGGCC TTCACCAGCA TGTTCAGCCT CAGCACCCTG 
TTAAGCCCGG CACACGCCGA AGAGCAGGAA AAGGCGCTGA ATTTCGGCAT TATTTCAACG
GAATCACAGC AAAACCTGAA ACCGCAATGG ACGCCGTTTT TACAGGATAT GGAGAAGAAG
CTGGGCGTGA AGGTCAACGC CTTCTTTGCC CCGGACTATG CGGGCATTAT CCAGGGAATG
CGTTTCAACA AAGTGGATAT CGCCTGGTAC GGCAACCTGT CGGCGATGGA AGCAGTGGAT
CGCGCCAACG GCCAAGTCTT CGCTCAGACC GTCGCGGCGG ATGGATCGCC GGGTTACTGG
AGCGTGTTGA TCGTCAACAA AGACAGCCCG ATCAACAACC TGAACGATCT GCTGGCGAAG
CGGAAAGATC TCACCTTCGG TAATGGCGAT CCTAACTCCA CCTCTGGCTT CCTCGTCCCC
GGCTACTACG TCTTCTCCAA AAACAATATC TCCGCCAGTG ACTTCAAGCG CACCGTCAAC
GCCGGGCATG AAACCAACGC GCTGGCCGTC GCCAACAAGC AGGTGGATGT GGCGACCAAC
AACACCGAAA ACCTCGACAA GCTGAAAACC TCCGCGCCGG AGAAGCTGAA AGAACTGAAA
GTGATCTGGA AGTCACCGCT GATCCCAGGC GATCCGATCG TCTGGCGTAA AAACCTTTCC
GAAACCACCA AAGACAAGAT CTACGACTTC TTTATGAATT ACGGCAAGAC GCCGGAAGAG
AAAGCGGTGC TGGAACGCCT GGGCTGGGCG CCGTTCCGCG CCTCCAGCGA CCTGCAACTG
GTGCCGATTC GCCAGCTCGC ACTGTTTAAA GAGATGCAAA GCGTGAAAGG CAATAAAGGG
CTGAATGAGC AGGACAAGCT GGCAAAAACC ACCGAGATTC AAGCGCAGCT GGATGACCTG
GACCGCCTGA ACAACGCGTT AAGCGCGATG AGTTCGGTAA GTAAAGCGAT GCAGTAA
 
Protein sequence
MNAKIIASLA FTSMFSLSTL LSPAHAEEQE KALNFGIIST ESQQNLKPQW TPFLQDMEKK 
LGVKVNAFFA PDYAGIIQGM RFNKVDIAWY GNLSAMEAVD RANGQVFAQT VAADGSPGYW
SVLIVNKDSP INNLNDLLAK RKDLTFGNGD PNSTSGFLVP GYYVFSKNNI SASDFKRTVN
AGHETNALAV ANKQVDVATN NTENLDKLKT SAPEKLKELK VIWKSPLIPG DPIVWRKNLS
ETTKDKIYDF FMNYGKTPEE KAVLERLGWA PFRASSDLQL VPIRQLALFK EMQSVKGNKG
LNEQDKLAKT TEIQAQLDDL DRLNNALSAM SSVSKAMQ