Gene EcHS_A4345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4345 
SymbolphnD 
ID5594419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4347854 
End bp4348870 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content54% 
IMG OID640923443 
Productphosphonate ABC transporter, periplasmic phosphonate-binding protein 
Protein accessionYP_001460888 
Protein GI157163570 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3221] ABC-type phosphate/phosphonate transport system, periplasmic component 
TIGRFAM ID[TIGR01098] phosphate/phosphite/phosphonate ABC transporters, periplasmic binding protein
[TIGR03431] phosphonate ABC transporter, periplasmic phosphonate binding protein 


Plasmid Coverage information

Num covering plasmid clones80 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCTA AGATAATTGC CTCGCTGGCC TTCACCAGCA TGTTCAGCCT CAGCACCCTG 
TTAAGCCCGG CACACGCCGA AGAGCAGGAA AAGGCGCTGA ATTTCGGCAT TATTTCAACG
GAATCACAGC AAAACCTTAA ACCGCAATGG ACGCCATTCT TACAGGATAT GGAGAAGAAG
CTGGGCGTGA AGGTGAACGC CTTCTTTGCC CCAGACTACG CAGGCATTAT CCAGGGAATG
CGCTTCAATA AAGTGGATAT CGCCTGGTAC GGCAACCTGT CGGCAATGGA AGCGGTGGAT
CGCGCCAACG GCCAAGTCTT CGCCCAGACG GTCGCGGCGG ATGGATCGCC AGGTTACTGG
AGCGTGTTGA TCGTCAACAA AGATAGTCCG ATCAACAACC TGAACGATCT GCTGGCGAAG
CGGAAAGATC TCACCTTCGG CAATGGCGAT CCTAACTCCA CCTCTGGCTT CCTCGTCCCC
GGTTACTACG TCTTTGCCAA AAACAATATC TCCGCCAGCG ACTTTAAGCG CACCGTCAAC
GCTGGGCATG AAACCAACGC GCTGGCCGTC GCCAACAAGC AGGTGGATGT TGCCACCAAC
AACACCGAAA ACCTCGACAA GCTGAAAACC TCCGCGCCAG AGAAGCTGAA AGAACTGAAG
GTGATCTGGA AGTCGCCGCT GATCCCAGGC GATCCGATCG TCTGGCGCAA GAATCTCTCC
GAAACCACCA AAGACAAGAT CTACGACTTC TTTATGAACT ACGGAAAAAC GCCGGAAGAG
AAAGCGGTGC TGGAACGCCT GGGCTGGGCC CCGTTCCGCG CCTCCAGCGA CCTGCAACTG
GTGCCGATTC GCCAGCTCGC ACTGTTTAAA GAGATGCAGG GCGTAAAAAG CAATAAAGGA
CTGAATGAGC AGGACAAGCT GGCGAAAACC ACCGAGATTC AGGCGCAGCT GGATGACCTG
GACCGCCTGA ACAACGCGTT AAGCGCGATG AGTTCGGTGA GTAAAGCGGT GCAGTAA
 
Protein sequence
MNAKIIASLA FTSMFSLSTL LSPAHAEEQE KALNFGIIST ESQQNLKPQW TPFLQDMEKK 
LGVKVNAFFA PDYAGIIQGM RFNKVDIAWY GNLSAMEAVD RANGQVFAQT VAADGSPGYW
SVLIVNKDSP INNLNDLLAK RKDLTFGNGD PNSTSGFLVP GYYVFAKNNI SASDFKRTVN
AGHETNALAV ANKQVDVATN NTENLDKLKT SAPEKLKELK VIWKSPLIPG DPIVWRKNLS
ETTKDKIYDF FMNYGKTPEE KAVLERLGWA PFRASSDLQL VPIRQLALFK EMQGVKSNKG
LNEQDKLAKT TEIQAQLDDL DRLNNALSAM SSVSKAVQ