Gene EcSMS35_4570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4570 
SymbolphnD 
ID6144862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4668784 
End bp4669800 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content55% 
IMG OID641619386 
Productphosphonate ABC transporter, periplasmic phosphonate-binding protein 
Protein accessionYP_001746498 
Protein GI170681158 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3221] ABC-type phosphate/phosphonate transport system, periplasmic component 
TIGRFAM ID[TIGR01098] phosphate/phosphite/phosphonate ABC transporters, periplasmic binding protein
[TIGR03431] phosphonate ABC transporter, periplasmic phosphonate binding protein 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTA AGATAATTGC CTCGCTGGCC TTCACCAGCA TGTTCAGCCT CAGCACCCTG 
TTAAGCCCGG CGCACGCCGA AGAGCAGGAA AAGGCGCTGA ATTTCGGCAT TATTTCAACG
GAATCACAGC AAAACCTGAA ACCGCAATGG ACGCCGTTCT TACAGGATAT GGAGAAGAAG
CTGGGCGTAA AGGTCAACGC CTTCTTTGCC CCGGACTACG CGGGCATTAT CCAGGGGATG
CGCTTCAATA AAGTGGATAT CGCCTGGTAC GGCAACCTGT CGGCGATGGA AGCGGTGGAT
CGCGCCAACG GTCAGGTCTT CGCCCAGACA GTCGCGGCGG ATGGATCGCC AGGTTACTGG
AGCGTGTTGA TCGTCAACAA AGACAGCCCG ATCAACAACC TGAACGATCT GCTGGCGAAG
CGTAAAGATC TCACCTTTGG CAATGGCGAT CCTAACTCCA CCTCTGGCTT CCTCGTCCCC
GGTTACTACG TCTTTGCCAA AAACAATATC TCCGCCAGCG ACTTCAAGCG CACCGTCAAC
GCCGGGCATG AAACCAATGC GCTGGCCGTC GCCAACAAGC AAGTGGATGT GGCGACCAAC
AATACCGAAA ACCTCGACAA GCTGAAAACC TCCGCACCAG AGAAGCTCAA AGAACTGAAG
GTGATCTGGA AATCGCCGCT GATCCCAGGC GATCCGATCG TCTGGCGCAA AAATCTTTCC
GAAACTACCA AAGACAAGAT CTACGACTTC TTTATGAATT ACGGCAAGAC GCCGGAAGAG
AAAGCGGTGC TGGAACGCCT GGGCTGGGCC CCGTTCCGCG CCTCCAGCGA CCTGCAACTG
GTGCCGATTC GCCAGCTCGC ACTGTTTAAA GAGATGCAGG GCGTGAAAAG CAATAAAGGG
CTGAATGAGC AGGACAAGCT GGCGAAAACC ACCGCGATTC AGGCGCAGCT GGATGACCTG
GACCGCCTGA ACAACGCGTT AAGCGCGATG AGTTCGGTAA GTAAAGCGGT GCAGTAA
 
Protein sequence
MNAKIIASLA FTSMFSLSTL LSPAHAEEQE KALNFGIIST ESQQNLKPQW TPFLQDMEKK 
LGVKVNAFFA PDYAGIIQGM RFNKVDIAWY GNLSAMEAVD RANGQVFAQT VAADGSPGYW
SVLIVNKDSP INNLNDLLAK RKDLTFGNGD PNSTSGFLVP GYYVFAKNNI SASDFKRTVN
AGHETNALAV ANKQVDVATN NTENLDKLKT SAPEKLKELK VIWKSPLIPG DPIVWRKNLS
ETTKDKIYDF FMNYGKTPEE KAVLERLGWA PFRASSDLQL VPIRQLALFK EMQGVKSNKG
LNEQDKLAKT TAIQAQLDDL DRLNNALSAM SSVSKAVQ