Gene EcSMS35_4565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4565 
SymbolphnI 
ID6144936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4665096 
End bp4666160 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content62% 
IMG OID641619381 
Productphosphonate metabolism protein PhnI 
Protein accessionYP_001746493 
Protein GI170680400 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3626] Uncharacterized enzyme of phosphonate metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGTTG CCGTGAAAGG GGGCGAGAAG GCGATCGACG CCGCCCACGC CCTGCAAGAG 
AGCCGACGCC GGGGCAATAC CGATTTGCCT GAACTGAGCG TCGCCCAGAT TGAACAGCAG
CTTAATCTTG CAGTAGATCG CGTGATGACC GAAGGCGGCA TTGCCGACCG CGAACTGGCG
GCGCTGGCGC TGAAACAGGC CAGCGGCGAT AACGTCGAAG CGATTTTCCT GCTACGCGCC
TACCGCACCA CGCTGGCGAA GCTTGCGGTG AGCGAGCCGC TCGACACCAC GGAGATGCGT
CTCGAACGGC GTATTTCAGC GGTTTATAAA GATATTCCCG GCGGCCAGCT GCTTGGCCCA
ACCTACGACT ACACCCATCG CCTGCTCGAT TTCACCCTGC TGGCAAACGG CGAAGCGCCG
ACGCTGACCA CAGCCGACAG CGAACAACAG CCGTCGCCGC ACGTTTTCAG CCTGCTGGCG
CGTCAGGGGC TGGCGAAGTT TGAAGAAGAT AGCGGCGCAC AGCCGGACGA CATCACCCGC
ACGCCGCCGG TTTACCCCTG CTCACGCTCC TCCCGCCTGC AACAATTAAT GCGCGGCGAC
GAAGGCTATT TGCTGGCGCT GGCCTACTCC ACCCAGCGTG GTTACGGACG CAATCATCCG
TTCGCAGGCG AAATCCGCAG CGGCTATATC GACGTGTCGA TTGTGCCGGA AGAGCTGGGA
TTTGCGGTGA ACGTCGGCGA ACTGCTGATG ACCGAATGTG AAATGGTCAA CGGTTTTATC
GACCCGCCGG ATGAGCCGCC GCACTTCACG CGCGGCTACG GGCTGGTGTT CGGCATGAGC
GAGCGCAAAG CGATGGCGAT GGCGCTGGTT GACCGCGCTC TACAAGCCCC GGAGTACGGC
GAGCACGCGA CAGGCCCGGC GCAGGATGAA GAGTTCGTGC TGGCGCATGC CGACAACGTC
GAAGCCGCAG GCTTTGTCTC GCACCTTAAA CTCCCGCACT ACGTCGATTT CCAGGCCGAA
CTGGAGCTAC TCAAACGTCT GCAACAGGAG CGGAACCATG GCTAA
 
Protein sequence
MYVAVKGGEK AIDAAHALQE SRRRGNTDLP ELSVAQIEQQ LNLAVDRVMT EGGIADRELA 
ALALKQASGD NVEAIFLLRA YRTTLAKLAV SEPLDTTEMR LERRISAVYK DIPGGQLLGP
TYDYTHRLLD FTLLANGEAP TLTTADSEQQ PSPHVFSLLA RQGLAKFEED SGAQPDDITR
TPPVYPCSRS SRLQQLMRGD EGYLLALAYS TQRGYGRNHP FAGEIRSGYI DVSIVPEELG
FAVNVGELLM TECEMVNGFI DPPDEPPHFT RGYGLVFGMS ERKAMAMALV DRALQAPEYG
EHATGPAQDE EFVLAHADNV EAAGFVSHLK LPHYVDFQAE LELLKRLQQE RNHG