Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4565 |
Symbol | phnI |
ID | 6144936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4665096 |
End bp | 4666160 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641619381 |
Product | phosphonate metabolism protein PhnI |
Protein accession | YP_001746493 |
Protein GI | 170680400 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3626] Uncharacterized enzyme of phosphonate metabolism |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGTTG CCGTGAAAGG GGGCGAGAAG GCGATCGACG CCGCCCACGC CCTGCAAGAG AGCCGACGCC GGGGCAATAC CGATTTGCCT GAACTGAGCG TCGCCCAGAT TGAACAGCAG CTTAATCTTG CAGTAGATCG CGTGATGACC GAAGGCGGCA TTGCCGACCG CGAACTGGCG GCGCTGGCGC TGAAACAGGC CAGCGGCGAT AACGTCGAAG CGATTTTCCT GCTACGCGCC TACCGCACCA CGCTGGCGAA GCTTGCGGTG AGCGAGCCGC TCGACACCAC GGAGATGCGT CTCGAACGGC GTATTTCAGC GGTTTATAAA GATATTCCCG GCGGCCAGCT GCTTGGCCCA ACCTACGACT ACACCCATCG CCTGCTCGAT TTCACCCTGC TGGCAAACGG CGAAGCGCCG ACGCTGACCA CAGCCGACAG CGAACAACAG CCGTCGCCGC ACGTTTTCAG CCTGCTGGCG CGTCAGGGGC TGGCGAAGTT TGAAGAAGAT AGCGGCGCAC AGCCGGACGA CATCACCCGC ACGCCGCCGG TTTACCCCTG CTCACGCTCC TCCCGCCTGC AACAATTAAT GCGCGGCGAC GAAGGCTATT TGCTGGCGCT GGCCTACTCC ACCCAGCGTG GTTACGGACG CAATCATCCG TTCGCAGGCG AAATCCGCAG CGGCTATATC GACGTGTCGA TTGTGCCGGA AGAGCTGGGA TTTGCGGTGA ACGTCGGCGA ACTGCTGATG ACCGAATGTG AAATGGTCAA CGGTTTTATC GACCCGCCGG ATGAGCCGCC GCACTTCACG CGCGGCTACG GGCTGGTGTT CGGCATGAGC GAGCGCAAAG CGATGGCGAT GGCGCTGGTT GACCGCGCTC TACAAGCCCC GGAGTACGGC GAGCACGCGA CAGGCCCGGC GCAGGATGAA GAGTTCGTGC TGGCGCATGC CGACAACGTC GAAGCCGCAG GCTTTGTCTC GCACCTTAAA CTCCCGCACT ACGTCGATTT CCAGGCCGAA CTGGAGCTAC TCAAACGTCT GCAACAGGAG CGGAACCATG GCTAA
|
Protein sequence | MYVAVKGGEK AIDAAHALQE SRRRGNTDLP ELSVAQIEQQ LNLAVDRVMT EGGIADRELA ALALKQASGD NVEAIFLLRA YRTTLAKLAV SEPLDTTEMR LERRISAVYK DIPGGQLLGP TYDYTHRLLD FTLLANGEAP TLTTADSEQQ PSPHVFSLLA RQGLAKFEED SGAQPDDITR TPPVYPCSRS SRLQQLMRGD EGYLLALAYS TQRGYGRNHP FAGEIRSGYI DVSIVPEELG FAVNVGELLM TECEMVNGFI DPPDEPPHFT RGYGLVFGMS ERKAMAMALV DRALQAPEYG EHATGPAQDE EFVLAHADNV EAAGFVSHLK LPHYVDFQAE LELLKRLQQE RNHG
|
| |