Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03931 |
Symbol | phnI |
ID | 8114431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 4224876 |
End bp | 4225940 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644850084 |
Product | hypothetical protein |
Protein accession | YP_003001657 |
Protein GI | 251787353 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3626] Uncharacterized enzyme of phosphonate metabolism |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGTTG CCGTAAAAGG GGGCGAAAAG GCGATCGACG CCGCCCACGC CCTGCAAGAG AGCCGACGCC GGGGCGATAC CGATTTGCCT GAACTGAGCG TCGCCCAGAT TGAACAGCAG CTTAACCTCG CGGTAGATCG CGTGATGACC GAAGGCGGCA TTGCCGACCG CGAACTGGCG GCGCTGGCGC TGAAACAGGC CAGCGGCGAT AACGTTGAAG CGATTTTCCT GCTGCGCGCC TACCGCACCA CGTTGGCGAA GCTGGCGGTA AGCGAGCCGC TCGACACCAC CGGGATGCGT CTCGAACGCC GTATCTCCGC CGTTTATAAA GACATTCCCG GCGGCCAGCT GCTTGGCCCA ACCTACGACT ACACCCATCG CCTGCTCGAT TTTACCCTGC TGGCAAACGG CGAAGCGCCG ACGCTGACCA CCGCCGACAG CGAACAACAG CCGTCGCCGC ACGTTTTCAG CCTGCTGGCG CGTCAGGGGC TGGCGAAGTT TGAAGAGGAT AGCGGCGCAC AGCCGGATGA CATCACCCGC ACGCCGCCGG TTTACCCCTG CTCACGTTCT TCCCGTTTGC AGCAGTTGAT GCGCGGCGAC GAAGGCTATT TGCTGGCGCT GGCCTACTCC ACCCAGCGTG GTTACGGACG CAATCACCCG TTCGCGGGCG AGATCCGCAG TGGTTACATC GACGTGTCGA TTGTGCCGGA AGAGCTGGGA TTTGCGGTAA ACGTCGGCGA ACTACTGATG ACCGAGTGTG AAATGGTCAA CGGTTTTATC GACCCGCCGG ATGAGCCGCC GCACTTCACG CGCGGCTACG GGCTGGTATT CGGCATGAGC GAGCGCAAAG CGATGGCAAT GGCGCTGGTC GATCGTGCGT TGCAGGCTCC GGAATACGGC GAGCACGCGA CAGGCCCGGC GCAGGATGAA GAGTTTGTGC TGGCACATGC CGACAACGTC GAAGCCGCAG GCTTTGTCTC GCACCTCAAA CTCCCCCACT ACGTCGATTT CCAGGCCGAA CTGGAGCTAC TCAAACGTCT GCAACAGGAG AAGAACCATG GCTAA
|
Protein sequence | MYVAVKGGEK AIDAAHALQE SRRRGDTDLP ELSVAQIEQQ LNLAVDRVMT EGGIADRELA ALALKQASGD NVEAIFLLRA YRTTLAKLAV SEPLDTTGMR LERRISAVYK DIPGGQLLGP TYDYTHRLLD FTLLANGEAP TLTTADSEQQ PSPHVFSLLA RQGLAKFEED SGAQPDDITR TPPVYPCSRS SRLQQLMRGD EGYLLALAYS TQRGYGRNHP FAGEIRSGYI DVSIVPEELG FAVNVGELLM TECEMVNGFI DPPDEPPHFT RGYGLVFGMS ERKAMAMALV DRALQAPEYG EHATGPAQDE EFVLAHADNV EAAGFVSHLK LPHYVDFQAE LELLKRLQQE KNHG
|
| |