Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_00334 |
Symbol | phoA |
ID | 8114778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 366273 |
End bp | 367688 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644846618 |
Product | hypothetical protein |
Protein accession | YP_002998191 |
Protein GI | 251783887 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1785] Alkaline phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.644931 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACAAA GCACTATTGC ACTGGCACTC TTACCGTTAC TGTTTACCCC TGTGACAAAA GCCCGGGCAC CAGAAATGCC TGTTCTGGAA AACCGGGCTG CTCAGGGCGA TATTACTGCA CCCGGCGGTG CTCGCCGCTT AACGGGTGAT CAGACCGCCG CTCTGCGTGA TTCTCTTAGC GATAAACCTG CAAAAAATAT TATTTTGCTG ATTGGCGATG GGATGGGGGA TTCGGAAATT ACTGCCGCAC GTAATTATGC CGAAGGTGCG GGCGGCTTTT TTAAAGGTAT CGATGCCTTA CCGCTTACCG GGCAATACAC TCACTATGCG CTGAATAAAA AAACCGGCAA ACCGGACTAC GTCACCGACT CGGCTGCATC AGCAACCGCC TGGTCAACCG GTGTCAAAAC CTATAACGGC GCGCTGGGCG TCGATATTCA CGAAAAAGAT CACCCAACGA TTCTGGAAAT GGCAAAAGCC GCAGGTCTGG CGACCGGTAA CGTTTCTACC GCAGAGTTGC AGGATGCCAC GCCCGCTGCG CTGGTGGCAC ATGTGACCTC GCGCAAATGC TACGGTCCGA GCGCGACCAG TGAAAAATGT CCGGGTAACG CGCTAGAAAA AGGCGGGAGA GGATCGATTA CCGAACAGCT GCTTAACGCT CGTGCCGATG TTACGCTTGG CGGCGGCGCA AAAACCTTTG CTGAAACGGC AACCGCCGGT GAATGGCAGG GAAAAACGCT GCGTGAACAG GCACAGGCGC GTGGTTATCA GTTGGTGAGC GATGCTGCCT CACTGAATGC GGTGACGGAA GCGAACCAGC AAAAACCCCT GCTAGGACTG TTTGCTGACG GCAATATGCC AGTGCGCTGG CAAGGACCGA AAGCAACGTA CCACGGCAAT ATCGACAAGC CCGCAGTTAC CTGTACGCCT AATCCGCAAC GTAATGACAG CGTACCGACC CTGGCGCAGA TGACTGATAA AGCCATTGAA TTGTTGAGTA AAAATGAGAA AGGCTTTTTC CTGCAAGTTG AAGGTGCATC AATCGATAAA CAGGATCACG CTGCGAATCC TTGTGGGCAA ATTGGCGAGA CGGTCGATCT CGACGAAGCC GTACAACGGG CGCTGGAATT CGCTAAAAAG GATGGCAACA CGCTGGTCAT AGTCACCGCT GATCACGCCC ACGCCAGCCA GATTGTTGCG CCGGACACCA AAGCGCCGGG CCTCACCCAG GCGCTAAATA CCAAAGATGG CGCAGTGATG GTGATGAGTT ACGGGAACTC CGAAGAGGAT TCACAAGAAC ATACCGGTAG TCAGCTGCGT ATTGCGGCGT ATGGCCCACA TGCCGCCAAT GTCGTTGGAC TGACCGACCA GACCGATCTC TTCTACACCA TGAAAGCCGC CCTGGGGCTG AAATAA
|
Protein sequence | MKQSTIALAL LPLLFTPVTK ARAPEMPVLE NRAAQGDITA PGGARRLTGD QTAALRDSLS DKPAKNIILL IGDGMGDSEI TAARNYAEGA GGFFKGIDAL PLTGQYTHYA LNKKTGKPDY VTDSAASATA WSTGVKTYNG ALGVDIHEKD HPTILEMAKA AGLATGNVST AELQDATPAA LVAHVTSRKC YGPSATSEKC PGNALEKGGR GSITEQLLNA RADVTLGGGA KTFAETATAG EWQGKTLREQ AQARGYQLVS DAASLNAVTE ANQQKPLLGL FADGNMPVRW QGPKATYHGN IDKPAVTCTP NPQRNDSVPT LAQMTDKAIE LLSKNEKGFF LQVEGASIDK QDHAANPCGQ IGETVDLDEA VQRALEFAKK DGNTLVIVTA DHAHASQIVA PDTKAPGLTQ ALNTKDGAVM VMSYGNSEED SQEHTGSQLR IAAYGPHAAN VVGLTDQTDL FYTMKAALGL K
|
| |