Gene B21_00334 details


Gene Information

Locus tag: B21_00334
Symbol: phoA
ID: 8114778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 366273
End bp: 367688
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 54%
IMG OID: 644846618
Product: hypothetical protein
Protein accession: YP_002998191
Protein GI: 251783887
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1785] Alkaline phosphatase
TIGRFAM ID:
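
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(366273, 367688)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the listed Gene Length (1416 bp) and Protein Length (471 aa).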


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.644931
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAACAAA GCACTATTGC ACTGGCACTC TTACCGTTAC TGTTTACCCC TGTGACAAAA 
GCCCGGGCAC CAGAAATGCC TGTTCTGGAA AACCGGGCTG CTCAGGGCGA TATTACTGCA
CCCGGCGGTG CTCGCCGCTT AACGGGTGAT CAGACCGCCG CTCTGCGTGA TTCTCTTAGC
GATAAACCTG CAAAAAATAT TATTTTGCTG ATTGGCGATG GGATGGGGGA TTCGGAAATT
ACTGCCGCAC GTAATTATGC CGAAGGTGCG GGCGGCTTTT TTAAAGGTAT CGATGCCTTA
CCGCTTACCG GGCAATACAC TCACTATGCG CTGAATAAAA AAACCGGCAA ACCGGACTAC
GTCACCGACT CGGCTGCATC AGCAACCGCC TGGTCAACCG GTGTCAAAAC CTATAACGGC
GCGCTGGGCG TCGATATTCA CGAAAAAGAT CACCCAACGA TTCTGGAAAT GGCAAAAGCC
GCAGGTCTGG CGACCGGTAA CGTTTCTACC GCAGAGTTGC AGGATGCCAC GCCCGCTGCG
CTGGTGGCAC ATGTGACCTC GCGCAAATGC TACGGTCCGA GCGCGACCAG TGAAAAATGT
CCGGGTAACG CGCTAGAAAA AGGCGGGAGA GGATCGATTA CCGAACAGCT GCTTAACGCT
CGTGCCGATG TTACGCTTGG CGGCGGCGCA AAAACCTTTG CTGAAACGGC AACCGCCGGT
GAATGGCAGG GAAAAACGCT GCGTGAACAG GCACAGGCGC GTGGTTATCA GTTGGTGAGC
GATGCTGCCT CACTGAATGC GGTGACGGAA GCGAACCAGC AAAAACCCCT GCTAGGACTG
TTTGCTGACG GCAATATGCC AGTGCGCTGG CAAGGACCGA AAGCAACGTA CCACGGCAAT
ATCGACAAGC CCGCAGTTAC CTGTACGCCT AATCCGCAAC GTAATGACAG CGTACCGACC
CTGGCGCAGA TGACTGATAA AGCCATTGAA TTGTTGAGTA AAAATGAGAA AGGCTTTTTC
CTGCAAGTTG AAGGTGCATC AATCGATAAA CAGGATCACG CTGCGAATCC TTGTGGGCAA
ATTGGCGAGA CGGTCGATCT CGACGAAGCC GTACAACGGG CGCTGGAATT CGCTAAAAAG
GATGGCAACA CGCTGGTCAT AGTCACCGCT GATCACGCCC ACGCCAGCCA GATTGTTGCG
CCGGACACCA AAGCGCCGGG CCTCACCCAG GCGCTAAATA CCAAAGATGG CGCAGTGATG
GTGATGAGTT ACGGGAACTC CGAAGAGGAT TCACAAGAAC ATACCGGTAG TCAGCTGCGT
ATTGCGGCGT ATGGCCCACA TGCCGCCAAT GTCGTTGGAC TGACCGACCA GACCGATCTC
TTCTACACCA TGAAAGCCGC CCTGGGGCTG AAATAA
 
Protein sequence
MKQSTIALAL LPLLFTPVTK ARAPEMPVLE NRAAQGDITA PGGARRLTGD QTAALRDSLS 
DKPAKNIILL IGDGMGDSEI TAARNYAEGA GGFFKGIDAL PLTGQYTHYA LNKKTGKPDY
VTDSAASATA WSTGVKTYNG ALGVDIHEKD HPTILEMAKA AGLATGNVST AELQDATPAA
LVAHVTSRKC YGPSATSEKC PGNALEKGGR GSITEQLLNA RADVTLGGGA KTFAETATAG
EWQGKTLREQ AQARGYQLVS DAASLNAVTE ANQQKPLLGL FADGNMPVRW QGPKATYHGN
IDKPAVTCTP NPQRNDSVPT LAQMTDKAIE LLSKNEKGFF LQVEGASIDK QDHAANPCGQ
IGETVDLDEA VQRALEFAKK DGNTLVIVTA DHAHASQIVA PDTKAPGLTQ ALNTKDGAVM
VMSYGNSEED SQEHTGSQLR IAAYGPHAAN VVGLTDQTDL FYTMKAALGL K
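
The relationship between the gene sequence and the protein sequence can be sketched as a translation with the record's translation table 11. This is a minimal illustration, not the database's own pipeline: the function name `translate_cds` is hypothetical, the codon table is the standard amino acid assignment used by table 11, and the code assumes the first codon is a valid start codon (such as GTG here) that is read as methionine.

```python
# Amino acid assignments for NCBI translation table 11, codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        # Assumption: the first codon is a start codon and is read as M.
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAAACAAA GCACTATTGC ACTGGCACTC"))
```

The printed residues correspond to the first ten residues of the protein sequence listed above (MKQSTIALAL).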