Gene EcSMS35_0413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0413 
SymbolphoA 
ID6144438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp424133 
End bp425548 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content54% 
IMG OID641615309 
Productalkaline phosphatase 
Protein accessionYP_001742516 
Protein GI170680523 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1785] Alkaline phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACAAA GCACTATTGC ACTGGCACTC TTACCGTTAC TGTTTTCCCC TGTGACAAAA 
GCCCGGACAC CAGAAATGCC TGTTCTGGAA AACCGGGCTG CTCAGGGCGA TATTACTGCG
CCCGGCGGTG CTCGCCGCTT AACGGGTGAT CAGACCGCCG CTCTGCGTGA TTCTCTTAGC
GATAAACCTG CAAAAAATAT TATTTTGCTG ATTGGCGATG GGATGGGGGA CTCGGAAATT
ACTGCCGCAC GCAATTATGC CGAAGGTGCG GGCGGCTTTT TTAAAGGTAT CGATGCCTTA
CCGCTTACCG GGCAATACAC TCACTATGCG CTGAATAAAA AAACCGGCAA ACCGGACTAC
GTCACCGACT CGGCTGCATC AGCAACCGCC TGGTCAACTG GTGTCAAAAC CTATAACGGC
GCGCTGGGCA TCGATATTCA CGAAAAAGAT CACCCAACGA TTCTGGAAAT GGCAAAAGCC
GCAGGTCTGG CGACCGGTAA CGTTTCTACC GCAGAGTTGC AGGATGCCAC GCCCGCTGCG
CTGGTGGCGC ATGTGACCTC GCGCAAATGC TACGGTCCGA GCGCGACCAG TGAAAAATGT
CCGGGTAACG CTCTGGAAAA AGGCGGAAAA GGATCGATTA CCGAACAGCT GCTTAACGCC
CGTGCCGATG TTACGCTTGG CGGCGGCGCA AAAACCTTTG CTGAAACGGC AACCGCCGGT
GAATGGCAGG GAAAAACGCT GCGTGAACAG GCACAGGCGC GTGGTTATCA GTTGGTGAGT
GATGCTGCCT CACTGAATTC GGTGACGGAA GCGAATCAGC AAAAACCCCT ATTAGGACTG
TTTGCTGACG GCAATATGCC AGTGCGCTGG CTAGGACCGA AAGCAACGTA CCACGGCAAT
ATCGACAAGC CCGCAGTTAC CTGTACGCCT AATCCGCAAC GTAATGACAG CGTACCGACC
CTGGCGCAGA TGACCGACAA AGCCATTGAA TTGTTGAGTA AAAATGAGAA AGGCTTTTTC
CTGCAAGTTG AAGGTGCATC AATCGATAAA CAGGATCACG CTGCGAATCC TTGTGGGCAA
ATTGGCGAGA CGGTCGATCT CGATGAAGCC GTACAACGTG CGCTGGAATT CGCTAAAAAG
GATGGCAACA CGCTGGTCAT AGTCACCGCT GATCACGCCC ACGCCAGCCA GATTGTCGCG
CCGGACACCA AAGCGCCGGG CCTCACCCAG GCGCTAAATA CCAAAGATGG CGCAGTGATG
GTGATGAGTT ACGGGAACTC CGAAGAGGAT TCACAAGAAC ATACCGGCAG TCAGTTGCGT
ATTGCAGCGT ATGGCCCACA TGCCGCCAAT GTCGTTGGAC TGACCGACCA GACCGATCTC
TTCTACACCA TGAAAGCCGC CCTGGGGCTG AAATAA
 
Protein sequence
MKQSTIALAL LPLLFSPVTK ARTPEMPVLE NRAAQGDITA PGGARRLTGD QTAALRDSLS 
DKPAKNIILL IGDGMGDSEI TAARNYAEGA GGFFKGIDAL PLTGQYTHYA LNKKTGKPDY
VTDSAASATA WSTGVKTYNG ALGIDIHEKD HPTILEMAKA AGLATGNVST AELQDATPAA
LVAHVTSRKC YGPSATSEKC PGNALEKGGK GSITEQLLNA RADVTLGGGA KTFAETATAG
EWQGKTLREQ AQARGYQLVS DAASLNSVTE ANQQKPLLGL FADGNMPVRW LGPKATYHGN
IDKPAVTCTP NPQRNDSVPT LAQMTDKAIE LLSKNEKGFF LQVEGASIDK QDHAANPCGQ
IGETVDLDEA VQRALEFAKK DGNTLVIVTA DHAHASQIVA PDTKAPGLTQ ALNTKDGAVM
VMSYGNSEED SQEHTGSQLR IAAYGPHAAN VVGLTDQTDL FYTMKAALGL K