Gene EcHS_A2327 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2327 
Symbol 
ID5592008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2326554 
End bp2328314 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content52% 
IMG OID640921453 
Productsulfatase family protein 
Protein accessionYP_001458989 
Protein GI157161671 
COG category[R] General function prediction only 
COG ID[COG3083] Predicted hydrolase of alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value0.194972 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAACTC ATCGTCAGCG CTACCGTGAA AAAGTCTCCC AGATGGTCAG TTGGGGGCAC 
TGGTTTGCAC TGTTCAATAT TCTGCTTTCG CTCGTCATTG GCAGCCGTTA CCTGTTTATC
GCCGACTGGC CGACAACGCT TGCTGGTCGC ATTTATTCCT ACGTAAGCAT TATCGGCCAT
TTCAGCTTCC TGGTGTTCGC CACCTACTTG CTGATCCTCT TCCCGCTGAC CTTTATCGTC
GGCTCCCAGA GGCTGATGAG GTTTTTGTCC GTCATTCTGG CAACGGCGGG AATGACGCTA
TTACTGATCG ATAGCGAAGT CTTTACTCGT TTCCATCTCC ATCTTAATCC CATCGTCTGG
CAACTGGTTA TCAACCCAGA CGAAAATGAG ATGGCGCGCG ACTGGCAGCT GATGTTCATC
AGCGTGCCGG TTATTTTATT GCTTGAACTG GTGTTTGCGA CGTGGAGCTG GCAAAAGCTG
CGCAGCCTGA CGCGTCGTCG ACGCTTCGCG CGCCCGCTGG CCGCATTCTT ATTTATCGCC
TTTATCGCCT CGCATGTGGT GTATATCTGG GCCGATGCCA ACTTCTATCG CCCGATCACC
ATGCAGCGCG CTAACCTGCC GCTTTCGTAC CCGATGACGG CGCGACGTTT TCTTGAGAAG
CATGGTCTGC TTGATGCGCA GGAGTATCAA CGCCGTCTTA TTGAGCAAGG TAATCCAGAC
GCCGTTTCCG TTCAGTATCC GTTAAGCGAA CTGCGCTATC GCGATATGGG CACCGGGCAG
AATGTGCTGT TGATTACTGT CGATGGCCTG AACTACTCAC GCTTCGAGAA GCAGATGCCT
GCGCTGGCAG GTTTTGCTGA GCAAAATATT TCGTTCACGC GCCATATGAG CTCCGGCAAC
ACTACAGACA ACGGCATCTT TGGCCTGTTC TATGGCATCT CGCCGAGCTA TATGGACGGC
ATTCTGTCGA CCCGTACGCC TGCGGCATTA ATTACTGCGC TTAATCAGCA AGGCTATCAG
CTGGGGTTAT TCTCATCAGA TGGCTTTACC AGCCCGCTGT ATCGCCAGGC ATTGTTGTCA
GATTTCTCGA TGCCGAGCGT ACGCACCCAA TCCGACGAGC AGACCGCCAC GCAGTGGATC
AACTGGCTGG GACGCTACGC ACAAGAAGAT AACCGCTGGT TCTCGTGGGT TTCTTTCAAT
GGTACTAACA TTGACGACAG CAATCAGCAG GCATTTGCAC GGAAATATAG CCGGGCGGCA
GGCAATGTCG ATGACCAGAT CAACCGCGTG CTCAATGCAC TGCGTGATTC TGGCAAACTG
GACAATACGG TAGTGATTAT CACTGCCGGT CGGGGTATTC CACTGAGCGA AGAGGAAGAA
ACCTTTGACT GGTCCCACGG TCATCTGCAG GTACCATTAG TGATTCACTG GCCAGGCACG
CCGGCGCAGC GTATTAATGC TCTGACGGAT CATACCGATC TGATGACGAC GCTGATGCAA
CGCCTGCTAC ATGTCAGCAC ACCTGCCAGC GAATATTCGC AAGGTCAGGA TTTGTTCAAC
CCTCAACGCC GTCATTACTG GGTTACCGCA GCGGATAACG ATACGCTGGC AATTACCACC
CCGAAAAAGA CGCTGGTGCT GAACAATAAC GGTAAATACC GCACTTACAA CTTACGTGGT
GAAAGAGTGA AAGATGAAAA ACCACAGTTA AGTTTGTTAT TGCAAGTGCT GACAGACGAG
AAGCGTTTTA TCGCTAACTG A
 
Protein sequence
MVTHRQRYRE KVSQMVSWGH WFALFNILLS LVIGSRYLFI ADWPTTLAGR IYSYVSIIGH 
FSFLVFATYL LILFPLTFIV GSQRLMRFLS VILATAGMTL LLIDSEVFTR FHLHLNPIVW
QLVINPDENE MARDWQLMFI SVPVILLLEL VFATWSWQKL RSLTRRRRFA RPLAAFLFIA
FIASHVVYIW ADANFYRPIT MQRANLPLSY PMTARRFLEK HGLLDAQEYQ RRLIEQGNPD
AVSVQYPLSE LRYRDMGTGQ NVLLITVDGL NYSRFEKQMP ALAGFAEQNI SFTRHMSSGN
TTDNGIFGLF YGISPSYMDG ILSTRTPAAL ITALNQQGYQ LGLFSSDGFT SPLYRQALLS
DFSMPSVRTQ SDEQTATQWI NWLGRYAQED NRWFSWVSFN GTNIDDSNQQ AFARKYSRAA
GNVDDQINRV LNALRDSGKL DNTVVIITAG RGIPLSEEEE TFDWSHGHLQ VPLVIHWPGT
PAQRINALTD HTDLMTTLMQ RLLHVSTPAS EYSQGQDLFN PQRRHYWVTA ADNDTLAITT
PKKTLVLNNN GKYRTYNLRG ERVKDEKPQL SLLLQVLTDE KRFIAN