Gene EcolC_1459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1459 
Symbol 
ID6067304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1609038 
End bp1610798 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content52% 
IMG OID641600879 
Productsulfatase 
Protein accessionYP_001724449 
Protein GI170019495 
COG category[R] General function prediction only 
COG ID[COG3083] Predicted hydrolase of alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.741497 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.82887 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAACTC ATCGTCAGCG CTACCGTGAA AAAGTCTCCC AGATGGTCAG TTGGGGGCAC 
TGGTTTGCAC TGTTCAATAT TCTGCTTTCG CTCGTCATTG GCAGCCGTTA CCTGTTTATC
GCCGACTGGC CGACAACGCT TGCTGGTCGC ATTTATTCCT ACGTAAGCAT TATCGGCCAT
TTCAGCTTCC TGGTGTTCGC CACCTACTTG CTGATCCTCT TCCCGCTGAC CTTTATCGTC
GGCTCCCAGA GGCTGATGAG GTTTTTGTCC GTCATTCTGG CAACGGCGGG AATGACGCTA
TTACTGATCG ATAGCGAAGT CTTTACTCGT TTCCATCTCC ATCTTAATCC CATCGTCTGG
CAACTGGTTA TCAACCCAGA CGAAAATGAG ATGGCGCGCG ACTGGCAGCT GATGTTCATC
AGCGTGCCGG TTATTTTATT GCTTGAACTG GTGTTTGCGA CGTGGAGCTG GCAAAAGCTG
CGCAGCCTGA CGCGTCGTCG ACGCTTCGCG CGCCCGCTGG CCGCATTCTT ATTTATCGCC
TTTATCGCCT CGCATGTGGT GTATATCTGG GCCGATGCCA ACTTCTATCG CCCGATCACC
ATGCAGCGCG CTAACCTGCC GCTTTCGTAC CCGATGACGG CGCGACGTTT TCTTGAGAAG
CATGGTCTGC TTGATGCGCA GGAGTATCAA CGCCGTCTTA TTGAGCAAGG TAATCCAGAC
GCCGTTTCCG TTCAGTATCC GTTAAGCGAA CTGCGCTATC GCGATATGGG CACCGGGCAG
AATGTGCTGT TGATTACTGT CGATGGCCTG AACTACTCAC GCTTCGAGAA GCAGATGCCT
GCGCTGGCAG GTTTTGCTGA GCAAAATATT TCGTTCACGC GCCATATGAG CTCCGGCAAC
ACTACAGACA ACGGCATCTT TGGCCTGTTC TATGGCATCT CGCCGAGCTA TATGGACGGC
ATTCTGTCGA CCCGTACGCC TGCGGCATTA ATTACTGCGC TTAATCAGCA AGGCTATCAG
CTGGGGTTAT TCTCATCAGA TGGCTTTACC AGCCCGCTGT ATCGCCAGGC ATTGTTGTCA
GATTTCTCGA TGCCGAGCGT ACGCACCCAA TCCGACGAGC AGACCGCCAC GCAGTGGATC
AACTGGCTGG GACGCTACGC ACAAGAAGAT AACCGCTGGT TCTCGTGGGT TTCTTTCAAT
GGTACTAACA TTGACGACAG CAATCAGCAG GCATTTGCAC GGAAATATAG CCGGGCGGCA
GGCAATGTCG ATGACCAGAT CAACCGCGTG CTCAATGCAC TGCGTGATTC TGGCAAACTG
GACAATACGG TAGTGATTAT CACTGCCGGT CGGGGTATTC CACTGAGCGA AGAGGAAGAA
ACCTTTGACT GGTCCCACGG TCATCTGCAG GTACCATTAG TGATTCACTG GCCAGGCACG
CCGGCGCAGC GTATTAATGC TCTGACGGAT CATACCGATC TGATGACGAC GCTGATGCAA
CGCCTGCTAC ATGTCAGCAC ACCTGCCAGC GAATATTCGC AAGGTCAGGA TTTGTTCAAC
CCTCAACGCC GTCATTACTG GGTTACCGCA GCGGATAACG ATACGCTGGC AATTACCACC
CCGAAAAAGA CGCTGGTGCT GAACAATAAC GGTAAATACC GCACTTACAA CTTACGTGGT
GAAAGAGTGA AAGATGAAAA ACCACAGTTA AGTTTGTTAT TGCAAGTGCT GACAGACGAG
AAGCGTTTTA TCGCTAACTG A
 
Protein sequence
MVTHRQRYRE KVSQMVSWGH WFALFNILLS LVIGSRYLFI ADWPTTLAGR IYSYVSIIGH 
FSFLVFATYL LILFPLTFIV GSQRLMRFLS VILATAGMTL LLIDSEVFTR FHLHLNPIVW
QLVINPDENE MARDWQLMFI SVPVILLLEL VFATWSWQKL RSLTRRRRFA RPLAAFLFIA
FIASHVVYIW ADANFYRPIT MQRANLPLSY PMTARRFLEK HGLLDAQEYQ RRLIEQGNPD
AVSVQYPLSE LRYRDMGTGQ NVLLITVDGL NYSRFEKQMP ALAGFAEQNI SFTRHMSSGN
TTDNGIFGLF YGISPSYMDG ILSTRTPAAL ITALNQQGYQ LGLFSSDGFT SPLYRQALLS
DFSMPSVRTQ SDEQTATQWI NWLGRYAQED NRWFSWVSFN GTNIDDSNQQ AFARKYSRAA
GNVDDQINRV LNALRDSGKL DNTVVIITAG RGIPLSEEEE TFDWSHGHLQ VPLVIHWPGT
PAQRINALTD HTDLMTTLMQ RLLHVSTPAS EYSQGQDLFN PQRRHYWVTA ADNDTLAITT
PKKTLVLNNN GKYRTYNLRG ERVKDEKPQL SLLLQVLTDE KRFIAN