Gene CPS_3032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPS_3032 
Symbol 
ID3518391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameColwellia psychrerythraea 34H 
KingdomBacteria 
Replicon accessionNC_003910 
Strand
Start bp3170144 
End bp3171712 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content41% 
IMG OID637285484 
Productsulfatase family protein 
Protein accessionYP_269731 
Protein GI71278798 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAATTA ATACTAAATT TACGCAGTTT GCCATTGCAT TAGGGATGTT GACTGCATCT 
GCAACGGCAC TAGCAACAAC TGATACAAGC AAACCTAATA TTTTGGCAAT TTGGGGCGAT
GATATTGGTA TATATAATAT TAGTGCTTAT AACCATGGCA TGATGGGTTA TCAAACACCT
AATATCGACA GGATTGCAAA TGAAGGTGCA TTATTCACCG ATCAATACGC GCAACAAAGC
TGTACGGCAG GTCGCTCAGC GTTCATTTTA GGGCAAGAAC CTTTTCGTAC CGGTTTGTTA
ACTATCGGTA TGCCAGGTTC TACTCATGGC ATTCCTGATT GGGCCCCTAC GATAGGTGAT
GTTGCTAAAG ATAACGGCTA CATGACCGCA CAATTTGGTA AGAACCATTT AGGTGACCAA
GACAAACATT TACCGACTAA ACATGGTTTT GATGAGTTTT TCGGTAACCT TTATCACTTA
AATGCTGAAG AAGAGCCGGA AACCTATTAC TACCCGAAAG ATCCAAGATT TAAGAAAAAG
TTTGGTCCTC GCGGTGTTTT ACATACCTTT GCTGATGGTC GTATGGAAGA TACTGGCGCA
TTAACAAGAA AGCGCATGGA AACGGCTGAT GAAGAGTTTT TAGGTGCCAC GTTAAAGTTT
ATCGACAAAG CTCATAAGGC GGATAAACCT TTCTTTATTT GGTACAACAG TACACGAATG
CATGTTCACA CACGCTTACA AGAAAAATGG CAAGGTAAGT CAGGCATCAG CATTTATGCA
GATGGTATGT TAGAGCACGA TGAGCACGTA GGGGTTTTAT TAGACAAACT TGATGATTTG
AAAATTGCTG ACAATACCAT TGTTATTTAC ACCACAGATA ATGGTGCAGA AACATTTACT
TGGCCTGACG GTGGTAATAC TCCATTCCAT GGCGAAAAGG GTACAACTTA TGAAGGTGGC
ATGCGTGTAC CTCAGTTAGT TAGATGGCCC GGTACTATCA AACCCGGTAG CAAAATGAAC
TCAATGATGT CTCATATCGA TTGGATGCCA ACATTAGCTG CAGCGATGGG TAACGATACG
TTAGTTGCTG ATCTTAAAAA AGGTGGTGAA ATAAATAACA AAAAATGGCG AGTACATTTA
GATGGTTTTA ATTTCAAACC TTACTTTGCT GGTGAAGTTG ACAAAGGGCC ACGTGAAACG
ATTATGTACT TTAGCCAATC AGGTCAATTA AATGCGATAC GTTGGAATGA TTGGAAAGCA
AGTTTTGCAC TGGTTAAAGG GGATATGGCA AGTGGCACAC GTGAAGTACC AGCGTGGCCA
CAACTAGTAA ACTTACGTGC AGACCCTTTT GAAAAAGGAC CGATTGAATC GTCTATGTAT
GTTCGTTGGA TGGTTGATAA CATGTGGGCA TTTGTACCTG TAAGCGGCAA GGTAAAAGAG
TTCCTAGGCT CATTAGAAGG TTACCCAATG CAAGTTGGTC AGAGTTTTGG TGCTGCCGAT
ATAAACTACA CAACATTGCA AATGAAAGCA TTTGTTAAAA AAGTATCAAC AGAGATTAAA
GCGAAATAA
 
Protein sequence
MTINTKFTQF AIALGMLTAS ATALATTDTS KPNILAIWGD DIGIYNISAY NHGMMGYQTP 
NIDRIANEGA LFTDQYAQQS CTAGRSAFIL GQEPFRTGLL TIGMPGSTHG IPDWAPTIGD
VAKDNGYMTA QFGKNHLGDQ DKHLPTKHGF DEFFGNLYHL NAEEEPETYY YPKDPRFKKK
FGPRGVLHTF ADGRMEDTGA LTRKRMETAD EEFLGATLKF IDKAHKADKP FFIWYNSTRM
HVHTRLQEKW QGKSGISIYA DGMLEHDEHV GVLLDKLDDL KIADNTIVIY TTDNGAETFT
WPDGGNTPFH GEKGTTYEGG MRVPQLVRWP GTIKPGSKMN SMMSHIDWMP TLAAAMGNDT
LVADLKKGGE INNKKWRVHL DGFNFKPYFA GEVDKGPRET IMYFSQSGQL NAIRWNDWKA
SFALVKGDMA SGTREVPAWP QLVNLRADPF EKGPIESSMY VRWMVDNMWA FVPVSGKVKE
FLGSLEGYPM QVGQSFGAAD INYTTLQMKA FVKKVSTEIK AK