Gene CPS_0841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPS_0841 
SymbolatsA 
ID3522242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameColwellia psychrerythraea 34H 
KingdomBacteria 
Replicon accessionNC_003910 
Strand
Start bp855958 
End bp857712 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content39% 
IMG OID637283306 
Productarylsulfatase 
Protein accessionYP_267590 
Protein GI71281770 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TAATAAGTAT TCTGTCGACC GTTATATTAC TAACCAGTTG TGCAGAAAAG 
TCAGAATCAG TTCAGGCTAA TACGGCGGTA GAAGCTGATG CTAAGAAACC TAATATTCTG
CTCTTAGTCG CTGATGATAC AGCCTTTGGC GATATTGGCG CTTATGGCTC AGAAGTACAT
ACACCGAATA TGAATGAGAT AGCTAATGCA GGGATTCGTT TCACAAACTT TCATGTGTCA
CCCGTTTGTT CGGTAACACG TTCAATGCTT TTTACCGGTA ACGATAATAT CGAAGTGGGT
TTAGGTTCAT TTGATTATTC TGTTTATCCA GCGACACGCG GTAAAAAGGG TTATGAAGGT
TACCTCACCA AAGATGCAGT GACGATTTCA GAGTTATTAA ATGATGATGG TTATGAAGTC
TATAAGTCGG GTAAATGGCA TTTAGGTGGT GAAGAATCCG GTGGTAAAGG ACCGTTAGAA
TGGGGCTTCA CTAAAGAATT CGGTATTTTG TCAGGTGGTT CTAATCATTG GAATGATTTA
GCCATGACAC CAAATTTTAA AGATCCTAAT GGTTTAAATG TTAAAAGAAA AGAAAATTGG
ACTTTAAACG GCGAACCTTA TGACCGCCCT GAAGGTGTCT ATTCAGGTGA AATATACACT
AACCAAATGT TAGAGTTTAT TAAAGAAGGC GCTAAAAATG ATAAACCATG GTTTGCTTAC
ATGGCATTCA CCACCGCGCA CTTTCCTATT CAAGCGCCTA AAGGGCTGAT AATGAAATAT
TATCCTAAGT ACCTTGAACT AGGTTACGCA GGATTAAAAA AATCACGTTA TGAAAGCTTA
AAGGCCCAAG GCCTAATTTC ACATGAAGCA ACAGAAGCAC CTTTTAATAA CTTAACAAAA
AAATGGCAAG ATCTGTCTCA AGAAAACAAA GAGAAGCAAG CTAAAATCAT GGCAACTTAT
GCGGCCATGA TTGAAGACCA AGATAATCGT ATAGGGCAAA TATTAGATTA CCTGAGAGAG
TCGGGTCAGT TAGATAACAC CTTAGTCGTT TACATGACAG ATAATGGTCC TGAAGGTTTA
GAGCCAACAA ACCCTAAAAC GGGTAATCCT GAGTTTGCCA AGTGGATTGA AAACCAATTT
GATTCATCTT TTGAAGCGAT AGGTACTGCT AACTCACAAA ATGTTATTGG TACGTCTTGG
GCGAATTCCG CCACAGGCGG TCTACAGTGG TGGAAGTGGT TTGTTGGTGA AGGTGGCATA
CGTGTGCCAT TAATGATTGT ACCTCCAGGT GCTTTTAATA CTGACTATGT GCGTGCGGGT
GAAAAATCGA ATGTAGCTGT TTATGTTAAA GATGTTCCAA TGACTATTTT GGAATACGCC
AACGTTAAAC ACCCGATGAC AGAATATAAA GATAAAAAAG TTATCCCGCC AACGGGGATT
AGCATGAAGC CATTTTTAGA TGGGCAGTTT GATGTTGTTA GAACCGATAA AGATTGGTGG
GCATTTGAAT TGTTTGGTAA TGGCTATGTT ATGCAAGGTG AGTTCAAAGC GATGAAAGTG
AGAACAGGTA TGTTTGGTGA TGGTCAGTGG CACCTCTATA ATGTTGTTTC TGATCCCTCT
GAATCACACC CCCTAGAGCA TAAAAATCCA GAAAAGCTTA AAGCAATGAT TGCGCTTTAT
GAGTCTTATA TCGCTAAAAA TAATATCTTA GCAGTAGACG CTGATTGGAG CGCTTTTAAA
GGTGCTAGTC AATAA
 
Protein sequence
MKKIISILST VILLTSCAEK SESVQANTAV EADAKKPNIL LLVADDTAFG DIGAYGSEVH 
TPNMNEIANA GIRFTNFHVS PVCSVTRSML FTGNDNIEVG LGSFDYSVYP ATRGKKGYEG
YLTKDAVTIS ELLNDDGYEV YKSGKWHLGG EESGGKGPLE WGFTKEFGIL SGGSNHWNDL
AMTPNFKDPN GLNVKRKENW TLNGEPYDRP EGVYSGEIYT NQMLEFIKEG AKNDKPWFAY
MAFTTAHFPI QAPKGLIMKY YPKYLELGYA GLKKSRYESL KAQGLISHEA TEAPFNNLTK
KWQDLSQENK EKQAKIMATY AAMIEDQDNR IGQILDYLRE SGQLDNTLVV YMTDNGPEGL
EPTNPKTGNP EFAKWIENQF DSSFEAIGTA NSQNVIGTSW ANSATGGLQW WKWFVGEGGI
RVPLMIVPPG AFNTDYVRAG EKSNVAVYVK DVPMTILEYA NVKHPMTEYK DKKVIPPTGI
SMKPFLDGQF DVVRTDKDWW AFELFGNGYV MQGEFKAMKV RTGMFGDGQW HLYNVVSDPS
ESHPLEHKNP EKLKAMIALY ESYIAKNNIL AVDADWSAFK GASQ