Gene CPS_2984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPS_2984 
Symbol 
ID3520029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameColwellia psychrerythraea 34H 
KingdomBacteria 
Replicon accessionNC_003910 
Strand
Start bp3124275 
End bp3125813 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content39% 
IMG OID637285437 
Productsulfatase family protein 
Protein accessionYP_269684 
Protein GI71279570 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00336122 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAAA ATAAACTAAA AATGTTGATG ATGGGTGCAA GCTTGATAGC GACTGCATCC 
GCAACAGCAG CGGAAAAGCC AAATATTTTA TTTTTCTGGG GCGATGATAT AGGACGTACA
AATATCAGTG CCTACAGCCA CGGTATAATG GGTTTTAAAA CACCTAACAT CGATCGCATA
GCTAAAGAAG GTATGATGTT CACCGATTAT TATGCAGATC AAAGCTGTAC CGCTGGTCGT
TCAACGTTTA TCACTGGACA ATCAGGTTTA CGTACCGGCA TGACAAAAGT TGGCTTACCT
GGCGCTAAAG AAGGCATTCA AGATAGAGAT ATTACTATTG CAGAAATGTT AAAAGCTAAG
GGCTATACCA CAGGTCAATT TGGTAAAAAC CACTTAGGTG ATAAAGATGA ACATTTACCC
TCTAATCATG GTTTTGATGA ATTTTTTGGT AACCTTTACC ATTTAAATGC AGAGGAAGAG
CCAGAAGACC CTGATTACCC TAAAGATCCT GCTTTTAAGA AAAAATTTGG ACCACGCGGT
GTTATTCACT CTTATGCCGA TGGTAAAATT GAAGATACCG GCCCTTTAAC TAAAAAACGC
ATGGAAACAG CTGATGATGA ATTTGTCGCT GCAGCCATGA AATTCGTTGA TAAAGCAGTG
AAAGCTAAAA AACCTTTCTT TGTTTGGGTT AATACTGCAG GCATGCACTT TAGAACACAC
ATCAATCCAA AACATGTGGG TCTTTCAGGT CAAGGGTTCT ATAACGATGT GATGGTCGCT
CACGATAATC ATGTTGGCAT GATGTTAGAT CAACTTGATA AGTTAAAAGT TACTGACAGT
ACAATTGTCA TGTACTCTAC CGATAATGGC GTGCACTACA ATACTTGGCC AGATGCCGGT
ATAACACCGT TTGATGGTGA AAAAAACAGT GAAAAAGAAG GTGCTTATCG TGTTCCAATG
ATGGTGCGCT GGCCTGGTAA AATTAAAGCC GGTGAAGTTT CAAACGAAAT GATGGCTCAT
TTAGATTGGA TGCCAACTTT AGCTGCCGCT GCAGGTGATA CTAAACTCAA AGAAGACATG
CTTAAAGGCA AACGTCGCTT TGGTAATAAG CAATCAAAAA TTCATCTTGA TGGCTATAAT
ATGCTACCCC ACCTTACGGG TAAAACAGAG AAAAGCCCAC GCAACATTTA TCATTATTTA
AATGATGAAG GTTTCCCTGT TGCCATTCGT ATTGGTGATT GGAAAATGGT TTATGCAGAA
AATCGTGGTA AAACCTTGGC CCTTTGGACA GAACCTTTCA CTATGCTAAG AATGCCTAAA
ATCTTAAACT TACGTCGTGA CCCGTGGAGT AAAGCTGAAG AAAACTCTAA TTCTTACTAC
GATTGGATGA TTGATAAAGC GCCGTATATC TATTTAGGTT TATCAGAAAC AGCTAAGTTT
TTATCAACCT TTAAAGACTA TCCACCTAGC CAACCTACTG GCTCTTGGTC AGTTGAAGCG
GTATATGATA CTTTTTTGAA AAAATCTGAA GGTAAATAA
 
Protein sequence
MIKNKLKMLM MGASLIATAS ATAAEKPNIL FFWGDDIGRT NISAYSHGIM GFKTPNIDRI 
AKEGMMFTDY YADQSCTAGR STFITGQSGL RTGMTKVGLP GAKEGIQDRD ITIAEMLKAK
GYTTGQFGKN HLGDKDEHLP SNHGFDEFFG NLYHLNAEEE PEDPDYPKDP AFKKKFGPRG
VIHSYADGKI EDTGPLTKKR METADDEFVA AAMKFVDKAV KAKKPFFVWV NTAGMHFRTH
INPKHVGLSG QGFYNDVMVA HDNHVGMMLD QLDKLKVTDS TIVMYSTDNG VHYNTWPDAG
ITPFDGEKNS EKEGAYRVPM MVRWPGKIKA GEVSNEMMAH LDWMPTLAAA AGDTKLKEDM
LKGKRRFGNK QSKIHLDGYN MLPHLTGKTE KSPRNIYHYL NDEGFPVAIR IGDWKMVYAE
NRGKTLALWT EPFTMLRMPK ILNLRRDPWS KAEENSNSYY DWMIDKAPYI YLGLSETAKF
LSTFKDYPPS QPTGSWSVEA VYDTFLKKSE GK