Gene CPS_2983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPS_2983 
Symbol 
ID3520535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameColwellia psychrerythraea 34H 
KingdomBacteria 
Replicon accessionNC_003910 
Strand
Start bp3122407 
End bp3123975 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content39% 
IMG OID637285436 
Productputative arylsulfatase 
Protein accessionYP_269683 
Protein GI71280072 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0960108 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATGA ATAATCGGCT TAAGAAGTTA GCACTAGGCA TAGGTGTACT TGCCATCGCC 
ACCAGTGCAG CAGCAACAAC CAACAAAGCT AAGCCTAATG TACTAGCTAT TTGGGGTGAT
GATATTGGTT ATTACAATAT CAGTGCTTAT AACCAAGGCA TGATGGGTTA TCAAACACCA
AATATCGACC GTATTGCTGA TGAAGGCGCT TTGTTTACCC ATCATTATGC ACAACAAAGT
TGTACTGCTG GCCGTGCTTC TTTCATTTTA GGTCAAGAAC CCTTCAGAAC CGGTTTATTA
ACTATTGGTA TGCCAGGTTC AACACACGGT ATTCCCGATT GGACACCTAC CATTGCTGAT
CTTCTAAAAG AAAAAGGTTA CATGACTGCG CAATTTGGTA AAAACCATTT AGGTGATCAA
GATAAACACT TACCGACTAA TCATGGTTTT GATGAGTTTT TTGGTAATTT ATATCATTTG
AATGCCGAAG AAGAGCCTGA AACTTATTAT TATCCTAAAG ATAAAGAATT TCATAAAAAA
TATGGTCCTC GCGGTGTTAT CCATTCATTC GCTGATGGAA AAATAGAAAA TACAGGTTCT
ATGACGCGTA AACGCATGGA AACAGCTGAT GGAGAGTTTT TAGCGGGTAC CTTGAAGTTT
ATTGATAAAG CGCATAAAGC CAAAAAGCCT TTCTTTATCT GGCATAGCTC AACTCGTATG
CATGTATGGA CACGTTTGCA AGAAAAGTAT CGCGGTAAGT CAGGCGTAAG TTTAACGGCT
GATGGTATGT TAGAACATGA TGATCAAGTG GGTATATTAC TTGATAAATT AGACGATTTA
AAAATTGCAG ATAATACCAT TGTTATTTAT TCAACCGACA ATGGTGCAGA AAAATTTACT
TGGCCTGATG GTGGTACATC ACCATTTAGA GGCGAAAAAG GAACGACAAC AGAAGGCGGT
ATGCGTGTTC CTCAACTCGT TCGCTGGCCT GGTACAATCA AGGCAGGCAG TAAATTTAAT
AACATGATGT CACATGAAGA TTGGATGCCA ACACTATTAG CGGCAGCGGG TGAGCCAAAC
ATAGTTAACA AGCTTAAAAA AGGTTACAAA GCTAACGGTA AAAAATGGAA AATTCATCCT
GATGGTCATA ACTTCTTACC TTTCTTTAAA GGCCAAGAAA AAGCATCTCC GCGCACGAGT
AAATTATATT TCAATGCTGC CGGTGATTTG AATGCTGTAC GTTGGAATGA ATGGAAAATT
GCCTTTGCAG AAGAAGAAGG CGGAATTAGC ACTGCATACC GTAAAGTCCC TGCATGGCCT
ACCATTACCA ACTTACATGC AGATCCCTTT GAAACGGCTG CAAAAGAGTC AGGAATGTAC
TTACGTTGGT ATGCGGATAA CATGTGGTTA TTTGTCCCGG CACAACAACA AGTTGCACAG
TTCATGTCAA CTATTGACAA ATATCCTTTC CAAGAAGGTA GTAGTTTAAG TGCGAGTAAT
ATTGGTTATA AAAGCATTAG AACACAGGCT GCACTAAAAA AAATACAACA ACTAAGTCCT
AACCGATAA
 
Protein sequence
MEMNNRLKKL ALGIGVLAIA TSAAATTNKA KPNVLAIWGD DIGYYNISAY NQGMMGYQTP 
NIDRIADEGA LFTHHYAQQS CTAGRASFIL GQEPFRTGLL TIGMPGSTHG IPDWTPTIAD
LLKEKGYMTA QFGKNHLGDQ DKHLPTNHGF DEFFGNLYHL NAEEEPETYY YPKDKEFHKK
YGPRGVIHSF ADGKIENTGS MTRKRMETAD GEFLAGTLKF IDKAHKAKKP FFIWHSSTRM
HVWTRLQEKY RGKSGVSLTA DGMLEHDDQV GILLDKLDDL KIADNTIVIY STDNGAEKFT
WPDGGTSPFR GEKGTTTEGG MRVPQLVRWP GTIKAGSKFN NMMSHEDWMP TLLAAAGEPN
IVNKLKKGYK ANGKKWKIHP DGHNFLPFFK GQEKASPRTS KLYFNAAGDL NAVRWNEWKI
AFAEEEGGIS TAYRKVPAWP TITNLHADPF ETAAKESGMY LRWYADNMWL FVPAQQQVAQ
FMSTIDKYPF QEGSSLSASN IGYKSIRTQA ALKKIQQLSP NR