Gene CPS_2368 details


Gene Information

Locus tag: CPS_2368
Symbol:
ID: 3522371
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Colwellia psychrerythraea 34H
Kingdom: Bacteria
Replicon accession: NC_003910
Strand:
Start bp: 2466469
End bp: 2468082
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 38%
IMG OID: 637284825
Product: putative N-acetylglucosamine-6-sulfatase
Protein accession: YP_269086
Protein GI: 71281899
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
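
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2466469
END_BP = 2468082
GENE_LENGTH_BP = 1614
PROTEIN_LENGTH_AA = 537


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: 2468082 - 2466469 + 1 = 1614 bp, and 1614 / 3 - 1 = 537 aa.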


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTCAA AAATAGATGC GTTAAAAATA ACAGTGCTAA GTTTAAGTTT ATGCTTTTCA 
GTATCATCGT TATCAGCAAC CGTAAACAAA ACAGTAAAAC AAAAGAAAAA TGTCATCTAC
ATTTTAACTG ATGACCAACG CTATGATGAA GTAGGCTTTT TAAACCCGCG TATTGATACA
CCAAATATGG ATAAACTTGC TGCTGGCGGT GTTTATTTCA AAAATGCTTT TGTTACTACC
GCCCTTTGCT CACCTAGTCG TGCAACGATA TTAACTGGTC AGTACATGCA TAATCATGGA
GTAGTTGATA ATAATAACCC AGCAAAAGAA AGCTCTGTAT ATTTCCCTTC CTATCTACAA
GAGGTGGGTT ATGAAACAAG TTTCTTCGGC AAATGGCATA TGGGTGGTCA CGGTGACTCT
CCTCAACCGG GGTTTGATCA TTGGTTAAGT TTTGCGGGTC AAGGACATTA CTATCCCAAA
AAAGATAAAA AAGGTCGAAC AAACAAAATT AATATCAATG GCGAAAGAGT TGACCAAAAG
GGCTATATTA CCGATGAGTT GACTGATTAC GCGGTGGATT GGTTAGACAA ACGTGATTCA
GACAAACCAT TTTTTATGTA TTTATCTCAT AAAGCAGTAC ATTCTAATTT TGATCCTGCT
CCACGCCATA AAGATCAATA TAGCGATGTA GCAATTGAAG TCCCTGAAAG CCAAGCCGAT
ACTCCAGAAA ACTATGCAGG CAAGCCTATG TGGGTGAAGA ATCAACGTAA TAGCTGGCAT
GGGGTCGACT TTCCTTACCA TAGTGAAATG GACGTTCAAG AATATAAGCG TCAATACCAT
AGAGCACTAT CTGCCGTGGA TGATAGTTTA GGTCGTGTAT TAAAGTGGCT AAAAGATAAT
AACTTAGAAA ATGATACTAT TGTGATGTTA ATGGGCGATA ACGGCTTTAT GTTTGGCGAA
CACGGTTTAA TTGACAAGCG TAATGCTTAT GAAGAGTCTA TGCGTGTACC GTTACTTGCT
TATGCTCCCG GTTATTTCAA ACCCGGCACC GTAGTAGACG AAATGGTTGC TAACCTAGAC
ATAGCCCCTA CAATATTAGA AATTGCAGGC GCTAAAAAAC CAGCTCACTT TGATGGCGAC
AGTTGGTTAC CTCTTGCTAA AAACAAAGAA GTAAATCAAT GGCGTGAGAA CTTTTTATAT
GAATATTATT GGGAATTTAA CTACCCTTCT ACCCCAACTA CTTTTGCTTT GCGTACTGAC
AACTACAAAC TAATTCAATA TCACGGTGTT TGGGACACTG AAGAGCTTTA TGACTTAAAA
AATGATCCTA AAGAAATGAA CAACTTAATC AATACACCTA AACATCAACC ACTTATAGCG
CAAATGCGTC ATGATTTATT CAACCTTTTA GTGAATAAAA AAGGTGATAA TGTTATCCCT
TACACTGAAA AGTATACTCC TGGTGCGGTT TACCGTGAAC GTGACCGTGG CGAAACAGCT
GACTTTCCAG ATAACTGGTT GAAAAAAGAG GGTGATGATG GTTTAAGAAC GTTCTTACGA
ATAAAGCCTA TTAAAGATAA AAAAGATGAC AAGAAAAAAT CAGCTAAACA TTAA
 
Protein sequence
MSSKIDALKI TVLSLSLCFS VSSLSATVNK TVKQKKNVIY ILTDDQRYDE VGFLNPRIDT 
PNMDKLAAGG VYFKNAFVTT ALCSPSRATI LTGQYMHNHG VVDNNNPAKE SSVYFPSYLQ
EVGYETSFFG KWHMGGHGDS PQPGFDHWLS FAGQGHYYPK KDKKGRTNKI NINGERVDQK
GYITDELTDY AVDWLDKRDS DKPFFMYLSH KAVHSNFDPA PRHKDQYSDV AIEVPESQAD
TPENYAGKPM WVKNQRNSWH GVDFPYHSEM DVQEYKRQYH RALSAVDDSL GRVLKWLKDN
NLENDTIVML MGDNGFMFGE HGLIDKRNAY EESMRVPLLA YAPGYFKPGT VVDEMVANLD
IAPTILEIAG AKKPAHFDGD SWLPLAKNKE VNQWRENFLY EYYWEFNYPS TPTTFALRTD
NYKLIQYHGV WDTEELYDLK NDPKEMNNLI NTPKHQPLIA QMRHDLFNLL VNKKGDNVIP
YTEKYTPGAV YRERDRGETA DFPDNWLKKE GDDGLRTFLR IKPIKDKKDD KKKSAKH
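
The protein sequence can be related to the gene sequence by translating codons with the listed translation table (11). The sketch below is a hedged illustration, not the tool that produced this record: it builds the codon table from the standard NCBI amino-acid string in TCAG order (table 11 uses the same codon assignments as the standard code for elongation), strips the spacing used in the listing, and stops at the first stop codon. The names `clean` and `translate` are hypothetical helpers.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order (first base slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGTCTTCAA AAATAGATGC GTTAAAAATA"))  # MSSKIDALKI
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence once the spacing is removed from both. Alternative start codons, which table 11 permits, are not handled here because this gene begins with ATG.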