Gene Cpin_2850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_2850 
Symbol 
ID8359011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp3516028 
End bp3517719 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content48% 
IMG OID644965030 
Productsulfatase 
Protein accessionYP_003122530 
Protein GI256421877 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00796001 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCGC GGAATGTATT ACTATCATCA TTGATCATTC TATCTTCCTG TACAGGGAGT 
CGTAAGAACA ACAAGACAAC AGTAAGTAAG GATGAAAGAC CCAATATCGT GCTGATACTG
GCGGATGATC TTGGCTACTC AGATCTTGGT TGTTACGGCG GGGAGATACA AACACCCAAT
CTCGATTATC TGGCAGCAAA TGGTCTGCGC TTCAGGCATT TCTATAATAC TTCGCGTTGT
TGCCCTTCCA GGGCTTCCCT GCTGACCGGT TTATATAATC AACAGGCAGG TATCGGCGAG
ATGACGACCG CGAGAGCAGA GGCCGGATAT AGGGGTTATA TCACCGAGAA TACGGTTACA
CTGGCAGAGG TATTGAAAGA TGCCGGTTAC CATACGGCTA TGTCAGGCAA GTGGCATGTA
TCCAATACAG TGGAACAGTC CACACCGGCA GCACAACTGA AATGGCTGAA CCACCAGGCA
TCACATCCTT ACTTCTCACC AGTGGAACAA TATCCTGTCA ACAGAGGATT TGAAAAGTAT
TACGGTAATA TCTTCGGTGT GGTCGATTAT TTTGATCCTT TCAGTCTGGT GAATGGTACG
ACGCCGGTGG AAAGTGTTCC GAAAGACTAT TATCATACAG ACGCGATCAA CGATACAGCT
GTCAGCTATG TGAGAGCGCT CAGTAAAGAA GATAAACCCT TCTTCCTGTA TGTTGCACAT
ACTGCCCCGC ACTGGCCTTT ACAGGCATTA CCGGAAGACA TAAAGAAATA CGAACAGACT
TATAAGGGTG GCTGGGATGT TATCCGGGAA GCTCGTTATA AAAGGATGGT AGCACAGGGG
CTGATCGATC CCAAAACCAC ACCATTATCG CCCCGTATCA ATAACCAGCT GAGCTGGGAT
AAAAATCCGG ATAAGGACTG GGATGCACGG GCCATGGCAG TACATGCTGC GATGGTTGAT
CGTATGGACC AGGGCATTGG TCGCCTGATT CAGACATTAC GGGAAACGGG TAAACTGGAC
AATACGATTA TCATATTCTT AAGCGATAAC GGCGCCAGTC CGGAGAACTG TATGCGATAT
GGTCCGGGCT TCGATCGTCC GGGACAAACC AGGGATGGAA AAGAGATCAG TTACCCGGTA
AAGAAAGATG TATTGCCTGG TCCTCAGACC ACCTTTGCTT CCATTGGAGA GCGTTGGGCG
AATGTAGTGA ATACGCCTTA TCAATACGCG AAAGCACAGT CCTATGAAGG CGGTGTGCGT
ACGCCGATGA TTGCCTACTG GCCGAAGGGT ATTAAAGCGA AAGGGGCGTA TGCAGACCAA
CTCGCACACG TCATGGATTT CATGCCTACC TTCCTGAATG TGGCAAAAGC CAGTTATCCG
CAGACTTATA AGGGGCATAG TATTACTCCT TCCACTGGGG TCAGTCTGCT ACCAGCTTTC
GAAGGTAAAC AGGAGCCAGG ACATGATGTG TTATATAACG AGCATTTCAA CGCCCGTTAT
GTACGAGCAG GCGACTGGAA ACTGGTCTCC TTGTCCGGCG ATTCCACCTG GCATCTCTAC
AAGATCAACC AGGATGAAAC GGAATTAAAC GATCTGGCGG CACAGCATCC CGATGTAGTC
GCCCGTATGA CGGCACAATG GAGACAATGG GCAAATACCC ACAATGTGTT TCCTAAACCC
GGTAAAAAAT AG
 
Protein sequence
MKARNVLLSS LIILSSCTGS RKNNKTTVSK DERPNIVLIL ADDLGYSDLG CYGGEIQTPN 
LDYLAANGLR FRHFYNTSRC CPSRASLLTG LYNQQAGIGE MTTARAEAGY RGYITENTVT
LAEVLKDAGY HTAMSGKWHV SNTVEQSTPA AQLKWLNHQA SHPYFSPVEQ YPVNRGFEKY
YGNIFGVVDY FDPFSLVNGT TPVESVPKDY YHTDAINDTA VSYVRALSKE DKPFFLYVAH
TAPHWPLQAL PEDIKKYEQT YKGGWDVIRE ARYKRMVAQG LIDPKTTPLS PRINNQLSWD
KNPDKDWDAR AMAVHAAMVD RMDQGIGRLI QTLRETGKLD NTIIIFLSDN GASPENCMRY
GPGFDRPGQT RDGKEISYPV KKDVLPGPQT TFASIGERWA NVVNTPYQYA KAQSYEGGVR
TPMIAYWPKG IKAKGAYADQ LAHVMDFMPT FLNVAKASYP QTYKGHSITP STGVSLLPAF
EGKQEPGHDV LYNEHFNARY VRAGDWKLVS LSGDSTWHLY KINQDETELN DLAAQHPDVV
ARMTAQWRQW ANTHNVFPKP GKK