Gene Cpin_2849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_2849 
Symbol 
ID8359010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp3514520 
End bp3516031 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content48% 
IMG OID644965029 
Productsulfatase 
Protein accessionYP_003122529 
Protein GI256421876 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000585259 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCAAA GAAATAAATA CCATATCATA CTATTGCTGG CTGCGCTATG CGCTTTATCC 
GGATCGGGTG TAACTGCCCA GTCAAAGCCC AATATTATCC TCTTGTATGC AGATGACCTG
GGATATGGTG ACGTAGGTTG TTATGGCGCG TCGGCAGTAA AAACACCGAA TATCGACCGT
CTTGCCAGTA AAGGAGTACG ATTTACAGAT GCGCATTGTA CAGCGGCTAC CTGTACGCCA
TCAAGACTGT CCTTACTGAC AGGCACTTAT GCTTTTCGAA AGAAAGCAGC TATCCTTCCT
GGTGATGCGC CTTTGCTGAT CCCACCGGAT ACGTATACTT TACCACGTAT GTTACAGCAG
GCAGGGTATA CGACCGCCGT GATCGGTAAA TGGCACTTAG GATTGGGGAA CGGCGTTATT
AACTGGAATG ATAACATTGG TCCGGGGCCG AATGAGATAG GTTTTGACTA TTCGTTTATT
ATTCCGGCTA CTACTGACCG GGTACCTACT GTCTTTGTGG AGAATGGAAG GGTACCTGAC
CTGGATCCCA ATGATCCTAT TGCTGTGAGT TATGCCGCGA TGATCGGGGA TGAACCTACC
GGACTCTCAG ATCCACAATT GCTGAAGCAA CGGGCAGATA CACAGCATAG CAATACGATT
ATTAACGGCA TCAGCCGGAT CGGTTTCATG ACCGGTGGCA AGCGTGCCCG TTGGGTAGAT
GAAGAGATCC CGATGGTGTT GAACGGAAAG GCGAAAGACT TCATAACTAC GCATAAAGAG
CAGCCGTTTT TCCTTTATTA TCCTTTCCCT AACATCCATG TACCGCGTAC ACCGAATAGG
AAATTTGCCG GTACTACGGC ACTTGGCGCC CGTGGAGACG TCATTGCAGA AATGGACTGG
TTAGTGGGAG AGATCACACA GCTGTTGGAT TCACTGGGAA TCGCGAAAAA TACACTGATT
GTATTCAGTA GTGATAATGG TCCTGTATTA GACGATGGCT ATGAAGACCA GGCCGGACAA
CTGAACAAAA GTCATAAACC GGCAGGGATA TTCAATGGTG GGAAATACAG CGCATTTGAA
GCCGGTACCC GGATGTCTAC CATTACCTAC TGGCCGGGTA CTATACGTCC TGGTGTTTCA
GCGGCTTTGT GTTCGCAGGT AGACCTGATG GCTTCTTTTG CAGCATTGAC AGGACAAAAA
TTACCTGCAG GCGCTGCACC TGACAGTCAG AATGCGCTGG ACGTATGGTT GGGCAAGTCA
GTACAGGGCA GGAAATACCT GCTGGAAGAA TCTTACACCC TGGCGTTGCG GGATAAAAGG
TGGAAGTATA TCGCTCCTCA GACGACGCCT ACGCCTGACT GGATGAAAAA CAAGGAAATA
GCTACCGGAC TGTCACCTGT GGAACAACTA TACGATCTGC ATAAGGATCC CGGAGAAACG
CATAATCTCG CCGGCCAACA TCCGGAAATC ATAAAGACAT TGAAAGCTGA ACTGAAAAAA
CTGACACTAT GA
 
Protein sequence
MEQRNKYHII LLLAALCALS GSGVTAQSKP NIILLYADDL GYGDVGCYGA SAVKTPNIDR 
LASKGVRFTD AHCTAATCTP SRLSLLTGTY AFRKKAAILP GDAPLLIPPD TYTLPRMLQQ
AGYTTAVIGK WHLGLGNGVI NWNDNIGPGP NEIGFDYSFI IPATTDRVPT VFVENGRVPD
LDPNDPIAVS YAAMIGDEPT GLSDPQLLKQ RADTQHSNTI INGISRIGFM TGGKRARWVD
EEIPMVLNGK AKDFITTHKE QPFFLYYPFP NIHVPRTPNR KFAGTTALGA RGDVIAEMDW
LVGEITQLLD SLGIAKNTLI VFSSDNGPVL DDGYEDQAGQ LNKSHKPAGI FNGGKYSAFE
AGTRMSTITY WPGTIRPGVS AALCSQVDLM ASFAALTGQK LPAGAAPDSQ NALDVWLGKS
VQGRKYLLEE SYTLALRDKR WKYIAPQTTP TPDWMKNKEI ATGLSPVEQL YDLHKDPGET
HNLAGQHPEI IKTLKAELKK LTL