Gene Cpin_2602 details


Gene Information

Locus tag: Cpin_2602
Symbol:
ID: 8358762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 3188471
End bp: 3189829
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 50%
IMG OID: 644964785
Product: sulfatase
Protein accession: YP_003122286
Protein GI: 256421633
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
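
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codons in the CDS minus one terminal stop codon (assumed to be present)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3188471, 3189829)
print(length, expected_protein_length(length))
```

Under these assumptions the result agrees with the listed Gene Length (1359 bp) and Protein Length (452 aa).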


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.128586
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAATCA GACGCTTAAG TGCCATGGTT GCGCTGTCTT GTTTTATGGC GGCGCCACTG 
TTTGCACAGC AACAGAAAAG GCCGAATGTA CTTATAATTT ACACTGACGA CCAGGGCACC
CTGGATGTGA ATTGTTATGG CGCCAAGGAT CTGCATACCC CTAATATCGA CCGTCTGGCC
AAGGAGGGGG TTCTCTTCAG CCAGTTCTAC GCAGCAGCGC CAGTATGTTC TCCTTCCCGC
GCATCCCTGC TGACGGGCAG GTATCCACAG CGTGCACAAC TGGATAATAA TGCACCCAGT
GAGGAAGGAC ATGCCGGAAT GCCGGGATCG CAGTATACCA TGGCAGAAAT GTTTAAAGAT
GGCGGCTATA CGACAGCGCA CATCGGTAAG TGGCATATCG GTTATTCACC GGAGACCATG
CCGAATCAAC AGGGATTTGA CTATTCCTTT GGTTTTATGG GTGGTTGTAT AGATAATTAT
TCACATTATT TCTACTGGGC GGGACCTAAC AGACATGACC TGTGGAGAAA CGGGCAGGAG
ATCTGGGAGG ATGGGAAGTT CTTTGCTGAC CTGACAGTAC AGGAAGTAAA CGGATTCCTG
GAGAAAAACA AGCGGGCTGA TAAGCCTTTC TTCCTGTACT GGGCGATCAA TATGCCCCAC
TACCCTTTAC AGGGACAGGA GAAATGGAGA CAATATTACA AAGATCTGCC GGCGCCACGC
AGAATGTATG CTGCCGCAGT TTCCACAATG GACGAAAAGA TCGGACAGGT ATTACAGCAG
CTGGACCGTC TCGGACTGGC GGAGAATACG ATCGTTGTAT TCCAGTCCGA TCAGGGACAT
TCCACAGAGG ACAGAAGCTT TGGCGGTGGC GGTTTTACCG GTCCGTACAG AGGGGCGAAA
TTCAGTCTGT TTGAAGGTGG CATCCGTGTT CCGGCTATTA TCCGCTGGAC TGGACATCTG
CCAAAGAACG AGGTACGTGA TCAGCTGTGT GTAAATATCG ACTGGTATCC TACATTGGCC
GGACTTTGTA AAGTGGCTTT ACCGCAGCGG AAGATTGACG GAAAAGATAT TCAGCAGGTG
ATCACTTCTT CCAAGACCAG CTCTCCTCAT GACATTTTCT TCTGGCAATC GCAGGGCACG
AAGGAGAATC CGCAGTGGGC TGTCAGACAA GGTAACTGGA AACTCCTGCA CAATCCTTCC
AGTGCGAAGA AAGCAGAAAC AGGCCCGGAC GACCTCTTCC TGGTGAATCT GCAACAGGAT
ACTTCAGAAG CGAAGAACCT GGCAGCACAA CATCCGGAGA TTGTCTCTTC CTTAAAAGAG
CAATACCTGA AATGGATCAA CGAAGTCGTA CAACAATAA
 
Protein sequence
MRIRRLSAMV ALSCFMAAPL FAQQQKRPNV LIIYTDDQGT LDVNCYGAKD LHTPNIDRLA 
KEGVLFSQFY AAAPVCSPSR ASLLTGRYPQ RAQLDNNAPS EEGHAGMPGS QYTMAEMFKD
GGYTTAHIGK WHIGYSPETM PNQQGFDYSF GFMGGCIDNY SHYFYWAGPN RHDLWRNGQE
IWEDGKFFAD LTVQEVNGFL EKNKRADKPF FLYWAINMPH YPLQGQEKWR QYYKDLPAPR
RMYAAAVSTM DEKIGQVLQQ LDRLGLAENT IVVFQSDQGH STEDRSFGGG GFTGPYRGAK
FSLFEGGIRV PAIIRWTGHL PKNEVRDQLC VNIDWYPTLA GLCKVALPQR KIDGKDIQQV
ITSSKTSSPH DIFFWQSQGT KENPQWAVRQ GNWKLLHNPS SAKKAETGPD DLFLVNLQQD
TSEAKNLAAQ HPEIVSSLKE QYLKWINEVV QQ
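
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the sequence is given on the coding strand, starting in frame.

```python
from itertools import product

# Codon table built in the conventional TCAG order; table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above give the first 10 residues.
print(translate("ATGAGAATCA GACGCTTAAG TGCCATGGTT"))
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be compared against the nucleotide data.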