Gene Cpin_3296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3296 
Symbol 
ID8359462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp4059850 
End bp4061232 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content48% 
IMG OID644965469 
Productsulfatase 
Protein accessionYP_003122964 
Protein GI256422311 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0308085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.000582836 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTACAG GATGCGCGTT ACTGACTACT ATCCTGACCC AGGGACAGAC CCATAAACCC 
AATATTATCT TCATTCTGGC TGACGATCTG GGATATGGTA ATATCAGCGC TTATAACAGC
AAAAGCCCTG TTAAAACGCC TAATATTGAC AGACTGGGGC AGGAGGGAAT TCAATTCAAG
AACTTCTATT CCGGTAATAC CGTTTGTGCG CCTTCCCGTT GTGCCCTGCT CACCGGAAAG
CATATGGGAC ATGCCTATAT CAGGGGGAAT ACACGCCTTC CCTTACGTGC GGAGGACAGT
ACCCTGGCGC AGCTGCTGCA AGGGAACGGT TACCGTACCG GTATGTTTGG CAAATGGGGA
TTGGGAGAAT CCGGTACGAC AGGCTCCCCG GAGATCAAAG GTTTTGATAC CTTCTTTGGT
TATCTGAATC AGCAACATGC ACATAACTAC TATACAGACT ATCTCTTTGA AGTAAAGGAA
GGACAGATCA GCCGTGTACC CAGGGACACG AATGTCTATT CCCAGGATGA GATCCTGCAA
CATGCGCTGT CTTTCATCAA TGACAATAAA GACAAGCCTT TTTTCCTTTT CCTGCCATTT
ACCCTGCCAC ATGCAGAACT TGCGCCGCCA GCTACTGATA TGCAAGCCTT CCTCAATGCC
GATGGCAGCA GTAAGTTAGG TCCGGAAACG CCTTATGAGC GTAAGAACGG GACCTATCGC
AGTCAGGAAA ATCCACATGC AGCATTCGCA GCGATGGTCA CGAAACTGGA CAGGAACGTA
GGGGAGATCA GCGCACTGAT AAAACAACTT GGTCTTGACG ACAATACTTA TATCTTCTTT
ACAAGCGATA ACGGTCCGCA TAGAGAAGGT GGTGCAGACC CCATTTATTT TGACAGTAAT
GGCCCCCTAA AAGGCATTAA ACGCGATCTC TATGAAGGAG GTATCAGAGT ACCCCTGCTC
GTGAGAGCGC CGGGTAAAGT GTCCGCAGGA CAGGTTAGCA CCATTCCATG GGCGTTCTGG
GATGTATTGC CCACTTTGAG TGATATTACG CATTCCCCTG TTTTATCCGG AATAGATGGC
TTATCTTACA CAAAAGCGCT GAACGGAACG AAACCCGCAA GGCAGCATGA TCACTTCTAT
TGGCAGTTTA ATGAAGGCGG ATTACAGGAA GCATTGCTGA AAGACGACTG GAAACTGATC
CGCTTTAAAA AACGTGGCAC ACCCGAACGT TTTGAACTGT ACCATCTATC CGAAGACATA
GGAGAAGAAC ATGACCTGGC CACAAAATAC CCACAGAAAG TGAAAGCGCT CTCCGGACTG
ATGCTGCAAT CCAAAATGCC GGCGGAGAAC CCTGAGTTTG ACTGGTCGGC TACCGAACAA
TAA
 
Protein sequence
MLTGCALLTT ILTQGQTHKP NIIFILADDL GYGNISAYNS KSPVKTPNID RLGQEGIQFK 
NFYSGNTVCA PSRCALLTGK HMGHAYIRGN TRLPLRAEDS TLAQLLQGNG YRTGMFGKWG
LGESGTTGSP EIKGFDTFFG YLNQQHAHNY YTDYLFEVKE GQISRVPRDT NVYSQDEILQ
HALSFINDNK DKPFFLFLPF TLPHAELAPP ATDMQAFLNA DGSSKLGPET PYERKNGTYR
SQENPHAAFA AMVTKLDRNV GEISALIKQL GLDDNTYIFF TSDNGPHREG GADPIYFDSN
GPLKGIKRDL YEGGIRVPLL VRAPGKVSAG QVSTIPWAFW DVLPTLSDIT HSPVLSGIDG
LSYTKALNGT KPARQHDHFY WQFNEGGLQE ALLKDDWKLI RFKKRGTPER FELYHLSEDI
GEEHDLATKY PQKVKALSGL MLQSKMPAEN PEFDWSATEQ