Gene Cpin_3320 details


Gene Information

Locus tag: Cpin_3320
Symbol:
ID: 8359486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 4083793
End bp: 4085355
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 45%
IMG OID: 644965493
Product: sulfatase
Protein accession: YP_003122988
Protein GI: 256422335
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
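
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4083793, 4085355)
print(length)                     # 1563
print(protein_length_aa(length))  # 520
```

Under these assumptions the computed values agree with the listed gene length of 1563 bp and protein length of 520 aa.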


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0000925232
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGAGTATCT TAGCTTCTAT GTTGAAAATT ACGAATATCA GATCTGTAGT TGTCGCACTT 
ATTGTACTCC TTGCACCAAT TCTGTTGAGT GCACAGTCGA AACGACCAAA TATCATCTTC
ATTCTTTCAG ATGATCATAC CTACCAGGCC ATCAGTGCTT ATGGTAACAG GTATGTGCAA
ACGCCAAATA TCGACCGGAT AGCACGCGAA GGGGTCTTAT TCCATCATGC CATGGTCACC
AACTCCATAT GTGGACCAAG CAGAGCTACG TTATTGACAG GAAAATACAG CCATAAAAAT
GGGTATCCAC TGAATGAAAA ACGATTCGAC AATACCCAGC AGACTTTCCC CGGTATTTTG
CAGCAAAATG GTTATCAGAC TGCATGGATA GGTAAAATGC ACCTTGGTAC ATTGCCAGAG
GGATTCAATT ACTTCAGCAT TTTGCCTGGT CAGGGTAAAT ACTACAATCC GGACTTCATT
TCCACACCGA ATGATACGGT GAGGATGGGC GGTTATGTCA CTAATATTAT TACGGACTTG
TCTGTATCCT GGCTTAATGG CCGGGATACA AGCCGTCCTT TTATGCTCGT TGTAGGAGAA
AAAGCAACAC ACCGCGAATG GCTGCCTGAC CTGCCTGATC TGGGCGCGTA CGACAGTATT
AATTTCCCGC TGCCTGCTAG CTTCTCCGAC GACTATAAAG GCCGTCGTGC AGCCCTCGAC
CAGGATATGA CCATCTCCAA CACGATGCGA CTGAAAGAAG ACCTGAAAGT GCATGTAAAC
TATCAGCAAA ATAAAAACAG TGAATACGGC AGGTTCACAC CAGAGCAGCT GGCGCCGTTT
GCTCAGTATT ATGAGCAGAA AATAAGCCAC GAATTCGATT CCCTGCACCT GAGTGGCCAA
GCACTTACAG CGTGGAAATA TCAGCGTTAT ATGCGCGATT ATCTCGCTAC TGCCCGTTCC
CTGGACAGGA ATATCGGACG CATCCTCCAA TACCTCGATA GCACCGGACT GGCCGACAAT
ACAGTGGTGA TCTATTGTTC CGATCAGGGC TTCTACCTGG GAGAACATGG CTGGTTTGAC
AAACGTTTTA TTTACGAACA GTCTTTACAT ACGCCTTTTG TAATGCGTTA TCCTGGTGTG
ATCAAACCCG GTACAAACAG TAATAGCTTT ATGCTGAATA TTGACTGGGC GCCTACCTTA
CTGGATATCG CACACGTAAA AGCACCTGCA GATATGCAGG GTACTTCGTT TATGCCTTTG
GTGAGCGGTA AAGGGGATAC TACCCGCTGG AGGAAGGATA TGTATTATCA TTATTACGAG
TTCCCGGAGC CGCATCATGT ATCGCCTCAC TTTGGCATAC GTACGGAGAG ATATAAACTG
GTACGTTTTT ATGGACCTGC TGATTTCTGG GAATTGTATG ATCTGAAAAA CGATCCGCAG
GAACTGCATA ATATTTACAA TGATCCGGCA CAGGCGAAGA ATATAGTAAA CCTGAAGAAA
CGTTTGAAGG TACTGATAAC GCAGTATGAT GATAAAGATG CGGAGAAACT ACTGAAGAAT
TAA
 
Protein sequence
MSILASMLKI TNIRSVVVAL IVLLAPILLS AQSKRPNIIF ILSDDHTYQA ISAYGNRYVQ 
TPNIDRIARE GVLFHHAMVT NSICGPSRAT LLTGKYSHKN GYPLNEKRFD NTQQTFPGIL
QQNGYQTAWI GKMHLGTLPE GFNYFSILPG QGKYYNPDFI STPNDTVRMG GYVTNIITDL
SVSWLNGRDT SRPFMLVVGE KATHREWLPD LPDLGAYDSI NFPLPASFSD DYKGRRAALD
QDMTISNTMR LKEDLKVHVN YQQNKNSEYG RFTPEQLAPF AQYYEQKISH EFDSLHLSGQ
ALTAWKYQRY MRDYLATARS LDRNIGRILQ YLDSTGLADN TVVIYCSDQG FYLGEHGWFD
KRFIYEQSLH TPFVMRYPGV IKPGTNSNSF MLNIDWAPTL LDIAHVKAPA DMQGTSFMPL
VSGKGDTTRW RKDMYYHYYE FPEPHHVSPH FGIRTERYKL VRFYGPADFW ELYDLKNDPQ
ELHNIYNDPA QAKNIVNLKK RLKVLITQYD DKDAEKLLKN
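
The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11 (bacterial, archaeal and plant plastid code), where several non-ATG codons can act as initiators and are read as methionine at the first position. The following is a minimal sketch of that translation, assuming the gene sequence is shown in coding orientation; `translate_cds` and the constant names are illustrative, and the start-codon set is the one commonly listed for table 11.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which table 11 shares for internal codons).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons assumed for table 11; each is read as M at position 1.
START_CODONS = {"ATG", "TTG", "CTG", "GTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not an assumed table 11 start codon")
    protein = ["M"]  # the initiator codon is always read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGAGTATCT TAGCTTCTAT GTTGAAAATT"))  # MSILASMLKI
```

The first ten residues produced this way match the start of the listed protein sequence; passing the full 1563 bp sequence to the same function would allow the whole record to be checked.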