Gene Cpin_4597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4597 
Symbol 
ID8360770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp5739812 
End bp5741656 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content45% 
IMG OID644966751 
Productsulfatase 
Protein accessionYP_003124239 
Protein GI256423586 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGAATCC TCTTTGCCAT CATTAGCTGT GTTTTCGGTT TGCTTTACAA TATACTTACA 
GCAGCTGCCC AGCGGCAACC CAACATCGTG ATTTTTATTG CAGATGACCT GAACCAGCAG
GATGTTGGCT GTTATGGCAA TAGAGATGTC AGGACGCCCA ATATGGATAA GCTGGCAGCA
GAAGGGATGC AATTTAAAAG TGCCTATGCC GCTTCGCCGA TGTGCGCACC TTCCAGGAGT
GTGATGTTTA CCGGATTGTA TCCTTTCCGG AATGGTTCCC AGATGAATCA CTTCACAGTA
AGGCCCAATA CCAGGAACCT GCCGCAGTTC CTGCAAAAAC TGGGATACCG GGTCGTGATC
TCTGGGAAGA CAGATATCTT TCCGTTACAT AACTTCCCCT TTGAGCATAT CGGAGAAGAG
TTCGGCAAAT ATGCGCCTAT TGAAAACAGG ACTGATCGTA AAAAGGAAAC AGTTAATATG
ATCCGCACAC ATTTTCAGGA TCATCCGGAG CAGCCTATCT GTCTGATTGT AGCGCCCTGG
ATACCACACG TACCGTGGTT CCCGAATACA GACTTCGATC CGCAGCAGAT AAAACTCCCC
GATTATCTTG CTGATACAAA AGAGACGCGC AAAGCGCTGA CTGCCTATTA CCAGAGTATC
GGTGTGGCAG ATAAGATGCT GGGAGAGGTA ATGCAGGCAA TAGAAGGCGC AGGAGTGAAG
GACAATACAG TCACGATGTT TATTGCAGAC CAGGGCGCAC AATTTCCTTC CGCCAAATGG
TCGGTATACG ATCAGGGATT ACGGATTCCG ATGATTGTCA GATGGCCGGG GAAAGTGATG
CCAGGTACCG TGTCAGATGC ATTGGTGTCA CTGGCGGATC TCACACCGAC ACTGGTAGAT
CTTGCAGGCG GTAAGGCCAT TGATGATTTA GACGGTACAT CGTTTAAAGA CGTGTTGCTC
AATAAAAAGA AAGAACATCA TCAGTATATC TTTGCGGAGA CGTCCATGGA ACCGCATTTC
TGGTATAACT ATACACCCTC CAGGACGGTC ATTACGAAAG ACGGCTTTCA TTATATCAGG
AATTATCATC CGGGCGTGCG TTTTATTACG CATATCGACA AGGTTGAACA GAATGAGTTT
TATTTTGACA GCTGGATTGC TGGCGCAGCT ACTGACCCCA AAACAAAGTT CCTGCTTGAC
CGTTATAGCT ATCATCCACC GGAAGAATTA TATGATCTGA ATCGTGATCG CAAAGAATTC
AGTAATCTGA TTGCGAATCC TGCTTATTAC AGCCGGACCA ACGAACTGAA AAAATTACTG
GATAAAGAAC TAAGCCGACA GGGAGAAACG GCTGCAATGA TCCTGGAAGG ACCGCTTCCG
CAATTCTTTG ACCGTAGCTA TACTATCCGG CAAAACGCGA GTGCGGCAGA TCTGTCGTTT
AATAAAAAAG TATGGAATCC GCATGTATTG GTTGTTACCG CTTACCTGGA TAAAATAGAC
AAAGGCGGGG TGATCTGTGA CTACTTCAAT AACTTTAAAC TATACGCTTA TCATGATCAG
ATAGGGATTG TACTGGCAGA TGGTAAGACG ATCAATAGTG AGCAATTACC TGCAAACAAA
GGACAGTTGT ACATGACTTT GTCAGAAAAA GGGGAGCTTG TCGTACAATT CAATCAACAG
ACGATCATCA CACAGCAATT AGAAAAAGAC CTTACAAAGA TAAAAGGTGG TTATGTGAGC
TGTGGGATGA TTCAGGGAGA GGAGATGACA GGTCATTTGC AAAGTTATCA GGGCAATATT
ACTGATCTGC GGTTTACAAT GAATGAATTA TCAGGAGAAC CCTGA
 
Protein sequence
MRILFAIISC VFGLLYNILT AAAQRQPNIV IFIADDLNQQ DVGCYGNRDV RTPNMDKLAA 
EGMQFKSAYA ASPMCAPSRS VMFTGLYPFR NGSQMNHFTV RPNTRNLPQF LQKLGYRVVI
SGKTDIFPLH NFPFEHIGEE FGKYAPIENR TDRKKETVNM IRTHFQDHPE QPICLIVAPW
IPHVPWFPNT DFDPQQIKLP DYLADTKETR KALTAYYQSI GVADKMLGEV MQAIEGAGVK
DNTVTMFIAD QGAQFPSAKW SVYDQGLRIP MIVRWPGKVM PGTVSDALVS LADLTPTLVD
LAGGKAIDDL DGTSFKDVLL NKKKEHHQYI FAETSMEPHF WYNYTPSRTV ITKDGFHYIR
NYHPGVRFIT HIDKVEQNEF YFDSWIAGAA TDPKTKFLLD RYSYHPPEEL YDLNRDRKEF
SNLIANPAYY SRTNELKKLL DKELSRQGET AAMILEGPLP QFFDRSYTIR QNASAADLSF
NKKVWNPHVL VVTAYLDKID KGGVICDYFN NFKLYAYHDQ IGIVLADGKT INSEQLPANK
GQLYMTLSEK GELVVQFNQQ TIITQQLEKD LTKIKGGYVS CGMIQGEEMT GHLQSYQGNI
TDLRFTMNEL SGEP