Gene Cpin_3801 details


Gene Information

Locus tag: Cpin_3801
Symbol:
ID: 8359972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 4778414
End bp: 4779775
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 50%
IMG OID: 644965973
Product: sulfatase
Protein accession: YP_003123464
Protein GI: 256422811
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
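
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4778414
end_bp = 4779775
gene_length_bp = 1362
protein_length_aa = 453

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons. The final
# codon is the stop codon and does not contribute an amino acid.
codons, remainder = divmod(span, 3)

print(span)        # 1362
print(codons - 1)  # 453
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.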


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00499822
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.102523
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCAA GGTTTTTCTA TGTAATGACC CTATTGTTGG CCACTGTACA AATGACTTAT 
TGTCAAAAGC CCGTTCAGCC CAACGTGGTC ATCATACTGA CCGATGACAT GGGATATGGC
GATATTAGCT GCTACGGCGG CAACGTAATG CCTACTCCCC ATGTCGACAA TATGGCGAAA
AACGGTATGC GCTGTACCCA ATATTACAGT GCAGCCCCCA TCTGCTCTCC TTCCCGCGCA
GGCATCCTCA CAGGGATGTA TCCTGCCCGC TGGAACTTCA GCACCTACCT GGACAACAAA
AAACATAATA AAGCCGCGCA ACAAACCGAT TACCTGGATC CGAAAGCCCC CTCTATCGCA
TGCATCTTTA AAAATGCAGG ATATGCCACA GGCCATTTCG GTAAATGGCA CCTCGGCGGA
GGCCGTGACG TAACAGACGC TCCAGGCTTT GAACAATACG GTTTTGATGA ACATGCCAGT
ACCTATGAGA GTCCGGACCC GGACCCGCTA CTTACAGCAA CCAACTGGAT CTGGTCTGAT
AAAGACAGTA TCAAACGCTG GGACCGCACG GCCTATTTTG TAGATAAAGC ACTGAGCTTC
CTCCGTAGTC ATACCGGCCA ACCCTGCTTT ATCAACCTCT GGCCCGACGA TGTACATACC
CCCTGGGTTC CCCGCAGGTC AGATGGTGAT ACCGCCCGCC TGAAACCAGA AGAAGAAGCC
GCACTGAGAA AAGTTTTAAA AGAATACGAT ATACAGATCG GGCGCTTCCT CGCCGAACTG
AAAAGATCCG GTCTAGATAA AAATACCATC GTCATTTTTA CCAGTGATAA TGGTCCGCTA
CCCACCTTCC GGAACAGCCG TACATTGGGT TTACGTGGTT CCAAATTATC ACTGTACGAC
GGCGGTACAC GGATGCCATT TGTCATTAAC TGGACAGGAC ATATCAAACC GGGTAGTATC
GACAGCACCT CCATGATCAC CGGACTAGAC CTGTTGCCGA CTCTGGCAGG TATGGCAGGT
ATTAAACTAC CAAAAGACTA TCACGGCGAT GGTGTTGATC GCTCTGCCGT CTTTACCGGA
CGTCCCTCTG CCCGTAACAA AGACATGTTC TGGGAATACG GTCGTAATAA TATCGCCTAC
GCCTATCCCA AAGAATTGGC TTGGAATAGA AGTCCGCAGC TGGCTGTTCG CTCCGGAGAA
TGGAAATTCC TGATGAACGC AGACCGTAGC GAACCAGCAC TATACAACGT AAAACTGGAT
CCGGGAGAAT CACTCGACAT GAGCGGCATC CGTCCTGATC TGGTAAAACG GCTTTCCACT
GATCTCATGA ACTGGTGGAC CGCCATGCCA AAGCTGCAAT AA
 
Protein sequence
MKARFFYVMT LLLATVQMTY CQKPVQPNVV IILTDDMGYG DISCYGGNVM PTPHVDNMAK 
NGMRCTQYYS AAPICSPSRA GILTGMYPAR WNFSTYLDNK KHNKAAQQTD YLDPKAPSIA
CIFKNAGYAT GHFGKWHLGG GRDVTDAPGF EQYGFDEHAS TYESPDPDPL LTATNWIWSD
KDSIKRWDRT AYFVDKALSF LRSHTGQPCF INLWPDDVHT PWVPRRSDGD TARLKPEEEA
ALRKVLKEYD IQIGRFLAEL KRSGLDKNTI VIFTSDNGPL PTFRNSRTLG LRGSKLSLYD
GGTRMPFVIN WTGHIKPGSI DSTSMITGLD LLPTLAGMAG IKLPKDYHGD GVDRSAVFTG
RPSARNKDMF WEYGRNNIAY AYPKELAWNR SPQLAVRSGE WKFLMNADRS EPALYNVKLD
PGESLDMSGI RPDLVKRLST DLMNWWTAMP KLQ
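
As a rough consistency check between the two sequences, the first and last lines of the gene sequence can be translated and compared with the matching ends of the protein sequence. This is a minimal sketch, not the database's own method. The helper names `CODON_TABLE`, `ungroup`, and `translate` are hypothetical. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in their permitted start codons), and it assumes the sequence is given 5' to 3' in the coding direction.

```python
from itertools import product

# Codon table built in TCAG order. Table 11 shares these amino acid
# assignments with the standard code; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(text.split())


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon; '*' marks a stop codon."""
    dna = ungroup(dna).upper()
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - len(dna) % 3, 3)
    )


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGAAGGCAA GGTTTTTCTA TGTAATGACC CTATTGTTGG CCACTGTACA AATGACTTAT"
last_line = "GATCTCATGA ACTGGTGGAC CGCCATGCCA AAGCTGCAAT AA"

print(translate(first_line))  # MKARFFYVMTLLLATVQMTY
print(translate(last_line))   # DLMNWWTAMPKLQ*
```

The last line is in frame because the 22 preceding lines hold 60 bases each (1320 bases, a multiple of 3). The trailing `*` is the TAA stop codon, which is why the protein is one residue shorter than the codon count.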