Gene Cpin_2743 details


Gene Information

Locus tag: Cpin_2743
Symbol:
ID: 8358904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 3363488
End bp: 3365233
Gene Length: 1746 bp
Protein Length: 581 aa
Translation table: 11
GC content: 48%
IMG OID: 644964923
Product: sulfatase
Protein accession: YP_003122423
Protein GI: 256421770
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
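
The length fields in this record follow from the coordinates. The short sketch below reproduces the arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 3363488
end_bp = 3365233

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# One codon per amino acid, minus the terminal stop codon
# (assumption: unspliced CDS with exactly one stop codon at the end).
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1746 581
```

Both values agree with the Gene Length and Protein Length fields listed in the record.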


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.866766
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACT TCCTCTCCCT CATCGTCTGC CTGAGTGTAT GGTTCCAGGT GTCTGCACAG 
CAGCTGGCTA AGCGGCCGAA CATCATTGTA ATATTAAGTG ACGATATGGG ATATTCGGAT
CTGGGTTGTT ATGGCAGCGA GATCCAGACA CCCAATCTTG ATAAGTTGTA TAAAAACGGG
CTGCGTTTTA CACAGTTTTA TAATACCGCC CGTTGTTGTC CGTCCAGGGC TTCCCTGCTT
ACCGGGTTGT ATCCCCATCA GGCAGGAATG GGCTGGATGC GGGATAAAGA TGCTGGTCTG
GAAGGTTACC AGGGAGGACT GAGTAAACAT GCCGTAACGA TGGCGGAAGT GGTAAAGCCG
GCAGGTTATA CCACATTGAT GGTGGGTAAA TGGCATGTCA GTCAGGGTAT CCGCCAGGAT
GGTCCGCTGG CCAACTGGCC TTTACAACGT GGTTTTGACC GGTTTTACGG TACGATACAG
GGTGCAGGCA GCTATTATGA TCCTGCTACT TTGTGCCGGG GAAATCAGCT GATAACACCC
GCAACTGATA CTGCTTACCA GCCAGCTAAC TATTTCTATA CAGACGCGGT AACGCATGAA
GCGATTCGTT TTATCAGCGA AAGCGACCCT GCTAAGCCGT TCTTTATGTA CCTGGCTTAT
ACGGCCGCAC ACTGGCCTTT ACAGGCAAAA CCGGCTGATA TTAAGAAATA TAAAGGCATG
TATGATAAAG GCTGGGAGGC TATCCGTAAA CAGCGTTTTG AGAAAATGAA GAAACTGGGT
CTCCTGCCTG CCAACGCAGA ACTGTCCCCT TCCGATGCGC CTGACTGGAA AGACGAAAAG
AACAAAGCGG TACAGGCAGC AAGAATGGAA GTATATGCCG CAATGATCGA TGAAATGGAC
CAGGGAATCG GGGAGATTAT GGCACAACTG AAAAAACAGG GTCTCGATGA AAACACCGTT
GTAATGTTTA TGCAGGACAA TGGTGGTTGC GCAGAGGAAA TCGGTACAAA AGGAAAAACC
GGTCCGATAG CAGCAAACCC TGAAAAGATA AAACCAAGGG CAAAAGGCGA AGTAGAGTAT
GATGTGATCC CTAAAATGAC CAGAGACGGT AAACTGATCA TGAAAGGAGA GGGTATCATT
GGTGGTCCGG AAACAACTTA TGTTTCATAT GGAAAGGTAT GGGCCAATGT TTCCAATACG
CCCTTCCGTG AGTATAAACA CTGGGTACAT GAGGGCGGTA TCTCTACACC GCTGATCATT
CATTATCCGG CAGGTATCCG CCAGCATAAC AGCAGCAATT TTGTAGGTCA TTTTATTGAT
ATCATGCCAA CGGTAGCCGA ACTGGCAGGT GCTTCCTATC CATCTGACTG GCATAACAAT
AAGATCATTC CGATGGAGGG GATCAGCCTG GTGCCTTTAT TCACTGGTAA AGCACTGGAC
AGGGGTAAAG CGCTTTGCTG GGAACATGAG ATGAACAGGG CCGTACGCCT GGGAGACTGG
AAACTGGTCG CTAAAGGCGA ATTGCTGAGT GAGAAAGATG GCTATGGTGA ATGGAAGAAC
TATGAGCTGG GTAAATGGGA ACTCTATAAT ATCAAAACTG ACAGAAGTGA ATTACACGAT
GTATCCGCCC AACATCCTGA TATGGTAAAG GAAATGAGTG CGATCTGGGA TGATTATGCC
AAACGCGCTA AAGTATTACC AGCTCCATGG ACGCCATTGC AGGGAACACC TGAAAGCGGA
CAGTAA
 
Protein sequence
MKNFLSLIVC LSVWFQVSAQ QLAKRPNIIV ILSDDMGYSD LGCYGSEIQT PNLDKLYKNG 
LRFTQFYNTA RCCPSRASLL TGLYPHQAGM GWMRDKDAGL EGYQGGLSKH AVTMAEVVKP
AGYTTLMVGK WHVSQGIRQD GPLANWPLQR GFDRFYGTIQ GAGSYYDPAT LCRGNQLITP
ATDTAYQPAN YFYTDAVTHE AIRFISESDP AKPFFMYLAY TAAHWPLQAK PADIKKYKGM
YDKGWEAIRK QRFEKMKKLG LLPANAELSP SDAPDWKDEK NKAVQAARME VYAAMIDEMD
QGIGEIMAQL KKQGLDENTV VMFMQDNGGC AEEIGTKGKT GPIAANPEKI KPRAKGEVEY
DVIPKMTRDG KLIMKGEGII GGPETTYVSY GKVWANVSNT PFREYKHWVH EGGISTPLII
HYPAGIRQHN SSNFVGHFID IMPTVAELAG ASYPSDWHNN KIIPMEGISL VPLFTGKALD
RGKALCWEHE MNRAVRLGDW KLVAKGELLS EKDGYGEWKN YELGKWELYN IKTDRSELHD
VSAQHPDMVK EMSAIWDDYA KRAKVLPAPW TPLQGTPESG Q
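
The gene and protein sequences above are printed in blocks of ten characters, so they need the whitespace removed before any analysis. The following is a minimal sketch of how the nucleotide sequence could be cleaned, checked for GC content, and translated. The function names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool. The codon table is the standard NCBI amino-acid assignment, which translation table 11 shares with table 1 (the two differ only in the permitted start codons).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw):
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAAAACT TCCTCTCCCT CATCGTCTGC CTGAGTGTAT GGTTCCAGGT GTCTGCACAG"
dna = clean_sequence(first_line)
print(len(dna), translate(dna))
```

For this first line, the translation reproduces the first 20 residues of the protein sequence listed above (MKNFLSLIVC LSVWFQVSAQ). Passing the full gene sequence through the same functions would allow the GC content and protein length fields to be checked in the same way.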