Gene Cpin_5313 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5313 
Symbol 
ID8361490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6783198 
End bp6785036 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content49% 
IMG OID644967461 
Productsulfatase 
Protein accessionYP_003124945 
Protein GI256424292 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000397774 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.199785 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCT TCACTTTAAG CACAGTGATA CTTGTAAGCG GGCTAGCGCT GAAGGCCCAG 
CAACCTTATC AGGGTACGGT GGGCCGTACC CTGGCGGACT CAAAAGAATG GTGGCCGGCG
CCGGTAAAGC CAGCTGCAGG GTCACCCAAT GTGATCTGGA TACTCCTGGA TGATGTAGGT
TTCGGAGCCA CCAGTACATT TGGCGGTGTG ATCAGTACAC CGACATTTGA TAGTCTGGCG
AATAACGGCC TGCGTTATAC CAATTTCCAT ACTGCCGCGA TCTGCGCCCC TACCAGGGCA
GCACTGATGA CCGGCAGGAA CCATCATGCG GTGCATATGG GCGGGTTTGC ACATTACTTC
TCTTCAGCAG GTTTCCCTGG TTATGACGGA CGTATTCCTT CCAGTTCAGG GACTATCGCA
GAAATCCTGA AAGAGAGCGG TTACAACACC TTTGCGGTGG GTAAATATGG TCTGACGCCT
GACGAGGATA CCTCTGATGC AGGTCCTTTC GACCGCTGGC CTTCAGGTAA GGGATTCGAA
CACTTCTTTG GTTTCCTGGG ATCGGAGACC GACCAATATA AACCGGCCCT GGTGGAAGAC
AACGTGAATA TCAAACCAGA TGGCAGACAC CTGAGTGAGC AGATCACGGA TAAAGCCATC
AGTTATATCG CCCGCCAGAA GAAAGCGGCG CCTGACAAGC CCTTCTTCCT GTATTATGCA
CCCGGCGCTA CACATGCGCC GCACCAGGTG ACGCCTGAAT GGAGCGATCG TTATAAAGGA
AAATTTGATG GTGGATGGGA TGTGTTCAGG GAACAGGTCT TTGCCAATCA GAAGAAGCAG
GGTATCATTC CTGCGAATGC AAAACTGCCG GAACGAAATG AAGATATACA GGCATGGAAT
ACCCTCAAGC CGGATCAGCA GAAACTGTAT GCCCGCTTTA TGGAGATATA TGCCGGTTAT
CTGACTTATA CCGACCATGA AGTAAGTCGT GTAATTAACT ACCTGCGTTC TATCAATCAG
CTGGACAACA CCCTGATCTT TGTGGTGCTG GGCGATAATG GTGGCAGCAA GGAAGGTACC
CAGGAAGGAA CGGTTTCAAA GGCTTATACG CCGAGAAGAA ACAAAGGACT GACCCGCGAT
TCGGCCAGGT TGTTCAATGA GGCACATATC AGCGAGATTG GTACACCGGC TTCTGACGCC
AACTATCCTT TAGGATGGGC ACAGGCTTCC AATACGCCGT TTAAATACTG GAAACAGGAT
GCCAATTCTG AAGGAGGAAC CCGTAATCCG CTGATCGTCT TTTATCCTAA GGGAATCAAG
GAAAAAGGCA TCCGTACACA ATATGGCTAT GTGTCAGACC TGCTGCCGAC GACACTGGAA
TTCCTGAAAA TACCATTCCC GCAGGAAATC AAAGGAGTCA AACAGGATAG TCTCCATGGT
ACTTCGCTGG TATATTCTTT TGAAAATGCC AGTGCACCTT CCCGTCATAC CGAACAGTAC
TACTACATAT TTGGCTCCCG CGCGATTTAT AAAGACGGAT GGAAGGCAGG CGCTGCACAT
CATCCTGATA TGGTGGAACT GAATGATTAT TCCGGTAGTG CCAAGCAGGT ACCTGCAAAA
AACTTTGACA AGGATGTATG GGAGCTGTAT AATCTGAATG AAGACTTTAA TGAGCGCGTG
AACCTGGCGG AGAAGTATCC GGAGAAACTG GCAGAATTGA AACAATTGTT TGAAGCCAAT
GCGAAAAAGT ATAATATCTA TCCGTTTATC GATTGGGAGG ATGTATTCCG CGCAAGGCTA
ATTAATTCAA AAAAGGCATT CAGTGCCGCT GCAAAATGA
 
Protein sequence
MKFFTLSTVI LVSGLALKAQ QPYQGTVGRT LADSKEWWPA PVKPAAGSPN VIWILLDDVG 
FGATSTFGGV ISTPTFDSLA NNGLRYTNFH TAAICAPTRA ALMTGRNHHA VHMGGFAHYF
SSAGFPGYDG RIPSSSGTIA EILKESGYNT FAVGKYGLTP DEDTSDAGPF DRWPSGKGFE
HFFGFLGSET DQYKPALVED NVNIKPDGRH LSEQITDKAI SYIARQKKAA PDKPFFLYYA
PGATHAPHQV TPEWSDRYKG KFDGGWDVFR EQVFANQKKQ GIIPANAKLP ERNEDIQAWN
TLKPDQQKLY ARFMEIYAGY LTYTDHEVSR VINYLRSINQ LDNTLIFVVL GDNGGSKEGT
QEGTVSKAYT PRRNKGLTRD SARLFNEAHI SEIGTPASDA NYPLGWAQAS NTPFKYWKQD
ANSEGGTRNP LIVFYPKGIK EKGIRTQYGY VSDLLPTTLE FLKIPFPQEI KGVKQDSLHG
TSLVYSFENA SAPSRHTEQY YYIFGSRAIY KDGWKAGAAH HPDMVELNDY SGSAKQVPAK
NFDKDVWELY NLNEDFNERV NLAEKYPEKL AELKQLFEAN AKKYNIYPFI DWEDVFRARL
INSKKAFSAA AK