Gene Coch_0440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCoch_0440 
Symbol 
ID8366846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCapnocytophaga ochracea DSM 7271 
KingdomBacteria 
Replicon accessionNC_013162 
Strand
Start bp556286 
End bp557803 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content47% 
IMG OID644982861 
Productsulfatase 
Protein accessionYP_003140564 
Protein GI256819285 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0138449 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT ACTTATTAGG GCTCTTATGC CCTTTATGTC TCAACGCACA AAACAAAACT 
ACTCAAAAAC CTAACGTCAT CATCATTGTA GCCGATGATT TAGGCTATGG CGACCTAAGT
TGCTATGGCG AAAAGACAAT TCATACCCCA CAGGTAGCCT CTCTCGCCAA ACAAGGAATA
GTCTTTACGA ACGTACATTC CACCGCTGCC ACTTGCACAC CTTCGCGCTA TTCACTCTTC
ACGGGCTTGT ACAATTGGCG ACGCAATGAC ACTGGTATTG CTCCTGGTGA TGCTGCTATG
GTCATTCGTC CTGAGCAAAC CACCATTGCC GATGTGTTCA AATCAGCAGG CTATACTACT
GGCGCTATTG GCAAATGGCA CTTAGGCTTA GGAGGCGAAC GCGGTAAACA GGATTGGAAC
GGCTTTATCA CACCTGGACC TTCCGATATA GGCTTCGAAT ATTCCTGCAT AATGGCTGCC
ACAGCCGACC GTGTACCTTG CGTGTGGATA GAAAACCAAC GCGTAGCCAA TTACGACCCC
TCAGCACCCA TCGAAGTAAG TTATAGCCTG CCTTTTAAAG GTGAACCTAC GGGTAAGAAC
AACCCCGAAC TGCTCACTAA ACTCAAACCT TCCTTACACC ACGGTCACGA CCAAACCATA
GTGAACGGCA TTTCGCGTAT CGGCTATATG AAAGGGGGTG GCAAAGCACT CTGGACGGAC
GAACATCTCG CCGATACCAT TGTAGCCAAA ACGGTAAAAT ACATCGAAAA TCACAAATCA
CAACCTTTCT TTCTCTATGT GGGTACTAAT GATATACACG TACCTCGCTA CCCCAATCCA
CGTTTTGTAG GCAAAAGTGG TATGGGTTAC CGCGGTGATG CAATTCTCCA ATTCGACTGG
ACTGTGGGTG AAATCGTAAA GGCCCTCAAA GCTAATAAGC TATACGATAA TACGCTTATC
ATTATCACCA GCGACAACGG TCCTGTGATA GACGATGGTT ATCAAGATCA AGCCGAAGAG
CTCTTAGGCA GACATCGTCC TTGGGGAGCT TTTCACAACC ACGGAGGCAA GTACAGCAAT
TACGAAGCAG GTACGCGTGT ACCCTTTATC GTACGCTATC CTAAGATTGT GAAAAAAGGT
ACTTCCGACG CTTTGCTTTC ACATATCGAC CTCTTTGCCT CTTTGAGTAA ATTTATTGGT
GCTGAGATAC CCGCAGGAGT AGCCACCGAT AGCGAAGATT ACCTCAAAGC CTTCTTAGGC
AAGGACAAGA AAGGTCGCCC TTATGTAATA GCCTCTGGGG GAGCGCTCTC CATCACCGAT
GGTCGTTGGA AATATGTAGT CCCCAGCGAT AACCCCTCTT ATCAACCCCT CACTCGCACA
CATCTGGGCA ACTATCCTGA GCCCCGATTG TACGACCTTA AAGAAGACAT AATGGAGCTC
TACAATGTAG CAAAAGACCA TCCCGAAGAG CTCGTCAGAC TCAAAACAAT GCTCGACAAT
ATTAAAAACA GAAAATAG
 
Protein sequence
MKKYLLGLLC PLCLNAQNKT TQKPNVIIIV ADDLGYGDLS CYGEKTIHTP QVASLAKQGI 
VFTNVHSTAA TCTPSRYSLF TGLYNWRRND TGIAPGDAAM VIRPEQTTIA DVFKSAGYTT
GAIGKWHLGL GGERGKQDWN GFITPGPSDI GFEYSCIMAA TADRVPCVWI ENQRVANYDP
SAPIEVSYSL PFKGEPTGKN NPELLTKLKP SLHHGHDQTI VNGISRIGYM KGGGKALWTD
EHLADTIVAK TVKYIENHKS QPFFLYVGTN DIHVPRYPNP RFVGKSGMGY RGDAILQFDW
TVGEIVKALK ANKLYDNTLI IITSDNGPVI DDGYQDQAEE LLGRHRPWGA FHNHGGKYSN
YEAGTRVPFI VRYPKIVKKG TSDALLSHID LFASLSKFIG AEIPAGVATD SEDYLKAFLG
KDKKGRPYVI ASGGALSITD GRWKYVVPSD NPSYQPLTRT HLGNYPEPRL YDLKEDIMEL
YNVAKDHPEE LVRLKTMLDN IKNRK