Gene Coch_1808 details


Gene Information

Locus tag: Coch_1808
Symbol:
ID: 8368256
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Capnocytophaga ochracea DSM 7271
Kingdom: Bacteria
Replicon accession: NC_013162
Strand:
Start bp: 2157732
End bp: 2159267
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 39%
IMG OID: 644984250
Product: sulfatase
Protein accession: YP_003141914
Protein GI: 256820635
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
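
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2157732
end_bp = 2159267

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codon_count, leftover = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codon_count - 1

print(gene_length)     # 1536
print(leftover)        # 0
print(protein_length)  # 511
```

Under these assumptions the computed values match the listed gene length (1536 bp) and protein length (511 aa).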


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT TCGTATTCTT CTTTGGGCTT TTATATGCCA CAACAGTTTT TGCACAAAGC 
AAACCTAATA TTATACTTAT TGTTTCAGAC GACCATTCGT ATCAAACGAT TGGAGCTTAT
AACAATGGGG CTACTAATGC AACTCCTGCA ATTGATAAAC TTGCTAATGA GGGTGTAAGG
TTCAACAAAG CTTTTGTAAC CAACTCCATT TGTGGTCCTA GTAGGGCTTG TATTCTCACG
GGTAAGTATA GCCATAAGAA TGGCTTTATG GACAATGAGA CATCACACTA TAATTCTTCA
CAACAACAGT TTGTGAATCT CTTGCAACAA GGAGGCTACC AAACTGCTTG GGTGGGAAAA
TACCACTTAG GTGATGACCC CAAGGGCTTT GACTTCTTTA AAATCTTAGT AGACCAAGGT
CGTTATTTTA ACCCCGACTT TATCATTGAA GGTAAAAAAC GTGTAAACGA GCAAGGTTAT
GTAAGCAATA TCATTGAAGA TGAAGCCGAA AAATGGCTCG ATCGTCGTGA CCCTAATAAA
CCTTTCTGCT TGGTTGTGGG GCATAAAGCG GTGCATCGTA CTTGGATGCC CGATTTACCT
GAATTAGGAG CTTATGAACA GGTAAACTTC CCATTGCCCG ATACTTTCTT TGACGATTAT
GCAACTCGCA AGCCCGCAAG TTTGCAAGAA ATGTCCATTG GTAAGGATAT GATAATGGGT
TACGATTTGA AGATGTTCAA AGATGAAAAA GAAGAAGTGC AAGATGCAAA CTTCTCTCGT
ATGACCGCTA TTCAATTAGC ACATTACAAC GATTTTTACA ATCCTATACG TCAACAGTAT
TTCAGCGCAA ACCTCAGTGG TAAAGAATTA GCACAATGGA AGTTTCAACG CTATATGCGT
GATTATCTAT CTACCGTAAG ATCGATGGAT AAGAACATTC AGCGTATGCT CGATTACTTA
GAAAAGCACA AATTGAAAGA CAATACTGTG ATTATATATA TGTCCGACCA AGGTTTTTAT
ATGGGCGAGC ACGGTTGGTT TGACAAACGT TGGATGTACG AAGAATCTTT CCGTACCCCA
ATGATTGTCT CTTACCCTAA GCTCTTTCCT AAAGGCACAA CCAACGATGA TTTTGTGTTG
AACATCGACT TGGCGCCTAC TTTCTTAGAA TTAGCCGGCT TGCCTATCCC TGCTGACATA
CAAGGCAAAT CGTTCTTACC TTTGTTTGCT AAAAAAGCCA AACCTATTCG CAATCAAATA
TTCTATCACT ACTACGAAAA CGGAGAACAC GCAGTATCAC CTCATTTTGG TGTTCGCACT
GATCGTTATA AACTCATTCG TTACTACAAA CGCGTGAATA CTTGGGAGCT ATTTGACCTC
AAAACCGATC CTAAAGAACT CAGCAACGTC TATGGCAAAA AAGAATATCT GAAAATTACT
AAACAGTTAG AAGCTTTACT TTTAAAAGAA ATTCGAGACA AGCAAGACGA TTTGGCTGAA
AAGGTATTTT TCAATAAAGA GTTTCCTGTG AAATGA
 
Protein sequence
MKKFVFFFGL LYATTVFAQS KPNIILIVSD DHSYQTIGAY NNGATNATPA IDKLANEGVR 
FNKAFVTNSI CGPSRACILT GKYSHKNGFM DNETSHYNSS QQQFVNLLQQ GGYQTAWVGK
YHLGDDPKGF DFFKILVDQG RYFNPDFIIE GKKRVNEQGY VSNIIEDEAE KWLDRRDPNK
PFCLVVGHKA VHRTWMPDLP ELGAYEQVNF PLPDTFFDDY ATRKPASLQE MSIGKDMIMG
YDLKMFKDEK EEVQDANFSR MTAIQLAHYN DFYNPIRQQY FSANLSGKEL AQWKFQRYMR
DYLSTVRSMD KNIQRMLDYL EKHKLKDNTV IIYMSDQGFY MGEHGWFDKR WMYEESFRTP
MIVSYPKLFP KGTTNDDFVL NIDLAPTFLE LAGLPIPADI QGKSFLPLFA KKAKPIRNQI
FYHYYENGEH AVSPHFGVRT DRYKLIRYYK RVNTWELFDL KTDPKELSNV YGKKEYLKIT
KQLEALLLKE IRDKQDDLAE KVFFNKEFPV K
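
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the database's own method. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code for elongation codons, and it ignores the alternative start codons of table 11, which do not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
# Amino-acid assignments for NCBI translation table 11, in the
# conventional T, C, A, G order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAAAAAAT TCGTATTCTT CTTTGGGCTT"))  # MKKFVFFFGL

# Final line of the gene sequence (36 bases, in frame, ending in TGA).
print(translate("AAGGTATTTT TCAATAAAGA GTTTCCTGTG AAATGA"))  # KVFFNKEFPVK
```

Both fragments reproduce the corresponding ends of the protein sequence listed above. Passing the full gene sequence to the same function should yield the complete 511-residue protein under the stated assumptions.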