Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Coch_1808 |
Symbol | |
ID | 8368256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Capnocytophaga ochracea DSM 7271 |
Kingdom | Bacteria |
Replicon accession | NC_013162 |
Strand | - |
Start bp | 2157732 |
End bp | 2159267 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644984250 |
Product | sulfatase |
Protein accession | YP_003141914 |
Protein GI | 256820635 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TCGTATTCTT CTTTGGGCTT TTATATGCCA CAACAGTTTT TGCACAAAGC AAACCTAATA TTATACTTAT TGTTTCAGAC GACCATTCGT ATCAAACGAT TGGAGCTTAT AACAATGGGG CTACTAATGC AACTCCTGCA ATTGATAAAC TTGCTAATGA GGGTGTAAGG TTCAACAAAG CTTTTGTAAC CAACTCCATT TGTGGTCCTA GTAGGGCTTG TATTCTCACG GGTAAGTATA GCCATAAGAA TGGCTTTATG GACAATGAGA CATCACACTA TAATTCTTCA CAACAACAGT TTGTGAATCT CTTGCAACAA GGAGGCTACC AAACTGCTTG GGTGGGAAAA TACCACTTAG GTGATGACCC CAAGGGCTTT GACTTCTTTA AAATCTTAGT AGACCAAGGT CGTTATTTTA ACCCCGACTT TATCATTGAA GGTAAAAAAC GTGTAAACGA GCAAGGTTAT GTAAGCAATA TCATTGAAGA TGAAGCCGAA AAATGGCTCG ATCGTCGTGA CCCTAATAAA CCTTTCTGCT TGGTTGTGGG GCATAAAGCG GTGCATCGTA CTTGGATGCC CGATTTACCT GAATTAGGAG CTTATGAACA GGTAAACTTC CCATTGCCCG ATACTTTCTT TGACGATTAT GCAACTCGCA AGCCCGCAAG TTTGCAAGAA ATGTCCATTG GTAAGGATAT GATAATGGGT TACGATTTGA AGATGTTCAA AGATGAAAAA GAAGAAGTGC AAGATGCAAA CTTCTCTCGT ATGACCGCTA TTCAATTAGC ACATTACAAC GATTTTTACA ATCCTATACG TCAACAGTAT TTCAGCGCAA ACCTCAGTGG TAAAGAATTA GCACAATGGA AGTTTCAACG CTATATGCGT GATTATCTAT CTACCGTAAG ATCGATGGAT AAGAACATTC AGCGTATGCT CGATTACTTA GAAAAGCACA AATTGAAAGA CAATACTGTG ATTATATATA TGTCCGACCA AGGTTTTTAT ATGGGCGAGC ACGGTTGGTT TGACAAACGT TGGATGTACG AAGAATCTTT CCGTACCCCA ATGATTGTCT CTTACCCTAA GCTCTTTCCT AAAGGCACAA CCAACGATGA TTTTGTGTTG AACATCGACT TGGCGCCTAC TTTCTTAGAA TTAGCCGGCT TGCCTATCCC TGCTGACATA CAAGGCAAAT CGTTCTTACC TTTGTTTGCT AAAAAAGCCA AACCTATTCG CAATCAAATA TTCTATCACT ACTACGAAAA CGGAGAACAC GCAGTATCAC CTCATTTTGG TGTTCGCACT GATCGTTATA AACTCATTCG TTACTACAAA CGCGTGAATA CTTGGGAGCT ATTTGACCTC AAAACCGATC CTAAAGAACT CAGCAACGTC TATGGCAAAA AAGAATATCT GAAAATTACT AAACAGTTAG AAGCTTTACT TTTAAAAGAA ATTCGAGACA AGCAAGACGA TTTGGCTGAA AAGGTATTTT TCAATAAAGA GTTTCCTGTG AAATGA
|
Protein sequence | MKKFVFFFGL LYATTVFAQS KPNIILIVSD DHSYQTIGAY NNGATNATPA IDKLANEGVR FNKAFVTNSI CGPSRACILT GKYSHKNGFM DNETSHYNSS QQQFVNLLQQ GGYQTAWVGK YHLGDDPKGF DFFKILVDQG RYFNPDFIIE GKKRVNEQGY VSNIIEDEAE KWLDRRDPNK PFCLVVGHKA VHRTWMPDLP ELGAYEQVNF PLPDTFFDDY ATRKPASLQE MSIGKDMIMG YDLKMFKDEK EEVQDANFSR MTAIQLAHYN DFYNPIRQQY FSANLSGKEL AQWKFQRYMR DYLSTVRSMD KNIQRMLDYL EKHKLKDNTV IIYMSDQGFY MGEHGWFDKR WMYEESFRTP MIVSYPKLFP KGTTNDDFVL NIDLAPTFLE LAGLPIPADI QGKSFLPLFA KKAKPIRNQI FYHYYENGEH AVSPHFGVRT DRYKLIRYYK RVNTWELFDL KTDPKELSNV YGKKEYLKIT KQLEALLLKE IRDKQDDLAE KVFFNKEFPV K
|
| |