Gene Information |
Locus tag | Coch_0295 |
Symbol | |
ID | 8366701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Capnocytophaga ochracea DSM 7271 |
Kingdom | Bacteria |
Replicon accession | NC_013162 |
Strand | - |
Start bp | 373934 |
End bp | 376921 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644982716 |
Product | vault protein inter-alpha-trypsin |
Protein accession | YP_003140419 |
Protein GI | 256819140 |
COG category | [S] Function unknown |
COG ID | [COG4676] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
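The length fields above follow from the coordinates. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon (the function names are illustrative and not part of the record):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 373934, End bp 376921.
gene_len = gene_length_bp(373934, 376921)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

With the record's coordinates this gives 2988 bp and 995 aa, matching the Gene Length and Protein Length fields.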
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00370516 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATCA AGCATTTTTT ATCAGCGACA GCTTTACTAC TTCCCTTACT GCTAACCGCA CAGAAATCGG TAGTGCCAGA GGTGAAGGTA GTGAATGAGC GCAATGCCAA CCCGATGGTA TTGCAAGACC TTTCAGTAGA TATATTAGTG GTAGGACAAA CGGCTGTTAC TACTATGGAA ATGACCTTCT ACAACCCCAA CACCCGCGTA ATGGAAGGCG AGTTTCAGTT CCCGCTTGCC GACGGACAGC AGGTGTCGCG CTTTGCCTTA GATATCAATG GCAAACTGCG TGAAGGTGTG GTAGTAGACA AAGCCTTAGG GCGCAAAGCC TTTGAGGATA TTGTGCGCAG GGGTGTAGAC CCAGGCTTAT TAGAAAAAAC TGAGGGCAAC AACTTTAAAG CACGTGTATA CCCTATGCCT GCCAAAGGTA CTCGCCGTGT ACTCATCGCC TTTGAACAAG AACTACACGA ACGAGGCGGA CAAGATTACT ACTTCTTGCC TATCACGGCT AACGTAACCC TTAAAAACTT CAAGGTGCGT ACTGAGGTAG TATCACGCTT TGTAAAGGCA GATATTCAGA ACAGTCTGCA ATTAGACTTC AAACAAGCGC GCAATAGCTA TATCAGTGAG GTAAGCAAGC AAAACTTCAC GCTCAACCAA AACATTGCAC TTACTTTCCC GAAAATTGAG AAACCACAAA CCATTAGCGC TACCCAAGGC AGTAAAAGCT ATTTTTATGG CAATATAGCT TTGAGCGATA CCAAAGCTAA AAGTAGCCCC ACTCCTAAGG AAATAGGACT CTTATGGGAC GCATCGCACT CAGCAATCCA ACGCGATAGA GCAAAAGAGT TTGCTTTCTT AGATGCTTAT TTTAAGGAAC TCAAAGACAC TAAGGTAGTG CTTAGTACTT TTAATATTCG TTCTGCTAAA CCGCTTACTT TTGAGGTGAA AAACGGTAAT TGGCAAGCAC TGAAATCGCA TTTAGAAAGT TTGCAATACG ATGGTGCTAC CGATGGTAAT GCTATTGATT TCAACCTCAA AACAGATGAA ATATTACTTT TTAGCGACGG TATTTTCAAC TTTGGCAGTA AAGAATTTTC TGTAAACGAA GTAGTGAAAC AAGCTAAAAC CCCTATAACA GTGGTCAATG CTTCGGCAGT TGCTAATACT CCAAAAATGC AATACCTCGC CAATGCTACG GGTGGTAACT TTATAGATTT GACAACCCTC ACTACTGAGC AAGCAATAAA AGTAGCACGC ACTGTACCTT TCCAACTGCT AGATATAGAA GTGAAAAGCG GTAAAGTAAC CAAAATTTTC CCGCAGAAAG GAGCTACTAT TAGCAAAGGT AATTTCACCC TTGCAGGTGA ATTGCAAAGC GAGGAAGCAA CCTTAGTACT TAGTTTTGGT TACCCTAAAA AGGTAATGGT GCAAAAAGAA GTAAAATTTG TAGCAAACCC CGATGCTTCT GAAAGTGAGT TCAATCTTTT GCGCCGTATA TGGGCAGAGA AACAAATTGC ACAACTACAA CGAGAAGGTG TTGAACAAAA ACAAATAGAT GCGGTAGGGC GTGAGTACGG CATTGTAACC GAAGGTAATT CGCTGATAGT ATTAGAAACC GTAGAAGATT ATGTACGTTA CCGCATTACG CCTCCTACCG AATTACAGCA AGAGTACTCC AAACGCTTAG CAAATGAGCA AAAGCAAAAA GAGGATACTG CTAAGCGTAT TTTGGATAGA GTCGTAGAGC AATCGGAAAA GCAAAGTAAA TGGTGGCATA CCGAATATCC TGTGAAAGGC 
ACTGAACCTA AGAAAAATGT TAATAACTCT AATGATACTC CTGTAAGAAT AAGAGGGGTG GCTTCTGGGG TAGCACAAGA AGTTAGAAGT GAGGAGGTTG CTGCTATAGA AGCTGACGAA AGTGCAGAAT TAAATGAAGT AGTTGTAGTA GGTTATTCTC CACGAAGAAA AGCAGCTATG ACGGGAGCTA TAAATAGTAG AGTTGCTGAT TCTCCTAGTG GTAACACTAC TGCTAAAAAA GATGTTTTTT TATCACGCAA ACCAGCAAGT CCTGTTCCTG TACCTGCCTC AAAAATAGAG CTCAACGCCT ATAACCCCGA TACCCCTTAT TTAAAGGTAA TGGAGTATAC TGAGGAAGCA AAAGCGGTAG AAACCTATTA CAAACTCAAA AAAGAATACG GTAACACACC TTCCTTTTAT GTAGATGTAG CCGACTATTT CTTTAAGAAA GGTAATCGTG AGCAAGCTAT CTTGGTAGTT TCCAATTTGG CAGAGCTCGG TTTAGACGAC CCTCAGTTGT TGCGTATGTT AGGCTACAAG CTCAGCAACT ACAATGCTAA GAAAGAAGCG GTATGGGTAT TCCGTAAGGT GGTAACTCTG CGTGAAGAAG AACCACAGTC TTTTCGTGAT TTGGGCTTAG CACTTGCCGA CGATGGTGCC TATAACGAAG CCGTGAAAAA CCTTTATAAA GTAGTTACAA GCGAGTGGAG CAGTCGTTTT GGCGATGTAC AAATCGTAAC AATGAACGAT ATCAATAGCC TCGTCGCACG CCACAAGGGT ATAGACGTGA GTTATATCGA CAAGCGTTTG CTCAAAAAAG AACCCGTAGA TGTGCGAGTA GTGCTCAGTT GGGATACAGA TAGCTGCGAT ATGGACTTAT GGGTAACCGA CCCCAAAGAC GAGAAGTGTT ACTACCGAAA CACTCTTACT TACCTTGGTG GTAAAATCAC ACGTGATGTT ACTCAAGGCT ACGGACCTGA AGAATTTATG CTCAAGAAAG CCGAAAAAGG CAAGTACAAA GTGCAAGTGG ATTACTTTGG TACTCGCTCA CAAAAGCAAC TGATGCCTGT GAACTTGCGT ATTACTTTCT ATACTCACTA TGGTACGCCT CAGCAAAAAC AGCAAGAAAC CACTGTACGC CTCAGTAATG CCAAGGAGGT GATTGAAGTA GGAAGTTTTG AATTTTAA
|
Protein sequence | MRIKHFLSAT ALLLPLLLTA QKSVVPEVKV VNERNANPMV LQDLSVDILV VGQTAVTTME MTFYNPNTRV MEGEFQFPLA DGQQVSRFAL DINGKLREGV VVDKALGRKA FEDIVRRGVD PGLLEKTEGN NFKARVYPMP AKGTRRVLIA FEQELHERGG QDYYFLPITA NVTLKNFKVR TEVVSRFVKA DIQNSLQLDF KQARNSYISE VSKQNFTLNQ NIALTFPKIE KPQTISATQG SKSYFYGNIA LSDTKAKSSP TPKEIGLLWD ASHSAIQRDR AKEFAFLDAY FKELKDTKVV LSTFNIRSAK PLTFEVKNGN WQALKSHLES LQYDGATDGN AIDFNLKTDE ILLFSDGIFN FGSKEFSVNE VVKQAKTPIT VVNASAVANT PKMQYLANAT GGNFIDLTTL TTEQAIKVAR TVPFQLLDIE VKSGKVTKIF PQKGATISKG NFTLAGELQS EEATLVLSFG YPKKVMVQKE VKFVANPDAS ESEFNLLRRI WAEKQIAQLQ REGVEQKQID AVGREYGIVT EGNSLIVLET VEDYVRYRIT PPTELQQEYS KRLANEQKQK EDTAKRILDR VVEQSEKQSK WWHTEYPVKG TEPKKNVNNS NDTPVRIRGV ASGVAQEVRS EEVAAIEADE SAELNEVVVV GYSPRRKAAM TGAINSRVAD SPSGNTTAKK DVFLSRKPAS PVPVPASKIE LNAYNPDTPY LKVMEYTEEA KAVETYYKLK KEYGNTPSFY VDVADYFFKK GNREQAILVV SNLAELGLDD PQLLRMLGYK LSNYNAKKEA VWVFRKVVTL REEEPQSFRD LGLALADDGA YNEAVKNLYK VVTSEWSSRF GDVQIVTMND INSLVARHKG IDVSYIDKRL LKKEPVDVRV VLSWDTDSCD MDLWVTDPKD EKCYYRNTLT YLGGKITRDV TQGYGPEEFM LKKAEKGKYK VQVDYFGTRS QKQLMPVNLR ITFYTHYGTP QQKQQETTVR LSNAKEVIEV GSFEF
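The protein sequence can be checked against the gene sequence by translating codons. The sketch below is an illustration rather than the database's own pipeline: it builds the codon table by hand, assuming that translation table 11 assigns amino acids to codons in the same way as the standard code (the two differ in which start codons are permitted). It translates only the first 30 and last 18 nucleotides of the gene sequence, copied from the record. The gene sequence is listed in coding orientation even though the gene lies on the minus strand, so no reverse complement is taken.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 nucleotides of the gene sequence in this record.
first_30 = "ATGAGAATCA AGCATTTTTT ATCAGCGACA"
last_18 = "GGAAGTTTTG AATTTTAA"

print(translate(first_30))
print(translate(last_18))
```

The two fragments translate to MRIKHFLSAT and GSFEF, which are the first ten and last five residues of the protein sequence above.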