Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Coch_1807 |
Symbol | |
ID | 8368255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Capnocytophaga ochracea DSM 7271 |
Kingdom | Bacteria |
Replicon accession | NC_013162 |
Strand | - |
Start bp | 2154577 |
End bp | 2157684 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644984249 |
Product | Beta-galactosidase |
Protein accession | YP_003141913 |
Protein GI | 256820634 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.489965 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAACT ACCTATACAT TATTATATCA TTTTTTCTAA CTATAGTCTC TTATGCCCAA CACTTGGATT CCATTTTTGA AAATCCAGCA CTGCAAGAAA TCAATCGAAT GTCAATGCGT GCCTCCTACT TTCCGTTTGA GAATATAGCT AAAGCAAAAA ATGGTATGAT AGAGCAATCA GCACGTTTCC TAAATCTCAA TGGCTTGTGG TCATTCCTTT GGAAAGAAGA CTACCGTCAG TTGCCAAAAG ACTTCTACAA AACCAATTTT AACGAAAGTC AGTGGAAGAA AATTCCTGTA CCTTCTAATT GGGAAGTACA GGGATATGGC ATTCCAATTT ACGTGAACGC TTCTTACGAA TTCAATCAAA AGAACCCTAC TCCTCCCGAT ATTCCTGATA GTCTCCAACA AAATGCTGGT CTGTATCGCA AAACTTTTGA TTTGCCCACT TCTTGGCAAG GAGAAAAGGT ATATCTACAT TTAGGAGCAG TGAAATCGGC ATTTAAACTG TATATAAACG GTAAATTTGT GGGTATGGGT AAAGACAGCA AGTTGGCTTC TGAATTTGAC ATTACACCTT ATATCACTAA GGGCAAAAAC CTTATTGCAA TGGAAGTGCG TCGATGGACG GACGCAAGCT ATCTCGAATG CCAAGATATG TGGCGCTTTT CTGGGATTTC TCGTGATTGC TACCTATATA TGCGACCTAA AGTGCATTTA TACGACCTTA GTATCAGCGC TGGTTTAGAT AAAAACTACA CCAATGGCAA ACTCACCACT TCGGTAGAAG TATGGAACGA AACTCCAAGC GATGTCTCCA AATACCAAGT GGAAGTAAGT CTGTTCGACA AAGAACAACT GCTCTACCAA GAGCAAAAAG CAACTATCGG ACTCAAAAAA GCCTTTGGTA AAACCGAATT GCAATTCGAA GCACAATTGC CACAAGTGAG AGCGTGGAGT GCTGAAACGC CTTATCTGTA CCGTTTACAA ATGGCGCTGT ACGATGCTGA GGGAAAGGTG AAAGAAGTAG TAAGTCGTCC CATAGGCTTT AGAACGATTG AGATAGAAGG GGCTAACATT TTGGTAAACG GCAAACGCAT ACTCTTCAAA GGGGTGAACC GTCACGAAAC CGACCCGCAT ACGGGACAAG TAGTGAGTCA AGAACAGATG GAGAACGATG TAAAGCAGAT GAAAGCCCTC AATTTTAATG CGGTACGTAC CTCTCATTAC CCTAACGACC CTTATTTCTA TGACCTTTGT GATAAATATG GGCTTTATGT AATGGACGAG GCTAATATAG AAAGCCACGG TATGCATTAT GAAATGGATA AGACTATCGG TAATGACCCC GTGTGGGAAT ATGCTCATTT GCTACGTATG GAACGTATGG TAAAGCGCGA TAAAAACCAC CCATCAGTAT TGTTTTGGAG TATGGGCAAC GAGTCAGGTA ACGGTTGGAA CTTCTACAAA GGTTATCAAC ACATAAAAGG CTTAGACTCC TCACGCCCTA TTCACTATGA ATTAGCTCAC TACGATTGGA ATACTGATAT TGAGTCGCGT ATGTACCGTC GTATTCCTTT CCTTATCGAC TATGCGCTCA GCAATCATAC TAAACCCTTC TTGCAATGTG AATACGCTCA CGCAATGGGT AACAGTGTAG GTAACTTTCA AGAATATTGG GACGTGTATG AACATTATCC TAAACTACAA GGTGGTTTTA TTTGGGATTT CATCGATCAA GGGCTTTATA AAACCTTATC TAATGGCAAG AAAATAGTAA CCTACGGGGG TGATTATGGT GATAAAAATA CTCCCAGCGA TAATAATTTT CTTATCAATG GAGTAATAGC TTCCGACAGA AGTTGGCATC CTCACGCCTA TGAAGTACGC AAAGTGCAAC AAGAAATAGG CTTTCAATAT CAAAATAACC AACTGATATT GCGCAATAAA CATTTCTTTA AAGATTTATT AAACTACGAA ATATATTGGC AACTGCTCAA AGAAGGTGTT CCTGTTCAAA GTGGTAATAT TACCAACTTA ATAGTATTAC CACAAAGCGA GGCTACTTTT TTACTACCTC CTCTTAAAAC AGACGATAAA GCAGAATATA TCTTGCAATG TACCGCCCGT CTCAAACAAG ACGAAGGCCT TTTAAAGAAA GGTACAGAAC TCGCTTTTGC CGAGTTTCCT CTCACCTCCT ACTCTCCTCA AAAAGCTATT GCTGATACAA CACCTTTACA GGTAGAAGAA ACAGCCAGTC ATATTCTATT GTATAATAAG CACTATACAG CCAAAATAGA TAAACAAACA GGCAAATGGG TATCGTTTCA AGTGAAGAAT GAAGAACTCT TTGCTCCCGA AGGCTTAGAG GTGAATTTAT GGCGAGCTGG GACTGATAAT GATTTTGGGG CAGGTTTACC TAAAAAACTA CAACAGCTAC AAGAGGCAGA TAAAAAAGCT GATAGTGTCC GTATATCGGT AGAAAAACTC AACTCCGGTC AAGTGAAGAT AACCCTACGC AAGCGATTGG TAGAAGGTAC AATTAATTAT ACTCAAGAGT TGCTTTTTGA TGGCAAGCCT TCCGTAACGG TGAGCAATCA CTTCAAACCG CTAAAGAACG ACAAAACGCT TACTTTTAAG ATAGGCAATC ATCTTACACT ACTGCCTTTT CAGCGCATTC AGTGGTATGG TAGAGGTCCT TGGGAAAGTT ATTGGGATAG GAAAACATCG GCTATGGTAG GCTTGTACGA AGGTGCTATT GTTAGCCAGT ATTACCCTTA TGTGCGTCCG CAAGAGAATG GTAATAAAAC TGATGTACGT TGGGCAAAGC TCAGTAAGAA AAAAGGAGTA AACATTGCCA TTTATAGTAC AGGTTCTTTA CTAAATATAA ATGCCCTGCC TTATAGTCCT GCACAGTTAT TTCCTGGTAT AGAAAAAGGA CAAACACACG CAGGTGAACT CACCCCTGAT AAGTATACTC ACTTAGACAT CGATTTGCAA CAATTAGGTT TGGGAGGTGA TAACAGCTGG GGCAACTTAC CTATGGAACA ATACTTACTG TATCTGTATC AGCCTTACAG CTATAGTTAC AGAATAGAAG CTTTTTAA
|
Protein sequence | MKNYLYIIIS FFLTIVSYAQ HLDSIFENPA LQEINRMSMR ASYFPFENIA KAKNGMIEQS ARFLNLNGLW SFLWKEDYRQ LPKDFYKTNF NESQWKKIPV PSNWEVQGYG IPIYVNASYE FNQKNPTPPD IPDSLQQNAG LYRKTFDLPT SWQGEKVYLH LGAVKSAFKL YINGKFVGMG KDSKLASEFD ITPYITKGKN LIAMEVRRWT DASYLECQDM WRFSGISRDC YLYMRPKVHL YDLSISAGLD KNYTNGKLTT SVEVWNETPS DVSKYQVEVS LFDKEQLLYQ EQKATIGLKK AFGKTELQFE AQLPQVRAWS AETPYLYRLQ MALYDAEGKV KEVVSRPIGF RTIEIEGANI LVNGKRILFK GVNRHETDPH TGQVVSQEQM ENDVKQMKAL NFNAVRTSHY PNDPYFYDLC DKYGLYVMDE ANIESHGMHY EMDKTIGNDP VWEYAHLLRM ERMVKRDKNH PSVLFWSMGN ESGNGWNFYK GYQHIKGLDS SRPIHYELAH YDWNTDIESR MYRRIPFLID YALSNHTKPF LQCEYAHAMG NSVGNFQEYW DVYEHYPKLQ GGFIWDFIDQ GLYKTLSNGK KIVTYGGDYG DKNTPSDNNF LINGVIASDR SWHPHAYEVR KVQQEIGFQY QNNQLILRNK HFFKDLLNYE IYWQLLKEGV PVQSGNITNL IVLPQSEATF LLPPLKTDDK AEYILQCTAR LKQDEGLLKK GTELAFAEFP LTSYSPQKAI ADTTPLQVEE TASHILLYNK HYTAKIDKQT GKWVSFQVKN EELFAPEGLE VNLWRAGTDN DFGAGLPKKL QQLQEADKKA DSVRISVEKL NSGQVKITLR KRLVEGTINY TQELLFDGKP SVTVSNHFKP LKNDKTLTFK IGNHLTLLPF QRIQWYGRGP WESYWDRKTS AMVGLYEGAI VSQYYPYVRP QENGNKTDVR WAKLSKKKGV NIAIYSTGSL LNINALPYSP AQLFPGIEKG QTHAGELTPD KYTHLDIDLQ QLGLGGDNSW GNLPMEQYLL YLYQPYSYSY RIEAF
|
| |