Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Coch_1011 |
Symbol | |
ID | 8367431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Capnocytophaga ochracea DSM 7271 |
Kingdom | Bacteria |
Replicon accession | NC_013162 |
Strand | - |
Start bp | 1185702 |
End bp | 1188761 |
Gene Length | 3060 bp |
Protein Length | 1019 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644983437 |
Product | Beta-galactosidase |
Protein accession | YP_003141127 |
Protein GI | 256819848 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00111283 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATAC AACATACTCT ATTCTTATTC ATTTTTGCTT TGGCGAGTAG TCTCAGCACA GCGCAAACCC ACGATTGGGA AAACTTAGCG GTAAGCAGTA TCAACACCGA AAAAAGCCAT AGCCACTATG AGCCGGCTGG AAAAATACTC CTCAACGGCA ATTGGCAGTT TGCCTACTTC AAGCACCCTT CACAAGTGCC TGCTGATTTC TTTTTGGGCA AAGGCATTAC CCAATGGGAC GCTATAAAAG TGCCTTCAAA TTGGCAACTG CAAAGCAACC GATATGACCC TCCTGTGTTT ACCAATATCA AATATCCGTT TGAGATGAAC CCTCCTTATA CCCCTAAGGA CTACAACCCT ACGGGAGTGT ATAGAACGCA GTTCACGGTG CCTAGCAAGT GGAAAGGCGA ACAGGTGTTC ATTCACTTTG CGGGAGTGCA ATCGGCGATG GAGTTGTTCA TCAATGGTAA GCAAGTGGGC TATCACGAAG ATGCAATGTT ACCTGCCGAG TTCAACATCA CTCCTTACCT CAAAAAAGGC AAAAACGAAC TATATGTAAA AGTCTTGAAC TGGTCGGACG GCAGTTATAT AGAAGACCAA GATTTTTGGC GACTCAGCGG TATCTATCGC GATGTATACC TCTTTGCTAC CCCCGAGTTG CGTATGCGTG ACTTTTCAGT ATATCCTCAG CTCGATGCGC AATACCGCGA TGCTACCTTG CAGGTGCAAG TAGAGGTACA GAATTTAGGC GAAAAAGTAA GTGATGCTCT TGTGGTACAA ACCTCTCTTA AAGACAGCAA AGGTAATGTG ATAGGCACTG AAAAAGCCTC TATTGCGGAT ATTGCTGCGG GAAAAGAAGC TACCGTTAGT GCCCAAATAG CTGTAAAAAA TCCGTTGAAA TGGACTGCCG AAACCCCTAA CCTTTACAAA GTGGAACTCA GTTTGCTCAC CGCCAAAGGC AAGGTGTTAC AATCGTTTAC TCAAAATGTA GGATTTAGGA AGATTGAACT AAGCAACGGA TTGCTCCTTG TGAACGGCAA GCCTGTGAAG TTTAAAGGAG TGAACCGTCA CGAGTTCGAC CCTTATAACG GTCGCACCAT CACCCGCCAA TCGATGATTG ACGATATTAT CCTGATGAAA ACGCACAACA TCAATGCGGT GCGCACCTCT CACTACCCCA ATCAGCCTGA GTGGTATACC CTCTGCGACG AATATGGATT ATATGTGGTA GACGAAGCTA ATATCGAGAG CCACGGATTG TGGGAGAGTG GCTACTACAT AGGCGAACGC CCTGAATGGC AAAAGGACAT CGTGGAGCGC AATGTGAATA TGGTTGCTCG CGACAAGAAC CACCCTTGTA TCATCTATTG GTCGATGGGG AATGAATCGG GTTGGGGTAA GAACTTTGAT GCAGCTTACG AGGCGATAAA AGCCCTCGAC CCTCAAAAGC GCCCCGTGCA CTACGAGTCT AAAAACCCTG CTTATGCAGG CGTGCTCTCG CATTACGATA TCATCTCTAA TATGTACACC GAGCTCAACC ACCTGAACAA TCTCTTTACC GAAGACCCCA AACGCCCTGT GATTATCTGC GAATACGCCC ATTCTATGGG TAACAGCTTA GGCAACTTCC GCAAGTATTG GGAGCTTTTT GCTACCAATG AGCGCTACCA AGGTGGTTTT ACGTGGGACT GGAAAGATCA AGCGTTGCGT TGCAAAGATA AGAACGGCAA AGAGTATTGG AACATCATCA ATCATATCGA CAAGGCGAAT GTGAACGACG GATTGGTAAA TGCCACAGGC GTTCCTCAAC CCGAAATGCA CGAACTGAAA AAGGTATATC AGTATTTCAA TGTAAAGGAT ATTGATATCA AGACAGGCTT GGTACTCATC AGCAATAGCA ACTACTTTGT AAATAGCGAC GAGGTGTATT TGCAATGGGA ACTTATTGAG AATGGCAAGC CTATCGCCAA TGGGGTAATC AACGACCTGA ACATCGCCCC ACAAAGCCAA AGAGCCCTAC AAATACCTTT CAAAACAAAA TTAGTACAAA ACGGCAAGGA ATACTTTATG AACTTCCATT TTAAGAATAA AAAGGCTACT GCTTGGGCTT CAAAAGATTT TGAAGTAGCC AAAGAACAAC TCGCTTTCCC TAACCGTGTT GAGAGAGAAT TCACCAAGCC CTCCGATAAA AAACTAACAT TTACTGACGA AGCTACAAAC TTCACCGTAA AAGGCGATAA TTTTACAGCC GTATTCAGCA AAAAAACAGG CGGTTTAAGT CAATTTACAC ATAAAGGGAA AAACCTGCTT TCAGAAGCGA TGGTGCCCTC TTTTTGGCGT GTACCTACCG ATAACGATGA AGGTGGTTTT GAACAATCAT ATGCCTCAGC TTGGCGCAAA GCGGGATTAA AAGAAGCTAT GGTAACAGCT ACCGAAATGA AAGCTACGCA AATAGGGGAA ACCCAACTGA AAATAGTAGC ACACAACCGC ATTGAAACCA AAGCGGGCAA TATCAGCCAA CAAGTAACTT ACCTCATCAA TGGAGACGGA CGTATAGATA TCAGCACAAA TGTGGAAGTG CCTGCTTCTG TGCCTGCTTT GGCAAGAGTG GGAATGCTCC TAACACTCGA CAAGAGTTTT AACAAAGTAG AATGGTACGG CAAAGGTCCT TATGAAACTT ATGCCGATAG AAAAGAATCA GCTTTTGTGG GTATTCACAG CGGTGCAGTA AAGGATATGC ACTTTCCTTA TGTGATGCCT TCTGAAAACG GCAACCATAT CGATACCCGT TGGCTCAAAC TCCTTTCGGG TACTACTGAA CTATATATCA GTGCTCCTAA ACTCTTTAAT TTCAACGTGC AAGACTATTC AGACGACGCG CTGAACCAAT CCAAAGAAAC CCAAGAACTG CGCCGTGGAG ACCACACTTA TTTGCACATC GATGAGGCTC AAATGGGTGT AGGAGGAGAC GACAGCTGGT CGCCACGCGT ACATAAAGAG TTTTTGCTCA ACCAACCGTA TTATCATTAC GAATTTAGCA TTCAGGTAGG GGGGAAATGA
|
Protein sequence | MRIQHTLFLF IFALASSLST AQTHDWENLA VSSINTEKSH SHYEPAGKIL LNGNWQFAYF KHPSQVPADF FLGKGITQWD AIKVPSNWQL QSNRYDPPVF TNIKYPFEMN PPYTPKDYNP TGVYRTQFTV PSKWKGEQVF IHFAGVQSAM ELFINGKQVG YHEDAMLPAE FNITPYLKKG KNELYVKVLN WSDGSYIEDQ DFWRLSGIYR DVYLFATPEL RMRDFSVYPQ LDAQYRDATL QVQVEVQNLG EKVSDALVVQ TSLKDSKGNV IGTEKASIAD IAAGKEATVS AQIAVKNPLK WTAETPNLYK VELSLLTAKG KVLQSFTQNV GFRKIELSNG LLLVNGKPVK FKGVNRHEFD PYNGRTITRQ SMIDDIILMK THNINAVRTS HYPNQPEWYT LCDEYGLYVV DEANIESHGL WESGYYIGER PEWQKDIVER NVNMVARDKN HPCIIYWSMG NESGWGKNFD AAYEAIKALD PQKRPVHYES KNPAYAGVLS HYDIISNMYT ELNHLNNLFT EDPKRPVIIC EYAHSMGNSL GNFRKYWELF ATNERYQGGF TWDWKDQALR CKDKNGKEYW NIINHIDKAN VNDGLVNATG VPQPEMHELK KVYQYFNVKD IDIKTGLVLI SNSNYFVNSD EVYLQWELIE NGKPIANGVI NDLNIAPQSQ RALQIPFKTK LVQNGKEYFM NFHFKNKKAT AWASKDFEVA KEQLAFPNRV EREFTKPSDK KLTFTDEATN FTVKGDNFTA VFSKKTGGLS QFTHKGKNLL SEAMVPSFWR VPTDNDEGGF EQSYASAWRK AGLKEAMVTA TEMKATQIGE TQLKIVAHNR IETKAGNISQ QVTYLINGDG RIDISTNVEV PASVPALARV GMLLTLDKSF NKVEWYGKGP YETYADRKES AFVGIHSGAV KDMHFPYVMP SENGNHIDTR WLKLLSGTTE LYISAPKLFN FNVQDYSDDA LNQSKETQEL RRGDHTYLHI DEAQMGVGGD DSWSPRVHKE FLLNQPYYHY EFSIQVGGK
|
| |