Gene Information |
Locus tag | Coch_1483 |
Symbol | |
ID | 8367920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Capnocytophaga ochracea DSM 7271 |
Kingdom | Bacteria |
Replicon accession | NC_013162 |
Strand | + |
Start bp | 1758313 |
End bp | 1761243 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644983916 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003141590 |
Protein GI | 256820311 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
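
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the record above.
length = gene_length(1758313, 1761243)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (2931 bp) and Protein Length (976 aa) fields. `gc_percent` can be applied to the Gene sequence field to compare against the reported GC content, which the record gives only as a rounded percentage.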
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.430226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGAAAGA CTTATTTACT AATTACTTTT TTAAACTTAT CACTATTTAC TACACTTGCC CAAAATAAAT CTCCTTTAAC AACTACCGAT GCCCAAGCGC AACAGCAGTG GGTGAACGAC ACTTATAACA AAATGAGCCT TGATGAAAAG ATTGGGCAAC TGTTTATGGT ATCGGTATTT TCAAGCCATA TAGGCACAAA AAAAGCCGAA GAGGTAAAAG ACTTCATAAA GAAATACTAC ATTGGAGGGA TTATCTTCTC CAAAGGAGGA CCGAAGCGTC AGGCAAAGCT CACCAATGAA TACCAAGCAC TCTCTAAAAT TCCGCTTTTT ATGGCAATGG ATGCCGAGTG GGGATTGGCG ATGCGCTTGG ACTCTACTTA TGCTTACCCG TGGAATATGA CCCTTGGGGC AATCAAAGAT AACTATCTTG TAGAACGTAC CGGTAGACGT ATAGGGTTAC ACTGCAAGCA ATTGGGTATG CAGTTTAACT TTGCTCCTGA TATTGATATT AACACAAATC CGAATAACCC TATTATAGGA AATCGTTCGT TTGGCGAGGA TAAAGAGAAT GTAACCCAAA AAGGGCTTGC TTTTACTCGC GGAATGCAGT CGGTAGGGGT GCTGGGCAGT GCCAAACACT TTCCTGGTCA CGGTGATACT TCTAAAGATT CTCATAAAAC TTTACCGCTT GTATCTTTCG CTGCTAACCG TATTGACGAG GTAGAACTTT ACCCTTTTAA AGCCCTCATC AGAGAAGGAA TAGCGAGCAT AATGGTAGGG CATCTCAATA TTCCTTCCTT AGAACCTAAA ACAGGGTTAC CTTCCTCGTT ATCAAGTGCT ATTATCACTG ATTTACTCAA AAAGAAATTA GGCTATCAAG GGCTTATCTT TACCGATGCA CTTGGTATGA AAGGGGTATC GGAATACTTG CCCGTAGGCG AAGTGGAAGT AGAAGCCTTC TTGGCGGGTA ACGATATTTT GCTGATGCCA GCTAATGTAG CTAAGGGTTT TGAGGCAATG AAGAAAGCCT ACCAGAACAA ACGCATCAGT GAGGAACGCT TAGCACATTC AGTAAAGAAA ATACTAATGG CTAAATACAA AGTGGGGCTC ACATCTTTTA AACAGCTAGA CGAAAGCACT GTAACCTCTA CCTTGCACAC AATAGAAGAC GATTTGCTCA CTGAGGAACT GTTTGAAAAT GCGATTACGG TAGGGCAAAA CAAGGAAAAT ATACTGCCTA TAAAAGACTT GGACAAGCGC AAAATAGCCT ATGTGAAGTT TGGCAACGAC AGTGGTTGGG CGTTTTATTC CACTTTACGC AAATACGCAG AGACCGTGCT TATTGAGCCT AAGAATGAGG CACAGCTATA TGAAGCAGTT GAACCTTTTG ACACGATTAT CATCGGGTTG CACAAACCAG ATAAGACCCC TTGGGACGCA TACAAGTTTA CAGAGAACGA ACTGAAATGG TTAGAGCATA TTGCTAAGAA AAAAAAGGTA ATCCTCGCAG TGTTTACTCG CCCCTATGCG ATGCTCGACG TGAAAAATAT AGCTCCTATT AAAGCTATTG TATTTGCTTA CCAAAATAAC AAAGTAGCCC AAGAGAAAGC CGCTCAGCTC ATCTTTGGTG CTATTGAGGG TAAAGGAGTT TTGCCTGTAA CAGCACACCC TAACTTACCC GTAGGCACTT CAATTGCTAC TCCCAGAATA GGACGCTTAG CTTACGGCTT ACCTGAAAGC GTGGGGCTCA ATTCTCTTAA ACTAAAAGAG ATAGACAGTA TTGCTCTTGA TGCTGTAGCA 
CAAAAGATGA CACCCGGGAT GCAAGTATTG GTTGCCAAAA GAGGAAAGGT AGTTTATCGC AAGAATTTTG GTACTTTAGA TTACAATCCA GCCAATAAAG TGACCGACAA CACTATTTAC GACCTCGCTT CGCTTACGAA GATATTGGCT ACTCTTCCCG AATTGATGCG TCTTTATGTG CGTGGAGATT TCAAACCCGT AGATACTTTT GAGGATTTGC TACCCAAGCT CAAACATACA AACAAGGGAG ATTTGGTAAT GAAAGATGTG CTCTCGCACT ATGCTCAGTT CCAGTCGTGG ATTCCTTTTT ACCGAAAAAC CTTAGACATT GATAAAAAGC CTTCGCCTGA GTATTACAGC ACTACCAAAA GTGATTCTTT CCCTACTGAG GTAGCCAAAG ACTTATATCT TAGAGAAGGA TATGCTGACA GTATTTATAA AACAATTGAC GAGAGTGAAC TTATCAAAGA TAAAAAGTAC TTGTATAGCG ACCTCCCTTA TTATTATTTC AAGAAGTACA TAGAAAAAAA AAACAAGAAG CCCCTACAAG AAATCGTTCA GAAGCACTTT TATAGAGGTT TAGGAGCCTA CCAGCTTACT TTTCTTCCTT TGCAACGCTT CTCTCCTGTA AATATTGCAC CTGCCGAAGA TGAAAAGACC TTTCGTTCTC AGGAGCTTCG AGGGTACGTA CACGACCAAG GAGCAGCCCT TTTAGGAGGG GTTGGTGGAC ACGCAGGACT TTTTGGAACT GCCGACGATG TAGCCAAAGT GATGCAAATG TACCTCCAAC AAGGGTATTA CGGTGGTACG TGGTTCTTGC AACCTCAGGC TATAAAAATC TTCAACACTT GTAACTACTG TCCAGAAGGT AACCGTCGAG GTTTAGGATT TGATAAGCCA CAGTTAGGCA AATCAGGTCC TACTTGTGGT TGTGTGCCAA TGGATAGCTT TGGACATACC GGTTTTACGG GTACTTTTGC GTGGGCTGAC CCCACTAATG AAATAGTAAT AGTTATCCTC TCGAACCGCA CTTATCCCTC TTCTGACAAT AAACTTTTGG TAAACCGATT GGTGCGCCAA AAGATACAAG GAGTGGTGTA CCAAGCCCTA TCTGCGAATA AAAACTTTTA A
Protein sequence | MRKTYLLITF LNLSLFTTLA QNKSPLTTTD AQAQQQWVND TYNKMSLDEK IGQLFMVSVF SSHIGTKKAE EVKDFIKKYY IGGIIFSKGG PKRQAKLTNE YQALSKIPLF MAMDAEWGLA MRLDSTYAYP WNMTLGAIKD NYLVERTGRR IGLHCKQLGM QFNFAPDIDI NTNPNNPIIG NRSFGEDKEN VTQKGLAFTR GMQSVGVLGS AKHFPGHGDT SKDSHKTLPL VSFAANRIDE VELYPFKALI REGIASIMVG HLNIPSLEPK TGLPSSLSSA IITDLLKKKL GYQGLIFTDA LGMKGVSEYL PVGEVEVEAF LAGNDILLMP ANVAKGFEAM KKAYQNKRIS EERLAHSVKK ILMAKYKVGL TSFKQLDEST VTSTLHTIED DLLTEELFEN AITVGQNKEN ILPIKDLDKR KIAYVKFGND SGWAFYSTLR KYAETVLIEP KNEAQLYEAV EPFDTIIIGL HKPDKTPWDA YKFTENELKW LEHIAKKKKV ILAVFTRPYA MLDVKNIAPI KAIVFAYQNN KVAQEKAAQL IFGAIEGKGV LPVTAHPNLP VGTSIATPRI GRLAYGLPES VGLNSLKLKE IDSIALDAVA QKMTPGMQVL VAKRGKVVYR KNFGTLDYNP ANKVTDNTIY DLASLTKILA TLPELMRLYV RGDFKPVDTF EDLLPKLKHT NKGDLVMKDV LSHYAQFQSW IPFYRKTLDI DKKPSPEYYS TTKSDSFPTE VAKDLYLREG YADSIYKTID ESELIKDKKY LYSDLPYYYF KKYIEKKNKK PLQEIVQKHF YRGLGAYQLT FLPLQRFSPV NIAPAEDEKT FRSQELRGYV HDQGAALLGG VGGHAGLFGT ADDVAKVMQM YLQQGYYGGT WFLQPQAIKI FNTCNYCPEG NRRGLGFDKP QLGKSGPTCG CVPMDSFGHT GFTGTFAWAD PTNEIVIVIL SNRTYPSSDN KLLVNRLVRQ KIQGVVYQAL SANKNF