Gene Information |
Locus tag | Cmaq_1217 |
Symbol | |
ID | 5709760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 1283665 |
End bp | 1286565 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641275721 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001541034 |
Protein GI | 159041782 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
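The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Coordinates copied from the Gene Information table (assumed 1-based, inclusive).
start_bp = 1283665
end_bp = 1286565

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2901 966
```

Both values agree with the Gene Length (2901 bp) and Protein Length (966 aa) rows.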
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000342609 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.287325 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAAGT CAAGTGACTT CATACCCAGT GGTGTTAAAA GAAGCGATAT TTACCATGGT GGGTGGATTG ATCATGATAA GGATGGTTTA ATGGCCCCCT TCGAGGACCC CAGTAGGCCT ATTGATGAGA GGGTTGAGGA TCTCTTGAGG AGGATGAGTC TTGAGGAGAA GGTGGCTCAA TTAAGGTCGG ATCTAACTGA TAGGTTGGAT GTAGGTAACT TATCAGTGGT CCTTAGGGGT ACTGAACCGA CTGAGGGTGC TGTTAAGGCT AATGATATTC AGAGGAGGTT CCTTGAGGAC ACTAGGCTTG GTGTACCGGC TATTATTCAC GATGAGTGCC TCCACGGATG CATGGCTAAG CACTCAACAG TCTTCCCCCA AGCCATAGCG CTGGCTGCGG CCTGGGACGT GGACTTAATG TATAGGGTTG CTAAGGCTAT TGCAAGGGAG ACTAGGGCCA GGGGTATTAG GCAATGCCTA TCACCCGTGG TTAACCTAAC CTTCGATGCT AGGGCTGGTA GGACTGAGGA GACTTATGGT GAAGACCCCT ACTTAGCCTC ACAGTTAGCC TACGCCTACG TTAAGGCGCT TAGGGAGGAG GGTATTGTGG CTACCCCTAA GCACTACATA ATGAACTTCG TTGGTGATGG TGGTAGGGAC AGCGCTGAAA TACACATGAG TGAGAGGTTC ATTAGGGAGA CTGAGTTACC GGTTTTCAGA GCGGCCATTA AGGCTGGGGC ATTATCATTA ATGGCTGCCT ACAACTCCAT AGACGGGGTA CCATGCTCAA TGAACAAGTA CTGGCTAACG GAGGTACTTA GATGGGAATT GGGGTTCGAG GGCTTCGTGG TGTCTGACTA CGGTTCAGTC ACCGGTATAG TTAATAGGCA CTACATAACC GATAACCCGG AGGAGGTTGC TAAACTAGCC CTTGAGGCTG GCCTAGATGT TGAATTCCCA GGATTCTCAA TATACGGTGA ACCACTGGTT AGGGCTATTA GGAGGGGGTT GATTAGCGAG GAGGCGCTTA ATGAGGCTGT TAGGAGGGTT TTGAGGGCTA AATTCCTAAT AGGCCTATTC GACTCACCTT ACGTTGACCC CGAGGAAGCT AAGGTTATTG GCTCAGAGGA GCATAGGCGG TTAGCCCTTG AGGCCGCTGA GAAGGCTATT GTACTGCTTA AGAATGATGG TGTACTGCCT ATTGATAAGT CTAGGGTTAA GGCAATAGCC CTAATAGGCC CCTTCGCGGA TGAGGTTAAG TTAGGTGGCT ATAGTGCAAT ACCTAAGAGT GTAATAACAC CCCTTGAAGC CTTTAAGGCC AGGGGCATTA ATGTGATTCA CGCTAAGGGA TGCATAGGGG ATATGGATGC TGACCACCCA ATACCAACAA GGTACCTTAC ACCAATGGGT GAGCCTAATA GACACGGCTT GAGGGGTGAG TACTTCAATA ACCCTAACCT TGAGGGTGAG CCTATTGGCG TTAGGATTGA TGCACCGTGG GAGGGATTCT TCAGACTTGA CATAGGCTAC GACCCACCAT ACCAGGGCCT TGACCCAGGA AGATACTCCA TTAGGTGGAT TGGTTACATT ACCCCACCCG TATCAGGCAC GTATGAGTTT AAGGTTTACG CCGCTGGCGG TGGATTTAGG CTTACTGTTG ATGGTAAGAC AATTGTGGAT TCATGGGGTG TGGCCAGTAA TTCACCTAAG AGTGGTTCAA TTAGGCTTGA GGGTGGGAGG CAGTATGAGA TTAGGCTTGA GTACGGTAGG TTGAACTACG GTTACGCCTA CATTAAGCTT 
GGCTGGGATT TAATCGAGGA CTCAATGATT AAGGAGGCTG TTGACGCCGC TTCGAAGGCT GATGCAGTGG TTGTGTTTGC AGGCATTATT GAGGGTGAAC AGAGGGATAG GGCATCATTA AGGTTACCCA AGTGCCAGGA GAGGCTTATT GAGGAGGTGC TTAAGGTTAA TAAGAACGTT GCAGTGGTTT TAACCACTGG AAGCCCAGTC GTCGGTGAAT GGATTAATAA TGTCCCAGCC CTGGTTGAGG CCTGGTACCC TGGTGAAATG GGCGGGGAGG CTATTGCCCA AGTGCTTTTA GGTGAATATA ACCCGGGTGG TAAATTACCT TTAACCTGGC CTATTCACGA GGGCCAGGAA CCATTATACT ACTTCACTAA GCCCAGTGGT AGGGTTTATG ATTACGTTAA CATGCCTCCA ACACCACTAT TCCCGTTCGG ACACGGGCTA AGCTACACTC AATTCAAGTA CAGTGACCTC AAGGTGAGCG TTAATGAGGA TGATGGAGTG GTTTCAGTAT CACTTAACGT TGAGAATATA GGTAAGTATG AGGGTGATGA AGTTGTTCAA CTATACGTTA GGGATAGGTA CTCAAGTATA GCTAGACCAT TAATGATGCT TAAGGGCTTT AGGAGAATAA CGCTTAAGCC CGGTGAGAAG ACTGTAGTTG AATTCAAGTT AACCTTAGAT GACTTAGCAA TGTATGATGC AGGGTTCAGG CGTATTGTGG AGCCTGGAGC ATACCAGGTC CTTGTTGGTT CATCATCAAT GGACATTAGG CTAATGGGTG AATTCAAGTT AACTCAACTG GTTAAGGGTA TTGTGAGCGT GACGAGTGTT AATGCTGATA AGGTCAATGT TAAGGCTGGT GAATCAATTA GGGTTAAGGC CACGTTAAGA AATGAAGGTA AAGTAGGTGA CCTAGTACCC ATTACGCTTA AGGTTAATGG TAGGGTGATT GAGGAGCATA GAGTGTACTT GGATCCAGGT GAGGAGAGGA TAGTGAACTT CACGGTGAAG CTACATGAGG CAGGGAAGCA GGTGGTTTCA GTGGCAGTGC CCGAGGGGGA GAAGTCAGTA ACCATAGATG TAACCCAATA G
|
Protein sequence | MSKSSDFIPS GVKRSDIYHG GWIDHDKDGL MAPFEDPSRP IDERVEDLLR RMSLEEKVAQ LRSDLTDRLD VGNLSVVLRG TEPTEGAVKA NDIQRRFLED TRLGVPAIIH DECLHGCMAK HSTVFPQAIA LAAAWDVDLM YRVAKAIARE TRARGIRQCL SPVVNLTFDA RAGRTEETYG EDPYLASQLA YAYVKALREE GIVATPKHYI MNFVGDGGRD SAEIHMSERF IRETELPVFR AAIKAGALSL MAAYNSIDGV PCSMNKYWLT EVLRWELGFE GFVVSDYGSV TGIVNRHYIT DNPEEVAKLA LEAGLDVEFP GFSIYGEPLV RAIRRGLISE EALNEAVRRV LRAKFLIGLF DSPYVDPEEA KVIGSEEHRR LALEAAEKAI VLLKNDGVLP IDKSRVKAIA LIGPFADEVK LGGYSAIPKS VITPLEAFKA RGINVIHAKG CIGDMDADHP IPTRYLTPMG EPNRHGLRGE YFNNPNLEGE PIGVRIDAPW EGFFRLDIGY DPPYQGLDPG RYSIRWIGYI TPPVSGTYEF KVYAAGGGFR LTVDGKTIVD SWGVASNSPK SGSIRLEGGR QYEIRLEYGR LNYGYAYIKL GWDLIEDSMI KEAVDAASKA DAVVVFAGII EGEQRDRASL RLPKCQERLI EEVLKVNKNV AVVLTTGSPV VGEWINNVPA LVEAWYPGEM GGEAIAQVLL GEYNPGGKLP LTWPIHEGQE PLYYFTKPSG RVYDYVNMPP TPLFPFGHGL SYTQFKYSDL KVSVNEDDGV VSVSLNVENI GKYEGDEVVQ LYVRDRYSSI ARPLMMLKGF RRITLKPGEK TVVEFKLTLD DLAMYDAGFR RIVEPGAYQV LVGSSSMDIR LMGEFKLTQL VKGIVSVTSV NADKVNVKAG ESIRVKATLR NEGKVGDLVP ITLKVNGRVI EEHRVYLDPG EERIVNFTVK LHEAGKQVVS VAVPEGEKSV TIDVTQ
|
| |
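The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is an illustration rather than the database's own method: it assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand), and it uses the codon assignments of translation table 11, which match the standard table for sense and stop codons. Only the first 30 and last 21 nucleotides of the record are used; the helper name `translate` is hypothetical.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, in the conventional
# TCAG ordering (identical to the standard table for these assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    seq = "".join(seq.split()).upper()  # drop the 10-base spacing used in the record
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


# First 30 nt and last 21 nt of the gene sequence, copied from the record.
head = "ATGTCTAAGT CAAGTGACTT CATACCCAGT"
tail = "ACCATAGATG TAACCCAATA G"

print(translate(head))  # MSKSSDFIPS
print(translate(tail))  # TIDVTQ*
```

The two outputs match the first ten and last six residues of the protein sequence, with the trailing `*` corresponding to the TAG stop codon.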