Gene Information |
Locus tag | Cmaq_0104 |
Symbol | |
ID | 5709274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 121894 |
End bp | 125040 |
Gene Length | 3147 bp |
Protein Length | 1048 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641274608 |
Product | Alpha-mannosidase |
Protein accession | YP_001539948 |
Protein GI | 159040696 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0383] Alpha-mannosidase |
TIGRFAM ID | |
|
|
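The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon (the record lists the gene as not spliced).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(121894, 125040)
print(length, protein_length_aa(length))
```

With the Start bp and End bp values listed for Cmaq_0104, this reproduces the reported Gene Length (3147 bp) and Protein Length (1048 aa).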
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCGTG AAGTTGATTT AGGGGGTGTT GAGGGCTTAG CCTTCGAGTT AATGGCATCC TCAGTGGGTA GGTATGCTTC AATTAATGAC TGGGAATTAC TGAATAATGG GGTTAACGTT AATTTACCGT ACAGGACCGG TGCAGGGCCT AATGATAACC TCATGTTTAG GACAAGGGTT ACTGTACCTG AAACCAAGCA TAGGTGGTTC ATTAAGATTC TACTCACCGG TAATGCACTG ATTAGAATTA ATAATGAGGC CTGGGGTTAC GATGAGGCCC ACACCTACTT CCCAGTTAAC CCTGGGGTTA ACGTTATTGA GGTTAACGCC ACACCGAGGG CGCTCTTCGG TCAGCATACC TGGGACTTCA GGTTTGATTA CGCTTACTTA ATTGAGGTTA ATTGGAGTAT TATGAGGCTT GGATTAAGGT TACTGGCACT CATAGACTTC ATTGAGCATT TACCTAAGGA TAACCCGCTT AGAGTTGACC TTGAGGAATT ACTAATTAAC GTGACTAAGG GGATTAGGGT TAACCCAACG TTGAGTCAAG TCACCTTAAC CCTAATGATG CTTTACGACT CACCGTACTC TCAATTCTTC AATAGAATGG ACCTCAGGAG ACCTGATGGT AGCTATGTAA TGGGTGTTGG GTTACATGGA CTAGGGGTTG TTAAGGGTTT TCTAAGCGAC ATACCTAAAA CCATAGACCC CGATTCAATA CCAATCCATG AAATTGAGGA TAAGTTAAAT TCAGGGTTAA TTAAGTTGAG TGAAAAGTAC CCTAAGGAGG GTTTACTGGT TGCTGTTGGG CATAGTCACA TTGATGCAGC GTGGCTGTGG CCTAGGGGTG AGACTATTAG GAAGGTTATT AGGACGTTCT CAACAATGGT TAACCTAATT AAGGAGTATG GTATTAGCTT CCTTCAAAGC TCAGCCCAGT ACTATAAGTG GGTTGAGGAT AATGACCCCG GTCTCTTCAA TGAGGTTAAG AAGCTTATTG AATCAGGTAA ATGGATTATT GCGGGTGGAA TGTGGATTGA GAGTGATGCT AATATTATTG ATGGTGAATC CCTAGCTAGG CAATTCCTAT ACGGTCAAAG ATACTTCCTA AGCAGATTCG GGAGAATGGC TAAGGTGGGT TGGTTACCGG ACACCTTCGG CTTCTCGGCG AATCTACCTC AAATAATGAG GAAGAGCGGT ATAGAGGTCT TCTCATCATG GAGAATAATA ACCCACTCCT TAACAGAATT CCCACTCCAC GCCTTCACCT GGATTGGTAT CGATGGTTCA GAAATACCCA CGCAGGTAAT CCTAGTTAAC TACAATAATG CCCACACCCC ACTTAACGCC TACCAGGCAT GGTCAATGTA CAGGGGTAAG GATACACTAC CCCAGTTAAT ATACCCCTAC GGCTACGGTG ACGGTGGCGG TGGACCCACT AGGGAGATGA TTGAGTACAG GGAATTAATC AATGAGCTAC CAAGCGTCCC AAGGGTTATT GAATTCAGGG AGGATGATTA CGTGGCTGCA TTAAGCAGTG TTAAGGATAA GTTACCTAAA TGGAGTGGTG AAATACACGT AGAGAACTTC ATAGGAACAT ACACCACCAA CCTACCGATT AAGGAATTAG TAGCTAAATC AGAGGCACAG CTCACTGATG CTGAGGCATC AGTAACAATG GCGTATGCAG TGGGTGCAGG TGACCATGGT TTAAGTGAAA TCAATGAACT GTGGATGAGA CTACTATTCA ATCAATTCCA CGACATAGTA CCAGGCTCAG CAATCAAGGA GGTTTACGAT 
GAAGCGTACA GTGAGTTAAG GAACTTATTA CTAAGGAGCA GTGAATTAAT GAGTGAAGCC GTTTCGGCGG TGGGGAGGAA ATTGGGTTTA GGGGATTCAT TAATCGTGTT TAACCCACTC CCATGGGGTA GAAGCGGGGT TATTAAAATA CCGAGGAGCA TTGAAGTCAA CCTGGAGTGC CAGGATAATG GTGAAGACAG GTTCGTGTAC GTGAATACAC CAGCAATGGG GTTTAGTGCA TATGGAGTAA ACGGTAAGTG CATAAACCCG AGTAATGGGG TTTTAGTCAC CAGGACTGGT GACGGCTTCT CCCTTGAGAA TGAATACGTT AAGGTTAATT TAAATAATAA GGGTGATGCG GATTCATTGA TCATTAAGAA GAGTGGCGTC AACGTATTGA AGGAGCCCAT TAAATTAATG GCCCATGTTG AGAGGCCTCT TATTTCAGAT GCATGGCGCT TCTCACTAGA CTCATTGAAC GAGGGCTATG AGTTAAGCAC CGTGTCGGAG CCGTCATTAA CCATCACTGG TCCATTAATA TCATGTGTGA GTAGTATTAA GGAGTTTAAT AAGTCGAGGA TAACCCAGGA GGTGTGTTTA AGGAAGGGTT CACCGGTTGT TGAGGTTAAG TACAGTATTA ATTGGTTGGA TAAGGGCATA CTGGTGAAGA CCTGGATTAA CACAGTAATT AATGCCGAGG AGGCGGTCTT CGACATACCC TTCGGGTCAC TGCGCAGGTC GACTAATCCG CAGGTTCAAT TAAGGGAGGG GAAGATTGAG GTTCCAGCCC TCAGGTGGGT TGATGTTTCA GATAACGTGA AGGGCCTAGC CGTAATAGCA CCATCAAGAC ATGGCTACAG TGTGAGCGGT GGCAGGATTG GGTTAAGTCT ACTAAGGTCA CCAACCTTCC CAAACCCATG GAGTGATGTG GGTGAAATGG AGACAAGCAT ATACCTTTAC CCGCATTTAG GTAATTACCA GGACGCTGAG GTACCTAGGG TAACCTACGA GATTACCCAT GGTTTAAAGT ACGTGATAAT CACTGGTGAG TCACGGAACC AGGGAGTAGG CCAATGGAGC TTCATGCACG TTAACCCACC TGAGGCAATG TTAACTGCGC TTAAGGTTGC TGAGGACTCA ATGAATGAGC TAATACTGAG GCTCTATAAT CCATATGAGA GGCAGGTTAA TGTGAGCATA GGCATTAACG GTGTTAAGGT GAAGGGAGTT ACTGAAACCG ACATTATTGA GAAGAATGAG GTAGGAACAT TAAGTGGACT CAATAACATA TTATTAAAGC CATTTGAAAT TAAAACAATT AGAATTAAAT ATGATGCAGG AAGCTAA
|
Protein sequence | MSREVDLGGV EGLAFELMAS SVGRYASIND WELLNNGVNV NLPYRTGAGP NDNLMFRTRV TVPETKHRWF IKILLTGNAL IRINNEAWGY DEAHTYFPVN PGVNVIEVNA TPRALFGQHT WDFRFDYAYL IEVNWSIMRL GLRLLALIDF IEHLPKDNPL RVDLEELLIN VTKGIRVNPT LSQVTLTLMM LYDSPYSQFF NRMDLRRPDG SYVMGVGLHG LGVVKGFLSD IPKTIDPDSI PIHEIEDKLN SGLIKLSEKY PKEGLLVAVG HSHIDAAWLW PRGETIRKVI RTFSTMVNLI KEYGISFLQS SAQYYKWVED NDPGLFNEVK KLIESGKWII AGGMWIESDA NIIDGESLAR QFLYGQRYFL SRFGRMAKVG WLPDTFGFSA NLPQIMRKSG IEVFSSWRII THSLTEFPLH AFTWIGIDGS EIPTQVILVN YNNAHTPLNA YQAWSMYRGK DTLPQLIYPY GYGDGGGGPT REMIEYRELI NELPSVPRVI EFREDDYVAA LSSVKDKLPK WSGEIHVENF IGTYTTNLPI KELVAKSEAQ LTDAEASVTM AYAVGAGDHG LSEINELWMR LLFNQFHDIV PGSAIKEVYD EAYSELRNLL LRSSELMSEA VSAVGRKLGL GDSLIVFNPL PWGRSGVIKI PRSIEVNLEC QDNGEDRFVY VNTPAMGFSA YGVNGKCINP SNGVLVTRTG DGFSLENEYV KVNLNNKGDA DSLIIKKSGV NVLKEPIKLM AHVERPLISD AWRFSLDSLN EGYELSTVSE PSLTITGPLI SCVSSIKEFN KSRITQEVCL RKGSPVVEVK YSINWLDKGI LVKTWINTVI NAEEAVFDIP FGSLRRSTNP QVQLREGKIE VPALRWVDVS DNVKGLAVIA PSRHGYSVSG GRIGLSLLRS PTFPNPWSDV GEMETSIYLY PHLGNYQDAE VPRVTYEITH GLKYVIITGE SRNQGVGQWS FMHVNPPEAM LTALKVAEDS MNELILRLYN PYERQVNVSI GINGVKVKGV TETDIIEKNE VGTLSGLNNI LLKPFEIKTI RIKYDAGS
|
| |
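The gene and protein sequences above are printed in space-separated blocks of ten characters. As a hedged illustration of how the two fields relate, the sketch below strips that formatting and translates the coding strand. The helper names are hypothetical, and the codon table is written out by hand rather than taken from a library; translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard assignments are used here.

```python
from itertools import product

# Codon-to-residue assignments shared by the standard code and translation
# table 11, listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
print(translate(clean_sequence("ATGAGTCGTG AAGTTGATTT AGGGGGTGTT")))
```

Applied to the first three blocks of the gene sequence, this yields MSREVDLGGV, which matches the first block of the protein sequence. Although the gene lies on the minus strand of the replicon, the Gene sequence field already starts with ATG and ends with TAA, so no reverse complement is taken in this sketch.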