Gene Cmaq_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0104 
Symbol 
ID5709274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp121894 
End bp125040 
Gene Length3147 bp 
Protein Length1048 aa 
Translation table11 
GC content44% 
IMG OID641274608 
ProductAlpha-mannosidase 
Protein accessionYP_001539948 
Protein GI159040696 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0383] Alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGTG AAGTTGATTT AGGGGGTGTT GAGGGCTTAG CCTTCGAGTT AATGGCATCC 
TCAGTGGGTA GGTATGCTTC AATTAATGAC TGGGAATTAC TGAATAATGG GGTTAACGTT
AATTTACCGT ACAGGACCGG TGCAGGGCCT AATGATAACC TCATGTTTAG GACAAGGGTT
ACTGTACCTG AAACCAAGCA TAGGTGGTTC ATTAAGATTC TACTCACCGG TAATGCACTG
ATTAGAATTA ATAATGAGGC CTGGGGTTAC GATGAGGCCC ACACCTACTT CCCAGTTAAC
CCTGGGGTTA ACGTTATTGA GGTTAACGCC ACACCGAGGG CGCTCTTCGG TCAGCATACC
TGGGACTTCA GGTTTGATTA CGCTTACTTA ATTGAGGTTA ATTGGAGTAT TATGAGGCTT
GGATTAAGGT TACTGGCACT CATAGACTTC ATTGAGCATT TACCTAAGGA TAACCCGCTT
AGAGTTGACC TTGAGGAATT ACTAATTAAC GTGACTAAGG GGATTAGGGT TAACCCAACG
TTGAGTCAAG TCACCTTAAC CCTAATGATG CTTTACGACT CACCGTACTC TCAATTCTTC
AATAGAATGG ACCTCAGGAG ACCTGATGGT AGCTATGTAA TGGGTGTTGG GTTACATGGA
CTAGGGGTTG TTAAGGGTTT TCTAAGCGAC ATACCTAAAA CCATAGACCC CGATTCAATA
CCAATCCATG AAATTGAGGA TAAGTTAAAT TCAGGGTTAA TTAAGTTGAG TGAAAAGTAC
CCTAAGGAGG GTTTACTGGT TGCTGTTGGG CATAGTCACA TTGATGCAGC GTGGCTGTGG
CCTAGGGGTG AGACTATTAG GAAGGTTATT AGGACGTTCT CAACAATGGT TAACCTAATT
AAGGAGTATG GTATTAGCTT CCTTCAAAGC TCAGCCCAGT ACTATAAGTG GGTTGAGGAT
AATGACCCCG GTCTCTTCAA TGAGGTTAAG AAGCTTATTG AATCAGGTAA ATGGATTATT
GCGGGTGGAA TGTGGATTGA GAGTGATGCT AATATTATTG ATGGTGAATC CCTAGCTAGG
CAATTCCTAT ACGGTCAAAG ATACTTCCTA AGCAGATTCG GGAGAATGGC TAAGGTGGGT
TGGTTACCGG ACACCTTCGG CTTCTCGGCG AATCTACCTC AAATAATGAG GAAGAGCGGT
ATAGAGGTCT TCTCATCATG GAGAATAATA ACCCACTCCT TAACAGAATT CCCACTCCAC
GCCTTCACCT GGATTGGTAT CGATGGTTCA GAAATACCCA CGCAGGTAAT CCTAGTTAAC
TACAATAATG CCCACACCCC ACTTAACGCC TACCAGGCAT GGTCAATGTA CAGGGGTAAG
GATACACTAC CCCAGTTAAT ATACCCCTAC GGCTACGGTG ACGGTGGCGG TGGACCCACT
AGGGAGATGA TTGAGTACAG GGAATTAATC AATGAGCTAC CAAGCGTCCC AAGGGTTATT
GAATTCAGGG AGGATGATTA CGTGGCTGCA TTAAGCAGTG TTAAGGATAA GTTACCTAAA
TGGAGTGGTG AAATACACGT AGAGAACTTC ATAGGAACAT ACACCACCAA CCTACCGATT
AAGGAATTAG TAGCTAAATC AGAGGCACAG CTCACTGATG CTGAGGCATC AGTAACAATG
GCGTATGCAG TGGGTGCAGG TGACCATGGT TTAAGTGAAA TCAATGAACT GTGGATGAGA
CTACTATTCA ATCAATTCCA CGACATAGTA CCAGGCTCAG CAATCAAGGA GGTTTACGAT
GAAGCGTACA GTGAGTTAAG GAACTTATTA CTAAGGAGCA GTGAATTAAT GAGTGAAGCC
GTTTCGGCGG TGGGGAGGAA ATTGGGTTTA GGGGATTCAT TAATCGTGTT TAACCCACTC
CCATGGGGTA GAAGCGGGGT TATTAAAATA CCGAGGAGCA TTGAAGTCAA CCTGGAGTGC
CAGGATAATG GTGAAGACAG GTTCGTGTAC GTGAATACAC CAGCAATGGG GTTTAGTGCA
TATGGAGTAA ACGGTAAGTG CATAAACCCG AGTAATGGGG TTTTAGTCAC CAGGACTGGT
GACGGCTTCT CCCTTGAGAA TGAATACGTT AAGGTTAATT TAAATAATAA GGGTGATGCG
GATTCATTGA TCATTAAGAA GAGTGGCGTC AACGTATTGA AGGAGCCCAT TAAATTAATG
GCCCATGTTG AGAGGCCTCT TATTTCAGAT GCATGGCGCT TCTCACTAGA CTCATTGAAC
GAGGGCTATG AGTTAAGCAC CGTGTCGGAG CCGTCATTAA CCATCACTGG TCCATTAATA
TCATGTGTGA GTAGTATTAA GGAGTTTAAT AAGTCGAGGA TAACCCAGGA GGTGTGTTTA
AGGAAGGGTT CACCGGTTGT TGAGGTTAAG TACAGTATTA ATTGGTTGGA TAAGGGCATA
CTGGTGAAGA CCTGGATTAA CACAGTAATT AATGCCGAGG AGGCGGTCTT CGACATACCC
TTCGGGTCAC TGCGCAGGTC GACTAATCCG CAGGTTCAAT TAAGGGAGGG GAAGATTGAG
GTTCCAGCCC TCAGGTGGGT TGATGTTTCA GATAACGTGA AGGGCCTAGC CGTAATAGCA
CCATCAAGAC ATGGCTACAG TGTGAGCGGT GGCAGGATTG GGTTAAGTCT ACTAAGGTCA
CCAACCTTCC CAAACCCATG GAGTGATGTG GGTGAAATGG AGACAAGCAT ATACCTTTAC
CCGCATTTAG GTAATTACCA GGACGCTGAG GTACCTAGGG TAACCTACGA GATTACCCAT
GGTTTAAAGT ACGTGATAAT CACTGGTGAG TCACGGAACC AGGGAGTAGG CCAATGGAGC
TTCATGCACG TTAACCCACC TGAGGCAATG TTAACTGCGC TTAAGGTTGC TGAGGACTCA
ATGAATGAGC TAATACTGAG GCTCTATAAT CCATATGAGA GGCAGGTTAA TGTGAGCATA
GGCATTAACG GTGTTAAGGT GAAGGGAGTT ACTGAAACCG ACATTATTGA GAAGAATGAG
GTAGGAACAT TAAGTGGACT CAATAACATA TTATTAAAGC CATTTGAAAT TAAAACAATT
AGAATTAAAT ATGATGCAGG AAGCTAA
 
Protein sequence
MSREVDLGGV EGLAFELMAS SVGRYASIND WELLNNGVNV NLPYRTGAGP NDNLMFRTRV 
TVPETKHRWF IKILLTGNAL IRINNEAWGY DEAHTYFPVN PGVNVIEVNA TPRALFGQHT
WDFRFDYAYL IEVNWSIMRL GLRLLALIDF IEHLPKDNPL RVDLEELLIN VTKGIRVNPT
LSQVTLTLMM LYDSPYSQFF NRMDLRRPDG SYVMGVGLHG LGVVKGFLSD IPKTIDPDSI
PIHEIEDKLN SGLIKLSEKY PKEGLLVAVG HSHIDAAWLW PRGETIRKVI RTFSTMVNLI
KEYGISFLQS SAQYYKWVED NDPGLFNEVK KLIESGKWII AGGMWIESDA NIIDGESLAR
QFLYGQRYFL SRFGRMAKVG WLPDTFGFSA NLPQIMRKSG IEVFSSWRII THSLTEFPLH
AFTWIGIDGS EIPTQVILVN YNNAHTPLNA YQAWSMYRGK DTLPQLIYPY GYGDGGGGPT
REMIEYRELI NELPSVPRVI EFREDDYVAA LSSVKDKLPK WSGEIHVENF IGTYTTNLPI
KELVAKSEAQ LTDAEASVTM AYAVGAGDHG LSEINELWMR LLFNQFHDIV PGSAIKEVYD
EAYSELRNLL LRSSELMSEA VSAVGRKLGL GDSLIVFNPL PWGRSGVIKI PRSIEVNLEC
QDNGEDRFVY VNTPAMGFSA YGVNGKCINP SNGVLVTRTG DGFSLENEYV KVNLNNKGDA
DSLIIKKSGV NVLKEPIKLM AHVERPLISD AWRFSLDSLN EGYELSTVSE PSLTITGPLI
SCVSSIKEFN KSRITQEVCL RKGSPVVEVK YSINWLDKGI LVKTWINTVI NAEEAVFDIP
FGSLRRSTNP QVQLREGKIE VPALRWVDVS DNVKGLAVIA PSRHGYSVSG GRIGLSLLRS
PTFPNPWSDV GEMETSIYLY PHLGNYQDAE VPRVTYEITH GLKYVIITGE SRNQGVGQWS
FMHVNPPEAM LTALKVAEDS MNELILRLYN PYERQVNVSI GINGVKVKGV TETDIIEKNE
VGTLSGLNNI LLKPFEIKTI RIKYDAGS