Gene Cmaq_1217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1217 
Symbol 
ID5709760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1283665 
End bp1286565 
Gene Length2901 bp 
Protein Length966 aa 
Translation table11 
GC content48% 
IMG OID641275721 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001541034 
Protein GI159041782 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000342609 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.287325 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAAGT CAAGTGACTT CATACCCAGT GGTGTTAAAA GAAGCGATAT TTACCATGGT 
GGGTGGATTG ATCATGATAA GGATGGTTTA ATGGCCCCCT TCGAGGACCC CAGTAGGCCT
ATTGATGAGA GGGTTGAGGA TCTCTTGAGG AGGATGAGTC TTGAGGAGAA GGTGGCTCAA
TTAAGGTCGG ATCTAACTGA TAGGTTGGAT GTAGGTAACT TATCAGTGGT CCTTAGGGGT
ACTGAACCGA CTGAGGGTGC TGTTAAGGCT AATGATATTC AGAGGAGGTT CCTTGAGGAC
ACTAGGCTTG GTGTACCGGC TATTATTCAC GATGAGTGCC TCCACGGATG CATGGCTAAG
CACTCAACAG TCTTCCCCCA AGCCATAGCG CTGGCTGCGG CCTGGGACGT GGACTTAATG
TATAGGGTTG CTAAGGCTAT TGCAAGGGAG ACTAGGGCCA GGGGTATTAG GCAATGCCTA
TCACCCGTGG TTAACCTAAC CTTCGATGCT AGGGCTGGTA GGACTGAGGA GACTTATGGT
GAAGACCCCT ACTTAGCCTC ACAGTTAGCC TACGCCTACG TTAAGGCGCT TAGGGAGGAG
GGTATTGTGG CTACCCCTAA GCACTACATA ATGAACTTCG TTGGTGATGG TGGTAGGGAC
AGCGCTGAAA TACACATGAG TGAGAGGTTC ATTAGGGAGA CTGAGTTACC GGTTTTCAGA
GCGGCCATTA AGGCTGGGGC ATTATCATTA ATGGCTGCCT ACAACTCCAT AGACGGGGTA
CCATGCTCAA TGAACAAGTA CTGGCTAACG GAGGTACTTA GATGGGAATT GGGGTTCGAG
GGCTTCGTGG TGTCTGACTA CGGTTCAGTC ACCGGTATAG TTAATAGGCA CTACATAACC
GATAACCCGG AGGAGGTTGC TAAACTAGCC CTTGAGGCTG GCCTAGATGT TGAATTCCCA
GGATTCTCAA TATACGGTGA ACCACTGGTT AGGGCTATTA GGAGGGGGTT GATTAGCGAG
GAGGCGCTTA ATGAGGCTGT TAGGAGGGTT TTGAGGGCTA AATTCCTAAT AGGCCTATTC
GACTCACCTT ACGTTGACCC CGAGGAAGCT AAGGTTATTG GCTCAGAGGA GCATAGGCGG
TTAGCCCTTG AGGCCGCTGA GAAGGCTATT GTACTGCTTA AGAATGATGG TGTACTGCCT
ATTGATAAGT CTAGGGTTAA GGCAATAGCC CTAATAGGCC CCTTCGCGGA TGAGGTTAAG
TTAGGTGGCT ATAGTGCAAT ACCTAAGAGT GTAATAACAC CCCTTGAAGC CTTTAAGGCC
AGGGGCATTA ATGTGATTCA CGCTAAGGGA TGCATAGGGG ATATGGATGC TGACCACCCA
ATACCAACAA GGTACCTTAC ACCAATGGGT GAGCCTAATA GACACGGCTT GAGGGGTGAG
TACTTCAATA ACCCTAACCT TGAGGGTGAG CCTATTGGCG TTAGGATTGA TGCACCGTGG
GAGGGATTCT TCAGACTTGA CATAGGCTAC GACCCACCAT ACCAGGGCCT TGACCCAGGA
AGATACTCCA TTAGGTGGAT TGGTTACATT ACCCCACCCG TATCAGGCAC GTATGAGTTT
AAGGTTTACG CCGCTGGCGG TGGATTTAGG CTTACTGTTG ATGGTAAGAC AATTGTGGAT
TCATGGGGTG TGGCCAGTAA TTCACCTAAG AGTGGTTCAA TTAGGCTTGA GGGTGGGAGG
CAGTATGAGA TTAGGCTTGA GTACGGTAGG TTGAACTACG GTTACGCCTA CATTAAGCTT
GGCTGGGATT TAATCGAGGA CTCAATGATT AAGGAGGCTG TTGACGCCGC TTCGAAGGCT
GATGCAGTGG TTGTGTTTGC AGGCATTATT GAGGGTGAAC AGAGGGATAG GGCATCATTA
AGGTTACCCA AGTGCCAGGA GAGGCTTATT GAGGAGGTGC TTAAGGTTAA TAAGAACGTT
GCAGTGGTTT TAACCACTGG AAGCCCAGTC GTCGGTGAAT GGATTAATAA TGTCCCAGCC
CTGGTTGAGG CCTGGTACCC TGGTGAAATG GGCGGGGAGG CTATTGCCCA AGTGCTTTTA
GGTGAATATA ACCCGGGTGG TAAATTACCT TTAACCTGGC CTATTCACGA GGGCCAGGAA
CCATTATACT ACTTCACTAA GCCCAGTGGT AGGGTTTATG ATTACGTTAA CATGCCTCCA
ACACCACTAT TCCCGTTCGG ACACGGGCTA AGCTACACTC AATTCAAGTA CAGTGACCTC
AAGGTGAGCG TTAATGAGGA TGATGGAGTG GTTTCAGTAT CACTTAACGT TGAGAATATA
GGTAAGTATG AGGGTGATGA AGTTGTTCAA CTATACGTTA GGGATAGGTA CTCAAGTATA
GCTAGACCAT TAATGATGCT TAAGGGCTTT AGGAGAATAA CGCTTAAGCC CGGTGAGAAG
ACTGTAGTTG AATTCAAGTT AACCTTAGAT GACTTAGCAA TGTATGATGC AGGGTTCAGG
CGTATTGTGG AGCCTGGAGC ATACCAGGTC CTTGTTGGTT CATCATCAAT GGACATTAGG
CTAATGGGTG AATTCAAGTT AACTCAACTG GTTAAGGGTA TTGTGAGCGT GACGAGTGTT
AATGCTGATA AGGTCAATGT TAAGGCTGGT GAATCAATTA GGGTTAAGGC CACGTTAAGA
AATGAAGGTA AAGTAGGTGA CCTAGTACCC ATTACGCTTA AGGTTAATGG TAGGGTGATT
GAGGAGCATA GAGTGTACTT GGATCCAGGT GAGGAGAGGA TAGTGAACTT CACGGTGAAG
CTACATGAGG CAGGGAAGCA GGTGGTTTCA GTGGCAGTGC CCGAGGGGGA GAAGTCAGTA
ACCATAGATG TAACCCAATA G
 
Protein sequence
MSKSSDFIPS GVKRSDIYHG GWIDHDKDGL MAPFEDPSRP IDERVEDLLR RMSLEEKVAQ 
LRSDLTDRLD VGNLSVVLRG TEPTEGAVKA NDIQRRFLED TRLGVPAIIH DECLHGCMAK
HSTVFPQAIA LAAAWDVDLM YRVAKAIARE TRARGIRQCL SPVVNLTFDA RAGRTEETYG
EDPYLASQLA YAYVKALREE GIVATPKHYI MNFVGDGGRD SAEIHMSERF IRETELPVFR
AAIKAGALSL MAAYNSIDGV PCSMNKYWLT EVLRWELGFE GFVVSDYGSV TGIVNRHYIT
DNPEEVAKLA LEAGLDVEFP GFSIYGEPLV RAIRRGLISE EALNEAVRRV LRAKFLIGLF
DSPYVDPEEA KVIGSEEHRR LALEAAEKAI VLLKNDGVLP IDKSRVKAIA LIGPFADEVK
LGGYSAIPKS VITPLEAFKA RGINVIHAKG CIGDMDADHP IPTRYLTPMG EPNRHGLRGE
YFNNPNLEGE PIGVRIDAPW EGFFRLDIGY DPPYQGLDPG RYSIRWIGYI TPPVSGTYEF
KVYAAGGGFR LTVDGKTIVD SWGVASNSPK SGSIRLEGGR QYEIRLEYGR LNYGYAYIKL
GWDLIEDSMI KEAVDAASKA DAVVVFAGII EGEQRDRASL RLPKCQERLI EEVLKVNKNV
AVVLTTGSPV VGEWINNVPA LVEAWYPGEM GGEAIAQVLL GEYNPGGKLP LTWPIHEGQE
PLYYFTKPSG RVYDYVNMPP TPLFPFGHGL SYTQFKYSDL KVSVNEDDGV VSVSLNVENI
GKYEGDEVVQ LYVRDRYSSI ARPLMMLKGF RRITLKPGEK TVVEFKLTLD DLAMYDAGFR
RIVEPGAYQV LVGSSSMDIR LMGEFKLTQL VKGIVSVTSV NADKVNVKAG ESIRVKATLR
NEGKVGDLVP ITLKVNGRVI EEHRVYLDPG EERIVNFTVK LHEAGKQVVS VAVPEGEKSV
TIDVTQ