Gene Cmaq_1276 details


Gene Information

Locus tag: Cmaq_1276
Symbol:
ID: 5708676
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1344173
End bp: 1346239
Gene Length: 2067 bp
Protein Length: 688 aa
Translation table: 11
GC content: 46%
IMG OID: 641275782
Product: hypothetical protein
Protein accession: YP_001541093
Protein GI: 159041841
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4354] Predicted bile acid beta-glucosidase
TIGRFAM ID:
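
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1344173
END_BP = 1346239


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2067 688
```

Under these assumptions the result matches the listed gene length of 2067 bp and protein length of 688 aa.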


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0000117637
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTTAGGT ATACGTGCGG GGATGTCTTG GTGAGTGGTA TACCGCTTGG TGGTATTGGT 
TCTGGTGGTG TTGAGGTTAG TAATGATGGT AGGCTTATTA ATGCTAGGTT TGCTAATAAT
TGGGCTTACC CGATTAGGGA TTTGAGGGGT TTTCACATTT TCATTAAGCC TCATGATGCC
TCAGGGTTCT TCATGCATTG TAGAGTTAAT GTGCTTGGCC TTGAGGGTAG GGGTGCCTTA
ATTGGCTTTG AGGGGCGTTG GCCCTTTGCT TGGCTTAGGG CATTTAGAAA TGGTGTGAAT
GTTGAGGTTG AGGCTTTTTC ACCAATAATA CCAGGGAACC TGAAGGACTC AACACTACCG
GTAATAGGGT TTACTATTAG GGTTAAGGGC TCTGATGCCT TAGCCGCTGT ATCAGTGCCT
AATGTGGTTG GCACTAATCC AATTGGTAGA ATTAATAGGA GCATTAATGG TGGCGTATTA
TTCACTAACA ATAAGGCTCC CGATAATGAT CCAGCTAAGG GTAATATAGC CCTAATTACT
GAGGAGCCTA GGTTCACTAT TACTCAATAT AATATTAATA GTAAACCCGA GCACGCCCTT
AAGGCTAGGA CTTGGAAGGG TGCCTTTGAG AACCCGGAAC CCTGGTTAAC CATAGATAAG
GGTGGTGTAC CCACTGGTGA GGAGCCCCAT GAGGTTACTG GGCTTTGGGA TGACCCAGCA
GGCTTAATTG CCTTAAACGT ACCTAATGGT GGGGAGGTTA GGTTTACCTT ATCCTGGTTC
TTCAATGGTA GGTGGCATTT ATATAATTAC GGACACTACT ACGAGAACTT CTTCAAGGAT
TCAAGCGAGG TTGCTAGGTA TGTGCTTGAT GAGTTTGATA GGCTTAGGAC CAGTACCCTT
GATTGGCAGA ATAGCTTAAT TGACCCAGCA TTACCTGATT GGCTTAGGGA TGCTGTAGTG
AACTCAACCT ACATATTAAC CACCAGTACC TGGCTTACGA GGGATGGTAG GTTCAGTATC
CTTGAGGGTG TTGAGGTTTG CCCATGCCAT GGTACATTAG CTGGAGCATG CTATGAGACT
GGTTCACTAC CGGTTGTCTT AATGTTCCCT GAATTGGAGA AGTCACTTCT AAGGCAGTTC
ACCGAGGCCA TGAGGAGTGA TGGCTATATT CCACATAGCT TAGGCATCTA TAGCCTGGAC
CATATTGAGG ATGGAACCAC TGCGCCACCG AGGTGGAAGG ACTTGAATTC AACATACATA
CTCCTAGTGC ATAGATACTT CAAGAGGAGT AATGATGTTG AGTTCATTAA GGAGATTTAC
CCCAAGCTAA TTAAAGCCTT TGAATGGGTC CTGGTTCAGG ATAAGGATGG TGATGGTGTA
CCTGAACTCA GTGGTGATGG TGACACCGGC TTTGATGCAA TGTCGGTTAA AGGCTTTGAC
AGCTACACTA CCAGCCTTTG GATTGCGGCT TTAATGGTCA TGGGTGAGTT AGCTAAGCTT
ATGGGTGACC AAGCTACGTT GAGTAAAGTG GAGTCAACAT TACTTAAGGC TAGGGACTCC
TATAATAGGC GTTGGCTTGG GGATAGGTTT AAGGCCTGGG ATGAACCAGA CATGGGTAAG
GCATCCTTCC TGGCTCAGAT TTGGGGTGAG TGGTGGAGCC TAATGCTTGG CTTAGGTCAC
ATTACTGATG AGGATAAGGT TAAGGCCGCC ATGGGCACTA TAATCAGGGT TAATGGTTCA
GCATCACCAT ACACTACACC TAATCTCGCT GATGAGGATA AGGGCATTAT AGGTTACAGC
CCGCAAACAT ACTCATCCTG GCCTAGGCTG GTTTTCACAA TGATGAGTGT TGCAAGGGAG
CTTGGGGTGG ATGGGTGGCT TGATGTGGTT AAGAAGGAGT GGGATAACTT GGTTAGGCAG
GGTTTAACAT GGAATCAACC ATCGAGAATA GATGGCAGGA CAGGTAAACC TGAACCTGAG
AGAGGGTTCC TTGACCATTA CATTGGTAGC CCGGCGCCGT GGAGCCTAAC GTATAAGTAC
GCCTTAAGTA AGTTGAAGAT TCATTAA
 
Protein sequence
MVRYTCGDVL VSGIPLGGIG SGGVEVSNDG RLINARFANN WAYPIRDLRG FHIFIKPHDA 
SGFFMHCRVN VLGLEGRGAL IGFEGRWPFA WLRAFRNGVN VEVEAFSPII PGNLKDSTLP
VIGFTIRVKG SDALAAVSVP NVVGTNPIGR INRSINGGVL FTNNKAPDND PAKGNIALIT
EEPRFTITQY NINSKPEHAL KARTWKGAFE NPEPWLTIDK GGVPTGEEPH EVTGLWDDPA
GLIALNVPNG GEVRFTLSWF FNGRWHLYNY GHYYENFFKD SSEVARYVLD EFDRLRTSTL
DWQNSLIDPA LPDWLRDAVV NSTYILTTST WLTRDGRFSI LEGVEVCPCH GTLAGACYET
GSLPVVLMFP ELEKSLLRQF TEAMRSDGYI PHSLGIYSLD HIEDGTTAPP RWKDLNSTYI
LLVHRYFKRS NDVEFIKEIY PKLIKAFEWV LVQDKDGDGV PELSGDGDTG FDAMSVKGFD
SYTTSLWIAA LMVMGELAKL MGDQATLSKV ESTLLKARDS YNRRWLGDRF KAWDEPDMGK
ASFLAQIWGE WWSLMLGLGH ITDEDKVKAA MGTIIRVNGS ASPYTTPNLA DEDKGIIGYS
PQTYSSWPRL VFTMMSVARE LGVDGWLDVV KKEWDNLVRQ GLTWNQPSRI DGRTGKPEPE
RGFLDHYIGS PAPWSLTYKY ALSKLKIH
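
Both sequences are printed in blocks of ten characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and scored for GC fraction. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in which codons may act as start codons), and it does not handle ambiguous bases such as N. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Codon assignments in TCAG order; identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks and the final line of the gene sequence above.
print(translate("ATGGTTAGGT ATACGTGCGG"))          # MVRYTC
print(translate("GCCTTAAGTA AGTTGAAGAT TCATTAA"))  # ALSKLKIH
```

The two translated fragments agree with the start (MVRYTC...) and end (...ALSKLKIH) of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.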