Gene Cmaq_0652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0652 
Symbol 
ID5709754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp687614 
End bp689074 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content48% 
IMG OID641275153 
Productglycoside hydrolase family protein 
Protein accessionYP_001540482 
Protein GI159041230 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value7.39457e-10 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAGT TCCCAAGCGA CTTCAGATTC GGCTTCTCCA CAGTGGGTAC TCAGCATGAG 
ATGGGTACCC CTGGTTCTGA ATTCGTAAGT GACTGGTATG TGTGGCTTCA TGACCCTGAG
AACATTGCTT CGGGCTTAGT TAGCGGTGAT TTACCTGAAC ATGGGCCAGG TTACTGGGAC
TTGTATAAGC AGGACCACTC AATAGCTAGG GATCTTGGGC TTGATGCAGC ATGGATAACT
ATTGAGTGGG CTAGGGTGTT CCCTAAGCCG ACCTTTGACG TTAAGGTTAA GGTTGATGAG
GATGATGGAG GTAACGTGGT TGACGTTGAG GTTAATGAAT CAGCATTAGA GGAGTTACGC
AGGCTAGCTG ACTTAAATGC TGTTAATCAC TATAGGGGGA TTTTAAGTGA TTGGAAGGAG
AGGGGTGGTT TACTGGTGAT TAACCTTTAC CACTGGGCTA TGCCTACGTG GCTTCATGAC
CCAATAGCCG TTAGGAAGAA TGGACCTGAT AGAGCCCCCT CCGGTTGGCT TGATAAGAGA
TCCGTTATTG AGTTCACTAA GTTCGCAGCC TTCATAGCCC ATGAGTTAGG TGACTTAGCT
GACATGTGGT ATACGATGAA TGAACCTGGG GTAGTGATAA CTGAGGGTTA CCTTTACGTT
AAGTCAGGCT TCCCACCAGG TTACCTGGAC TTAAACTCCC TAGCCACTGC GGGTAAGCAT
TTAATTGAGG CTCATGCCAG AGCCTACGAC GCCATTAAAG CCTACTCAAG GAAACCAGTG
GGCCTAGTCT ACTCCTTCGC AGACTATCAG CCGCTTAGGC AGGGTGATGA GGAGGCTGTT
AAGGAGGCTA AGGGACTTGA CTACTCATTC TTCGACGCTC CAATTAAGGG TGAATTAATG
GGGGTTACTA GGGATGACTT GAAGGGTAGG CTTGACTGGA TTGGGGTAAA CTACTACACT
AGGGCCGTAT TGAGGAGGAG GCAGGATGCT GGTCGGGCAT CAGTAGCCGT GGTGGATGGA
TTCGGCTACT CCTGTGAACC TGGAGGCGTA TCTAATGATA GGAGACCATG CAGTGACTTC
GGCTGGGAAA TATACCCTGA GGGTGTTTAC AATGTCTTAA TGGACCTATG GAGGAGGTAT
AGGATGCCCA TGTACATCAC TGAGAACGGT ATAGCTGATG AGCATGATAA GTGGAGGTCA
TGGTTCATAG TATCGCACCT GTATCAAATT CACAGGGCCA TGGAGGAGGG GGTGGATGTT
AGAGGGTACT TCCACTGGAA CCTAATAGAT AACTTGGAGT GGGCTGCAGG ATATAGGATG
AGGTTCGGCC TAGTTTACGT TGACTATGCA ACCAAGAGGA GGTATTTTAG GCCAAGCGCC
CTGGTTATGA GGGAGGTGGC TAAACAGAAG GCTATACCGG ATTACTTAGA GCATTACATT
AAACCACCTA GAATTGAATG A
 
Protein sequence
MIKFPSDFRF GFSTVGTQHE MGTPGSEFVS DWYVWLHDPE NIASGLVSGD LPEHGPGYWD 
LYKQDHSIAR DLGLDAAWIT IEWARVFPKP TFDVKVKVDE DDGGNVVDVE VNESALEELR
RLADLNAVNH YRGILSDWKE RGGLLVINLY HWAMPTWLHD PIAVRKNGPD RAPSGWLDKR
SVIEFTKFAA FIAHELGDLA DMWYTMNEPG VVITEGYLYV KSGFPPGYLD LNSLATAGKH
LIEAHARAYD AIKAYSRKPV GLVYSFADYQ PLRQGDEEAV KEAKGLDYSF FDAPIKGELM
GVTRDDLKGR LDWIGVNYYT RAVLRRRQDA GRASVAVVDG FGYSCEPGGV SNDRRPCSDF
GWEIYPEGVY NVLMDLWRRY RMPMYITENG IADEHDKWRS WFIVSHLYQI HRAMEEGVDV
RGYFHWNLID NLEWAAGYRM RFGLVYVDYA TKRRYFRPSA LVMREVAKQK AIPDYLEHYI
KPPRIE