Gene Cmaq_1537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1537 
Symbol 
ID5709052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1619226 
End bp1620425 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content45% 
IMG OID641276045 
Productglycoside hydrolase family protein 
Protein accessionYP_001541350 
Protein GI159042098 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.37521 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAGT TAAAGGTTCC AAGCGGCTTC ATGATTGGTG CAGCGTTATC AGCCTACCAG 
GTGGAGGGGA ATAATGTTAA CGCCGACTGG TGGCATTACG AGGGGGAGAG GTTACCTAGG
AGTGGTTCAG CGTGTGACTT CTGGAATAGG TATAGGGGTG ATATTGAGTT AGCTGCCTCA
CTTGGATTAA AGGCGCTTAG AATATCCATA GCCTGGGATA GAGTCATGCC CAGTGAGGGT
AAGGTTGATG ATGAGTCAAT GGATAGGTAC GTGGATATGA TTAAGGAGAT TAGGGGCCAT
GGTATGGAAC CCGTAGTAAC CCTCCACCAC TTCGTTAATC CAATGTGGTT CGCAACAAGG
GGTGGTTGGG TTAAGGAGGA TAATGTGAAG TACTTCCTGG ACTTCGTTAA GTATGTTGCT
GATTCAGTGG GTGATAGGGT TAGGTTCTGG TTAACCATTA ATGAAATCAA CCTATACCCA
ATACTAGCAT ACCTACTGGG TGTCTTCCCA CCCTTCATAA TGAACATGGA GTACATGTGG
AAGGCCTTGA TGAATCTACT TAAAGCCAGT GATAAGGCCT ATGAATTAAT CAAGAAGCCG
AGTAATCAAG TTGGGTTAAT AATACACATT ATGCCTGCTA GGCCGGCTTC AAGAATATCC
ATAACGGATT GGGGATTAGC CATGGGTATG AATTACGTGT TAAACAAGAT GATAGTGAAC
ACTCTAGCTA AGGGCAGGTT ACCTAATTGG CTTGGTGGCG GGGAGGTTGG TAAACTGGAT
TACGTTGGGT TAAACTACTA CACTGTGGCT AAGGTTAAGT TTAATCCATT AACCATGGGT
GAATTAGTGA CCTCTAGGCA GAGTCAAAGG GGTTGGGTTA TTAACCCAGG TGGCTTGAAA
TGGGCTATTA GGCTGGTTAG GAGAATAGGG AAGCCAATAA TGATTACTGA GAACGGCATA
GCCACGGATA ATGATGAGGA CAGGATAAGC TTCATTGAGA AGCACTTGGC AATAGCAATT
AAGGAGAAGG TACTGGGTTA CCTATACTGG AGTCTCCTTG ATAACTACGA GTGGGAAATG
GGCTATAATG CTAAGTTTGG TTTAATTGAA TGCGACCCAG TGACCTTAAC CAGGAGGCCT
AGGGGAAGCG CCTACTTCCT AGGTAAATTA GCCAGCGGTA ACCCAATTAC CTTGCATTAA
 
Protein sequence
MSKLKVPSGF MIGAALSAYQ VEGNNVNADW WHYEGERLPR SGSACDFWNR YRGDIELAAS 
LGLKALRISI AWDRVMPSEG KVDDESMDRY VDMIKEIRGH GMEPVVTLHH FVNPMWFATR
GGWVKEDNVK YFLDFVKYVA DSVGDRVRFW LTINEINLYP ILAYLLGVFP PFIMNMEYMW
KALMNLLKAS DKAYELIKKP SNQVGLIIHI MPARPASRIS ITDWGLAMGM NYVLNKMIVN
TLAKGRLPNW LGGGEVGKLD YVGLNYYTVA KVKFNPLTMG ELVTSRQSQR GWVINPGGLK
WAIRLVRRIG KPIMITENGI ATDNDEDRIS FIEKHLAIAI KEKVLGYLYW SLLDNYEWEM
GYNAKFGLIE CDPVTLTRRP RGSAYFLGKL ASGNPITLH