Gene Cmaq_0830 details

Gene Information

Locus tag: Cmaq_0830
Symbol:
ID: 5709840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 871868
End bp: 873262
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 45%
IMG OID: 641275333
Product: glycoside hydrolase family protein
Protein accession: YP_001540655
Protein GI: 159041403
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
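
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (neither is stated on this page). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(871868, 873262)
print(length_bp)                  # 1395
print(protein_length(length_bp))  # 464
```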


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATCTA GGCATGGTTT TAAGATAGCC CTAATAGGGG CTGGTAGTGC GGCGTGGGCT 
ATTGGTCTTA TTAAGGACCT AGCCCTAATA CCAAGCTTAA GCGGTAGTAC CGTGGTTTTA
ATGGATATTG ATGAGGATAG GTTAGCTTTA GTTAGTAGGT TTGCCAAGAG GTATGTTTCT
GAGGTTAAGG GTAATTTAAA CATAGTTACC ACCACTGATA GGAGGGAGGC TATTAGGGAT
GCTGACTTCG TGGTTAACTC AACCCTGGCT AAGGGGCATG GGCACTATGA AAGGATGAGG
GAGGTTTCTG AGAAGTACGG GTACTATAGG GGTATTAATA GTGTTGAGTG GAACATGGTG
TCTGATTACC ACACAATATG GGGCTACTAC CAGTTTAAAC TAGCCCTAGA TATAGCTAAT
GATGTGGTGG ATTACGCACC TAATGCATGG TTACTTAACG TATCTAACCC GGTCTTCGAG
TTAACAACAT TGATTAGTAG GGAGACTAAG GCTAGGGTTA TTGGGTTGTG TGACGGCTAC
TACGCCTATA GGGATTTACT CAGGGTTCTT GGTCTCGAGG AGGGTAAGGC TGAGGTTGAG
GTTATTGGTG TTAATCATGA TGACTGGTTA ACTAGGCTTA AGTACAATGG CGAAGACGCA
TACCACCTTA TTGATGAGTG GATCAGCACT AAGTCCAGTC AATACTTCGA GAAGTGGAGG
GAGGAGCAGA GTAACCCCTT TGATGTTCAT GTTTCACCAG TGGCGGTTGA CATGTATAGG
ATGTATGGCC TATGGCCAAT AGGGGACACC GTTAGGAGTG GTACATGGAA GTACCACTGG
GATCTTAAGA CCAAGCAATA CTGGTATGGG CCACTCGGTG GACCTGACTC AGAGATTGGG
TGGGCCATGT ACTTAACGTG GCATAAGATC GAGTTCAATG AGCTTAAGAG GGCGCTTGAG
AATGAGGCTA AGCCATTAAC AGACTACATA CCGCCAGTTA GAAGTGAGGG TGAGCCGGTT
ACAATGGTTA TTGAGGCTAT TGTTGAGGAT AGTGGTAAGG TAATTGAGGT TAATGTACCT
AATCAGGATG CAATACCTGG AATACCCAGT GATGTGGCTG TTGAAATGCC GGCTAGGGTG
GATGCTAAGG GTGTTCATAG ATTAAGCTTC AGTAACCTAC CTAAGGCGTG GGGTAAGGTG
CTTAAGTACG CTATAATGCC TAGGGTAATT AGGGGTGAGT GGGCGATTGA GGCATTCCTA
GGGGGTGGTA GAGACACGTT ATTTAACTGG CTTATAATTG ATCCAAGGAC TAAGTCCAGT
GATCAAGTCA ACCAGGTTAT AGATGCAATA CTTAAAATAC CTGGAAATGA GGAAATGGCT
AAACACTTCA GTTAA
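
The gene sequence is displayed in space-separated blocks of 10 bases. The following is a minimal Python sketch for stripping that layout and computing the length and GC fraction of a sequence. `clean_sequence` and `gc_fraction` are hypothetical helper names, and the demo string is only the first line of the sequence above, so its GC fraction need not match the value reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


first_line = "ATGGCATCTA GGCATGGTTT TAAGATAGCC CTAATAGGGG CTGGTAGTGC GGCGTGGGCT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this first line only
```

Passing the full gene sequence through the same two functions gives values that can be compared with the Gene Length and GC content fields.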
 
Protein sequence
MASRHGFKIA LIGAGSAAWA IGLIKDLALI PSLSGSTVVL MDIDEDRLAL VSRFAKRYVS 
EVKGNLNIVT TTDRREAIRD ADFVVNSTLA KGHGHYERMR EVSEKYGYYR GINSVEWNMV
SDYHTIWGYY QFKLALDIAN DVVDYAPNAW LLNVSNPVFE LTTLISRETK ARVIGLCDGY
YAYRDLLRVL GLEEGKAEVE VIGVNHDDWL TRLKYNGEDA YHLIDEWIST KSSQYFEKWR
EEQSNPFDVH VSPVAVDMYR MYGLWPIGDT VRSGTWKYHW DLKTKQYWYG PLGGPDSEIG
WAMYLTWHKI EFNELKRALE NEAKPLTDYI PPVRSEGEPV TMVIEAIVED SGKVIEVNVP
NQDAIPGIPS DVAVEMPARV DAKGVHRLSF SNLPKAWGKV LKYAIMPRVI RGEWAIEAFL
GGGRDTLFNW LIIDPRTKSS DQVNQVIDAI LKIPGNEEMA KHFS
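
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal Python sketch, assuming the standard codon-to-amino-acid assignments (table 11 shares these with the standard code; the alternative start codons it permits are not handled here). `translate` is a hypothetical helper, and the demo inputs are the first 30 bases and the last line of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second and third base each cycle T, C, A, G,
# with the third base varying fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCATCTA GGCATGGTTT TAAGATAGCC"))  # MASRHGFKIA
print(translate("AAACACTTCA GTTAA"))                  # KHFS
```

The two outputs match the first ten and last four residues of the protein sequence listed above.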