Gene Cmaq_1215 details


Gene Information

Locus tag: Cmaq_1215
Symbol:
ID: 5709758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1281040
End bp: 1282425
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 44%
IMG OID: 641275719
Product: glycoside hydrolase family protein
Protein accession: YP_001541032
Protein GI: 159041780
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
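
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1281040
end_bp = 1282425

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1386
print(protein_length_aa)  # 461
```

Both results agree with the Gene Length and Protein Length fields.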


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.505434
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0851438
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAAC AAATTAAGAA CATAAAGATT TGTATAATAG GTGGAGGCAG TCATACGTTT 
ATAGCAAGTA TACTTAGGGA TATTGCATTA ACAAAGAGTA TTCATGGAAT CACACTAACC
CTAATGGATA TTGATGAACA TAGATTAGCT AGAAGCTACA TGCTGGCCAG GAAGTATTTC
GATGAACTTA AGGTACCCAT TAATCTAGAA AGAACCACTG ATACTAAGGC TTGTATTGAA
GGCGCCTCCT TCGTAATTAA CCTAGCCTTC GCAATAGGTT ACGATCACTG GGGCATTCAG
GTTGAGGCTG CTGAGAGGCA TGGGTACTAT AGGGGTATTG ATGCAACTGA GTGGAATATG
GTGTGCTGCT ACCCATCATT AACCGGGTTT AAGCAGTATA ATGTGGCGTT GAAAATAGCA
GGCATAATGG ATGAGATTAA TAGGGATGCT TGGTTAATTC AAGTCTCCAA CCCAGTTCTC
GAAACAACAA CTTTAGTACA TAGGCAGTAT CCTAAGCTTA AAATTATTGG TTACTGCCAT
GGAGCGCCCG GCGGTGTTAG ATTATTGGTT GAGAAGGCGT TGAAACTTGA TATGAGGAGG
ATTGAGTGGC AGGCGGTTGG CTTAAATCAC GTGGTGTTTC TAACTAGGTT TAAGTATAAT
GGTGAAGACG CCTACCACTT GATTGATGAG TGGATTGAGA AGAAGGCTGA GGAATTCTGG
GCTAGTTACG TGCCTGGCCC ATGGGAAGAG ACCTTGAGTA GGGCTGCTGT GGACATGTAT
AGACTCTACG GCCTATACCC ACTTGGTGAC ACGGCTAGGA GTGGGACGTG GAAGTACCAT
AGGGATCTTA AAACCAAGAT ATATTGGTAT GGACCCATTG GTGGTGTTGA TTCTGAGGTA
GGGTGGGGGA TTAGGATGCT TAGGAATCAG GAGGCTGAGG CTAAGTTGGA GAATGCTGCA
TTCAACCCAA GCATTAAGGC CACTGAGGCT TATCCACCGG TTAAGAGTGG TGAGCAGATT
ATTGATTTCA TAGATAGTGT TGTTAATAAC GTTGAGAGGA GAATGATACT AAACATACCC
AATAATGGCG TATTACCTAG ACTACCAAGC GACGCCATAG TTGAGGCGCC GGTGTACGTT
AAGGGTGAGG TAATTAGGCC TGAGGCCATT GAGAATGTAC CAAATAAAAT GTACTCATAC
GTATGGTACC CTAGAATAGC CGTCACTGAG AGGGCACTTG AAGCCTACTT AGCTGGTAGC
AAGGAGTTAC TTATTGAAGC ATTGATGTTC GACCCAAGGA CCAAGAGCAC TGAACAGGCG
AGGGAGGTTA TTGATGAAAT ACTGAACCTA CCGTTTAACG AGGATATGAA GAAGCACTAT
AAGTGA
 
Protein sequence
MSEQIKNIKI CIIGGGSHTF IASILRDIAL TKSIHGITLT LMDIDEHRLA RSYMLARKYF 
DELKVPINLE RTTDTKACIE GASFVINLAF AIGYDHWGIQ VEAAERHGYY RGIDATEWNM
VCCYPSLTGF KQYNVALKIA GIMDEINRDA WLIQVSNPVL ETTTLVHRQY PKLKIIGYCH
GAPGGVRLLV EKALKLDMRR IEWQAVGLNH VVFLTRFKYN GEDAYHLIDE WIEKKAEEFW
ASYVPGPWEE TLSRAAVDMY RLYGLYPLGD TARSGTWKYH RDLKTKIYWY GPIGGVDSEV
GWGIRMLRNQ EAEAKLENAA FNPSIKATEA YPPVKSGEQI IDFIDSVVNN VERRMILNIP
NNGVLPRLPS DAIVEAPVYV KGEVIRPEAI ENVPNKMYSY VWYPRIAVTE RALEAYLAGS
KELLIEALMF DPRTKSTEQA REVIDEILNL PFNEDMKKHY K
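
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is a minimal, dependency-free illustration, not the pipeline that produced this record. It assumes the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 amino-acid assignments equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate complete codons in reading frame 0; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence as printed above.
first_row = "ATGAGCGAAC AAATTAAGAA CATAAAGATT TGTATAATAG GTGGAGGCAG TCATACGTTT"
print(translate(first_row))  # MSEQIKNIKICIIGGGSHTF
```

The translated row matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` should give the 461-residue protein followed by `*` for the terminal TGA codon, and `gc_fraction` on the same input can be compared with the GC content field.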