Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_1215 |
Symbol | |
ID | 5709758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 1281040 |
End bp | 1282425 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641275719 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001541032 |
Protein GI | 159041780 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.505434 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0851438 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAC AAATTAAGAA CATAAAGATT TGTATAATAG GTGGAGGCAG TCATACGTTT ATAGCAAGTA TACTTAGGGA TATTGCATTA ACAAAGAGTA TTCATGGAAT CACACTAACC CTAATGGATA TTGATGAACA TAGATTAGCT AGAAGCTACA TGCTGGCCAG GAAGTATTTC GATGAACTTA AGGTACCCAT TAATCTAGAA AGAACCACTG ATACTAAGGC TTGTATTGAA GGCGCCTCCT TCGTAATTAA CCTAGCCTTC GCAATAGGTT ACGATCACTG GGGCATTCAG GTTGAGGCTG CTGAGAGGCA TGGGTACTAT AGGGGTATTG ATGCAACTGA GTGGAATATG GTGTGCTGCT ACCCATCATT AACCGGGTTT AAGCAGTATA ATGTGGCGTT GAAAATAGCA GGCATAATGG ATGAGATTAA TAGGGATGCT TGGTTAATTC AAGTCTCCAA CCCAGTTCTC GAAACAACAA CTTTAGTACA TAGGCAGTAT CCTAAGCTTA AAATTATTGG TTACTGCCAT GGAGCGCCCG GCGGTGTTAG ATTATTGGTT GAGAAGGCGT TGAAACTTGA TATGAGGAGG ATTGAGTGGC AGGCGGTTGG CTTAAATCAC GTGGTGTTTC TAACTAGGTT TAAGTATAAT GGTGAAGACG CCTACCACTT GATTGATGAG TGGATTGAGA AGAAGGCTGA GGAATTCTGG GCTAGTTACG TGCCTGGCCC ATGGGAAGAG ACCTTGAGTA GGGCTGCTGT GGACATGTAT AGACTCTACG GCCTATACCC ACTTGGTGAC ACGGCTAGGA GTGGGACGTG GAAGTACCAT AGGGATCTTA AAACCAAGAT ATATTGGTAT GGACCCATTG GTGGTGTTGA TTCTGAGGTA GGGTGGGGGA TTAGGATGCT TAGGAATCAG GAGGCTGAGG CTAAGTTGGA GAATGCTGCA TTCAACCCAA GCATTAAGGC CACTGAGGCT TATCCACCGG TTAAGAGTGG TGAGCAGATT ATTGATTTCA TAGATAGTGT TGTTAATAAC GTTGAGAGGA GAATGATACT AAACATACCC AATAATGGCG TATTACCTAG ACTACCAAGC GACGCCATAG TTGAGGCGCC GGTGTACGTT AAGGGTGAGG TAATTAGGCC TGAGGCCATT GAGAATGTAC CAAATAAAAT GTACTCATAC GTATGGTACC CTAGAATAGC CGTCACTGAG AGGGCACTTG AAGCCTACTT AGCTGGTAGC AAGGAGTTAC TTATTGAAGC ATTGATGTTC GACCCAAGGA CCAAGAGCAC TGAACAGGCG AGGGAGGTTA TTGATGAAAT ACTGAACCTA CCGTTTAACG AGGATATGAA GAAGCACTAT AAGTGA
|
Protein sequence | MSEQIKNIKI CIIGGGSHTF IASILRDIAL TKSIHGITLT LMDIDEHRLA RSYMLARKYF DELKVPINLE RTTDTKACIE GASFVINLAF AIGYDHWGIQ VEAAERHGYY RGIDATEWNM VCCYPSLTGF KQYNVALKIA GIMDEINRDA WLIQVSNPVL ETTTLVHRQY PKLKIIGYCH GAPGGVRLLV EKALKLDMRR IEWQAVGLNH VVFLTRFKYN GEDAYHLIDE WIEKKAEEFW ASYVPGPWEE TLSRAAVDMY RLYGLYPLGD TARSGTWKYH RDLKTKIYWY GPIGGVDSEV GWGIRMLRNQ EAEAKLENAA FNPSIKATEA YPPVKSGEQI IDFIDSVVNN VERRMILNIP NNGVLPRLPS DAIVEAPVYV KGEVIRPEAI ENVPNKMYSY VWYPRIAVTE RALEAYLAGS KELLIEALMF DPRTKSTEQA REVIDEILNL PFNEDMKKHY K
|
| |