Gene Information |
Locus tag | Cmaq_0830 |
Symbol | |
ID | 5709840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 871868 |
End bp | 873262 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641275333 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001540655 |
Protein GI | 159041403 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
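The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record for Cmaq_0830.
length = gene_length_bp(871868, 873262)
print(length)                     # 1395
print(protein_length_aa(length))  # 464
```

Under these assumptions the computed values agree with the reported gene length of 1395 bp and protein length of 464 aa.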
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCTA GGCATGGTTT TAAGATAGCC CTAATAGGGG CTGGTAGTGC GGCGTGGGCT ATTGGTCTTA TTAAGGACCT AGCCCTAATA CCAAGCTTAA GCGGTAGTAC CGTGGTTTTA ATGGATATTG ATGAGGATAG GTTAGCTTTA GTTAGTAGGT TTGCCAAGAG GTATGTTTCT GAGGTTAAGG GTAATTTAAA CATAGTTACC ACCACTGATA GGAGGGAGGC TATTAGGGAT GCTGACTTCG TGGTTAACTC AACCCTGGCT AAGGGGCATG GGCACTATGA AAGGATGAGG GAGGTTTCTG AGAAGTACGG GTACTATAGG GGTATTAATA GTGTTGAGTG GAACATGGTG TCTGATTACC ACACAATATG GGGCTACTAC CAGTTTAAAC TAGCCCTAGA TATAGCTAAT GATGTGGTGG ATTACGCACC TAATGCATGG TTACTTAACG TATCTAACCC GGTCTTCGAG TTAACAACAT TGATTAGTAG GGAGACTAAG GCTAGGGTTA TTGGGTTGTG TGACGGCTAC TACGCCTATA GGGATTTACT CAGGGTTCTT GGTCTCGAGG AGGGTAAGGC TGAGGTTGAG GTTATTGGTG TTAATCATGA TGACTGGTTA ACTAGGCTTA AGTACAATGG CGAAGACGCA TACCACCTTA TTGATGAGTG GATCAGCACT AAGTCCAGTC AATACTTCGA GAAGTGGAGG GAGGAGCAGA GTAACCCCTT TGATGTTCAT GTTTCACCAG TGGCGGTTGA CATGTATAGG ATGTATGGCC TATGGCCAAT AGGGGACACC GTTAGGAGTG GTACATGGAA GTACCACTGG GATCTTAAGA CCAAGCAATA CTGGTATGGG CCACTCGGTG GACCTGACTC AGAGATTGGG TGGGCCATGT ACTTAACGTG GCATAAGATC GAGTTCAATG AGCTTAAGAG GGCGCTTGAG AATGAGGCTA AGCCATTAAC AGACTACATA CCGCCAGTTA GAAGTGAGGG TGAGCCGGTT ACAATGGTTA TTGAGGCTAT TGTTGAGGAT AGTGGTAAGG TAATTGAGGT TAATGTACCT AATCAGGATG CAATACCTGG AATACCCAGT GATGTGGCTG TTGAAATGCC GGCTAGGGTG GATGCTAAGG GTGTTCATAG ATTAAGCTTC AGTAACCTAC CTAAGGCGTG GGGTAAGGTG CTTAAGTACG CTATAATGCC TAGGGTAATT AGGGGTGAGT GGGCGATTGA GGCATTCCTA GGGGGTGGTA GAGACACGTT ATTTAACTGG CTTATAATTG ATCCAAGGAC TAAGTCCAGT GATCAAGTCA ACCAGGTTAT AGATGCAATA CTTAAAATAC CTGGAAATGA GGAAATGGCT AAACACTTCA GTTAA
|
Protein sequence | MASRHGFKIA LIGAGSAAWA IGLIKDLALI PSLSGSTVVL MDIDEDRLAL VSRFAKRYVS EVKGNLNIVT TTDRREAIRD ADFVVNSTLA KGHGHYERMR EVSEKYGYYR GINSVEWNMV SDYHTIWGYY QFKLALDIAN DVVDYAPNAW LLNVSNPVFE LTTLISRETK ARVIGLCDGY YAYRDLLRVL GLEEGKAEVE VIGVNHDDWL TRLKYNGEDA YHLIDEWIST KSSQYFEKWR EEQSNPFDVH VSPVAVDMYR MYGLWPIGDT VRSGTWKYHW DLKTKQYWYG PLGGPDSEIG WAMYLTWHKI EFNELKRALE NEAKPLTDYI PPVRSEGEPV TMVIEAIVED SGKVIEVNVP NQDAIPGIPS DVAVEMPARV DAKGVHRLSF SNLPKAWGKV LKYAIMPRVI RGEWAIEAFL GGGRDTLFNW LIIDPRTKSS DQVNQVIDAI LKIPGNEEMA KHFS
|
| |
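The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: `translate_cds` is a hypothetical helper, and it assumes the gene sequence is given 5' to 3' on the coding strand, as the leading ATG and trailing TAA suggest. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation; table 11 differs only in the start codons it permits.

```python
# Standard codon table in the conventional NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring the spaces used to group bases.

    A single trailing stop codon, if present, is dropped from the result.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First 18 bases of the gene sequence, spacing kept as in the record.
print(translate_cds("ATGGCATCTA GGCATGGT"))  # MASRHG
```

The output matches the first six residues of the protein sequence in the record. Passing the full gene sequence to the same function would be expected to give the complete 464-residue protein under the assumptions stated above.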