Gene Information |
Locus tag | Cmaq_1692 |
Symbol | |
ID | 5709110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | + |
Start bp | 1769660 |
End bp | 1771891 |
Gene Length | 2232 bp |
Protein Length | 743 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641276200 |
Product | Alpha-glucosidase |
Protein accession | YP_001541505 |
Protein GI | 159042253 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
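The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced with a stop codon as its final codon (the record lists the gene as not spliced). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(1769660, 1771891)
residues = protein_length_aa(length)
print(length, residues)  # prints "2232 743"
```

Under those assumptions the result agrees with the listed Gene Length (2232 bp) and Protein Length (743 aa).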
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.90141 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATTTT CAATGAAAGT AAGCCTAATA AACGATGATG CATTAAAGGT AAGTATAATT AAAAGCGGCA GTAGGTACCG GGAATCCCCT GCTGTTGTTG TTAAGCCGAG TGTTGAATTA GTAAGTGGTG AGAATAGGCT TGGTCCATGG CTTGTTAAGG TTGCTGAAGA TTCCATTAAT GTAAGTGTAA ATAACATGAA TGCAACATTA AGGTTCAGTT ATAGTAATGA TCAAATAATA GTGAGGGGTA ATTTAGGCCT CAATGATGCA GTTTATGGAC TTGGTGAAAA GGCGTTACCA TTGAATAGGA AAAGGTTCAG GGTAACCATG TGGAACACTG ACGCCTATGG GTACAGGTAT GGTTCAGATC CACTGTATGT ATCAATACCG TTCTTCATAA TTACTAATAA GAATGGGGCA ATAGGCCACT TCGCTGATTC CACGGCTAAG GTAATTATTG ATCTTGGTGC AGAGAAGGAG GATGAGTTCA CGGTTATTGT GAATGATTAT CAACTGGATT ACTACATTAT TAGGGGGCCT AGGCTTAAGG ATGTGGTTAC TAGGTTCATT AACTTAACAG GTAAACCCAC CTTAATGCCT AAATGGGCGC TTGGGCATCA GCAAAGTAGG TACAGTTACT ACCCCCAGGA TAGGGTTATT GAGATTATTA AGACCTTTAA GGAGAAGGAA CTGGATAACA CTGTTGTATA CCTTGATATA CATTACATGG ATGGCTACAG AATATTCACC TGGAGTAAGG ATAGGTTCCC TAATCCCACT GAATTAGCTA AGGCGGCTCA TGAACTTGGT GTTAAATTAG TAACCATAGT GGATCCGTAT GTTAAAGTTG ATCCAAATTA CTACGTGTTT AAGGAGGGTA TTAATGGTAA TCACCTGTCG CTTGATGATG ATGGTGGATT ATCCATAGTT CAGGGTTGGC CAGGTAAATC AGCATTACCG GACTTCTTTA ATAAGGAGGC TAGGGAGTGG TGGGCTAGTC TCATTGAGCG TTGGGTTAGG GAGTATGGTG TTGACGGTAT TTGGCTAGAC ATGAATGAAC CAGCGGCCTT CGATTATCCC AATCACACTG TTTCAAGTAA AGTAATAACT CATAGACTTG ATGATGATTC AAGGGTGCCT CATGACTTCC TCCACAACGC CTATGCGCTA TATGAGGCTA TGGCAACATA TGATGGCTTA GTTAAGGCGG GTAGAAGACC ATTCGTATTA TCCAGGGCCG GTTACGCCGG TATCCAGAGG TATGCGGCAG TTTGGACTGG TGATAATACC AGTAATTGGG AACACTTGAG ACTGCAATTG CAGATACTCC TGGGTTTAAG TATATCAGGT GTCACATTCA TTGGCGCTGA TGTAGGTGGC TTTGCAAAAT ATGTTCCAGG GAGTGGTGGA AATGTTTTGT TTACTTTAAG TCCTGAACTA CTGGTTAGGT GGTATGAGTG GGCTATTTTC TTCCCACTGC TGAGGAACCA TGCCTCAATT GGGTCACCTG ACCAGGAACC CTGGGCCTTT GGGCCAAGAA CACTTGAATT AATTAAGAAT CTTCTGAGGC TCAGGGCTAG GTTAACCCCA TACTTATACT CATTAATGTG GCTTAGCCAC ATTAATGGTG AACCAATAGT TAGGCCATTG ATATACGAGT ACCCTAATGA TGAGGAGGTT ATTAATATTG ATGATGAATT CATGCTTGGG CCATTCATGC TAATAGCACC AATGTTAACC AGTGGTAATG CCAGGGAGGT TTACTTACCT GAGGGGGAAT GGGTTAATAT GTGGAGTGGT 
GAGGTGCTTA ACAAGGGATT CCACATTGTT GATGCACCAC TTGGTAAGCC ACCAGTATTC CTTAGGAGGG GTTCACTGAT ACCTGTACAG GAGACTCAGG GTGTTTTAGG CGTGCTGACG GTATTGGGTG AGGGGGAATT CACTGTTTAC GATGATGATG GTGAATCATC ATCACCAACA CCATCAACAT TAAGCCTAAG GATTAGTGGT GAATCAATTA CAGTAGGTAA TTGGATTAAT CCAATGCCTC AATCACCATC ATCAATAATA CTTGAGGCCT ATGTTAATAA GGAACCAGGT AAAGTAACTA TTAATGATAC TGAGGTGGCT AAGGCTAAGT TCAATATTGA ACCAGGTCCA CCATCATGGT ACATGGATAA GCTACTCTAC ATTAGAGCGG CAACTGGGAG TAATGTTAAA ATAATTAATT AA
|
Protein sequence | MEFSMKVSLI NDDALKVSII KSGSRYRESP AVVVKPSVEL VSGENRLGPW LVKVAEDSIN VSVNNMNATL RFSYSNDQII VRGNLGLNDA VYGLGEKALP LNRKRFRVTM WNTDAYGYRY GSDPLYVSIP FFIITNKNGA IGHFADSTAK VIIDLGAEKE DEFTVIVNDY QLDYYIIRGP RLKDVVTRFI NLTGKPTLMP KWALGHQQSR YSYYPQDRVI EIIKTFKEKE LDNTVVYLDI HYMDGYRIFT WSKDRFPNPT ELAKAAHELG VKLVTIVDPY VKVDPNYYVF KEGINGNHLS LDDDGGLSIV QGWPGKSALP DFFNKEAREW WASLIERWVR EYGVDGIWLD MNEPAAFDYP NHTVSSKVIT HRLDDDSRVP HDFLHNAYAL YEAMATYDGL VKAGRRPFVL SRAGYAGIQR YAAVWTGDNT SNWEHLRLQL QILLGLSISG VTFIGADVGG FAKYVPGSGG NVLFTLSPEL LVRWYEWAIF FPLLRNHASI GSPDQEPWAF GPRTLELIKN LLRLRARLTP YLYSLMWLSH INGEPIVRPL IYEYPNDEEV INIDDEFMLG PFMLIAPMLT SGNAREVYLP EGEWVNMWSG EVLNKGFHIV DAPLGKPPVF LRRGSLIPVQ ETQGVLGVLT VLGEGEFTVY DDDGESSSPT PSTLSLRISG ESITVGNWIN PMPQSPSSII LEAYVNKEPG KVTINDTEVA KAKFNIEPGP PSWYMDKLLY IRAATGSNVK IIN
|
| |
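The sequence fields in this record are printed in space-separated blocks of ten characters. A minimal sketch for working with them follows: it strips the whitespace and computes the GC fraction, which can then be compared with the GC content field when run on the full gene sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is just the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence shown above.
example = clean_sequence("ATGGAATTTT CAATGAAAGT")
print(len(example), gc_fraction(example))  # prints "20 0.25"
```

Pasting the complete Gene sequence value into `clean_sequence` also gives a quick way to confirm that its length matches the Gene Length field.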