Gene Information |
Locus tag | Cmaq_0471 |
Symbol | |
ID | 5709821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 510368 |
End bp | 511381 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641274974 |
Product | aldo/keto reductase |
Protein accession | YP_001540306 |
Protein GI | 159041054 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
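The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are hypothetical and only illustrate the check.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(510368, 511381)
print(length)                  # 1014
print(protein_length(length))  # 337
```

Both values agree with the Gene Length and Protein Length fields of this record.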
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00023215 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.05091 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTATG TTAAGCTTGG TTGGTCTGGT CTTAAGGTTT CTAAAATAGC ATTCGGTGCA ATGTCCATCG GTGACCCCAA CCTACAGAGT CATGGTAGCT CCAGTTGGGT TGCCGGTAGG GATCAAGCCC TTAAAGTCCT TAAGAGGGCT TGGGATTTAG GCATAAACTT CATTGATACT GCTAATGTTT ATTCGAGGGG TAGGAGTGAG GAGATTGTTG GTGAATTGGT TAAGGGTATG AGGGAGGATG TGGTTATCGC CACTAAGGTT TTTGGGCAAA TGGGTAATGG TCCTAATGCT AGGGGTTTGT CTAGGAAGCA TATTATGTGG CAGGTTAGGG AGTCTTTGAG GAGGTTGAAT ACTGGTTACA TTGATCTCTA CCAGATTCAT AGGTTTGATT ACGATACACC CATTGAGGAG ACTTTATCCA CATTAACAGA CCTAGTACAC CAAGGATTAG TAAGGTACAT TGGGGCATCA AGCATGTGGA CATGGCAATT CGCAAAAATG ATATACACAG CAGAAATGAA AGGATACGAG AAGTTCGTAA GTATGCAAAA CGTCTACAAT CTACTCTACA GGGAGGAGGA GAGGGAAATG ATACCATTCT GCAAGGCTCA TGGAATCGGT ATAATACCAT GGAGCCCAAC TGCAGCTGGT ATACTGTCAG GTAAGTACTA TAAGGATGGT AAGATAATTG TGCCTGAAAC CGAGACTAGG GTTAGGCCAG GTAGTGGTGA TTATAGGATT TATGTGGAAC CCCCTGAGAA CGCTGAGATA CTGAGGAGGG TTATTGAGGT TGCTAATAAT AAGGGAGCGA CTCCAACGCA AATAGCGTAC GCATGGCTAC TGCATAAGGG TGTTACAGCA CCAATAATAG GAACCACTAA GCCGGAGCAC GTGGAAGAAG CTGTGAATGC TATTAGTATT AAGTTAACTG ACGATGAGGT AAAGTACCTT GAGGAACCAT ATAAGCCTAA GCCAGTACTT CACATACCTC CACCACCCAT GTAG
Protein sequence | MEYVKLGWSG LKVSKIAFGA MSIGDPNLQS HGSSSWVAGR DQALKVLKRA WDLGINFIDT ANVYSRGRSE EIVGELVKGM REDVVIATKV FGQMGNGPNA RGLSRKHIMW QVRESLRRLN TGYIDLYQIH RFDYDTPIEE TLSTLTDLVH QGLVRYIGAS SMWTWQFAKM IYTAEMKGYE KFVSMQNVYN LLYREEEREM IPFCKAHGIG IIPWSPTAAG ILSGKYYKDG KIIVPETETR VRPGSGDYRI YVEPPENAEI LRRVIEVANN KGATPTQIAY AWLLHKGVTA PIIGTTKPEH VEEAVNAISI KLTDDEVKYL EEPYKPKPVL HIPPPPM
| |
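The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "ATGGAGTATG TTAAGCTTGG"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.4
```

Applying the same two functions to the full Gene sequence field would allow the GC content value in the record to be rechecked.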