Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Oter_0992 |
Symbol | |
ID | 6205615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Opitutus terrae PB90-1 |
Kingdom | Bacteria |
Replicon accession | NC_010571 |
Strand | - |
Start bp | 1219159 |
End bp | 1221957 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641690615 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001817880 |
Protein GI | 182412814 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0295666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0129496 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACACC CCGTGCTCCC GCGGTGCCTC GCACCCTTGC TCTCCTGCGC GGTTTTGCTC ACCGCCCTCC TGGTCCTGCC TCACTCCCTC GCTGCGGCCG CCGCAGTTCC GGCCGCCGGC CGCGAGCGAA TCCTCCTCGA CGCCGGCTGG CGGTTCGCAC TCGGTCACGC CACGGATCCC GCGCGCGACT TCGGTCACGG CACCGGCTAC TTTTCCTATC TCGCCAAAAC CGGTTTCGGG GACGGTGCCG CGAGTCCGGC GTTCGACGAC CGCACGTGGC GCCAGCTCGA CCTGCCGCAC GACTGGGCCG TCGAAGTGCC GTTCGATCCG CGCGGCAGTC ACAGCCACGG CTACAAGGCG GTCGGCCCGC GTTTCCCGGA GCGCAGCGTG GGCTGGTATC GTCGCAGCTT CCACGTGCCC GAGTCAGATC TCGGCCGGCG AATCCGGCTG GATTTCGACG GCGTCTTTCG CGCCGCGCGT GTGTTCGTGA ATGGGTTTTT TGTCGGCGAG GAGCCGAGCG GCTACCTCGG TGCGAGCTAC GACGTTTCCG AATATCTGAA CTACGGCGGC GACAACGTCA TCGCCGTGCG GGTCGACGCG TCGATGGAGG AGGGTTGGTT CTATGAAGGC GCCGGGATCT ACCGGCACGT TTGGTTGGTG AAGACCGCGC CGCTTCACGT CGCGCTCGAT GGCACATGGG TCCGTACCGA CGTTTCGCCG AAGTTCGCCA CCGTCACGAT CGAGACGCGC GTGAACAACG CCGGCCGCGC GCCCGCTGAC TACACGCTCG AGCAGGAAAT CTTCGGCCCC GACGGCAAGT CGCTCGCCCG TTCCAGCGCC GCGGCGCCGG CCGTCGCGCC CGGCGGCGTT GGCGTGCATC GCGATTCGCT TCGCGTCAAC GCCCCACAGC TCTGGTCTCT CGAGTCGCCG ACGATGCACC GCGTCGTGAC CACGATCCGT CAGGCCAACG CCATCGTCGA TCGCTACGAA ACCCCGTTTG GCATTCGCAC GATCCGGTTC GATCCGAACC GGGGATTTTT CCTGAACGGG CAGCGCGTCG TGCTGAAGGG CACCAACAAT CACCAGGACC ACGCCGGGGT CGGCGCGGCG ATTCCAGATA CGTTACAGGA ATTCAGGATT CGCCGCTTGA AGGAAATGGG CAGCAACGCC TATCGCGCTT CGCACAATCC TTCGACCCCC GAGTTGCTCG ATGCCTGTGA TCGCCTCGGC ATGCTCGTCA TCGAGGAAAA CCGGTTGATG GGCATCAATC CTTACCACCT CGGCCAGCTC GAGCGAATGA TCCGGCGCGC GCGCAATCAC CCGAGCATCA TCCTCTGGTC GCTCGGCAAC GAGGAATGGG GCATCGAAGG CAACATCAAG GGCGCGCGGA TCACCGTGCC GATGCAGGAT TTCGCGCACC GGCTCGATCC GACGCGCCGC ACCACCGTTG CGATCAGCGG CGGCTGGGGC GGCATCTCGA GCACCGTCGA AGTCGCGGGC GTCAACTACG TCCGGCAGGC AAATGTGGAC AAACAACACG CGGAGTACCC CGAGCAGATC ATCGTCGGCA CCGAAGAAAC GACGACGCAG CAGACGCGCG GAATCTATTT CACCGATCGC GAGCGGGCGC ATCTCGCACC ACTCGAGGAC GGCTCGTCCG GCGGTAACTG CGAATTCGGC TGGCGCTACT ACGTCGCCCG GCCGTGGGCC GCCGGGCTGT TCTACTGGAC GGGATTCGAC TATCGCGGTG AACCCACGCC GTTCGGCTAT CCGGCGATCG CCTCGCAGTT CGGCATCCTC GACACCTGCG GTTTTCCGAA AGACAGCTTT TACTACCTGA AATCGTGGTG GACGACTGAG CCGGTGCTGC ACGTGTTCCC GCACTGGAAT TGGGCCGGTC GCGAGGGTCA ACCGCTCGAG GTGCGCGTCC ACAGCAACTG CGGGGAGGTC GAGCTCTTCC TCAACGGTGC GTCACTCGGC CGAAAAACCA TGGAGCCGAA CGGCCATCTG GCGTGGGCGG TCAACTACAC GCCCGGCACA CTGCTCGCGC GCGGTTTCCG CGACGGCAAG GAAATCGCTA CTACGACGGT TGAAACCACC GGCGCCCCGG TCGCGCTCGC GCTGTCGGCC GATCGCCGGG AACTGCGCGC CGACAGCCGC GATGTCGCGG TGATCACGGT CGAAGCCCGC GATGCCGAGG CACGGCTCGT GCCGACGGGA AATGTGCCCG TGACGTTCAC GCTGCGCGGA CCCGGCCGGA TCATTGGCGT CGGCAACGGC GATCCGTCGT CGCACGAGCC CGACCAGTTT GTCGCGAGTG TGCGCGGGAT TAATCTCGGC GAGTGGAACG CGCCCGACGG TTCGGTGAAG ACGGGCCAGA ATGTGTTCGA AGCGACGTTT GATCGCCCCG CGCTCGGCGC AGGCGAGACG ATGACGCTCC TGTTGAATGC GCTCGGGACG AACCAAACCG CCACGTTGAA CGGTGAGCCG CTGGTTCGCG ACGCCGCGCC CGCCCAGGCG AAGATCGAGC TGCCGCTCGC CGCGGACACC CTGCGGCCGA CCGGCAACGT GCTCCGGATC GAAGCGACCC GCTACGAAGA CTGGGGCACG CGCGACAGCC TGAAGCAGCT TTGGCCGGCC ACGCTGCGCA TCGTCACGCC CGCGCCCGCC TGGCAGCGCT CGACGTTCAA CGGACTCGCT CAAGTCATCG TTCAGACCAC CGGCGAGCCG GGCGCGATTG AGCTCGTCGC GACGAGCGAC GGATTGAAGA GCGCCAGCGT CGAACTGACG AGCCGGTGA
|
Protein sequence | MKHPVLPRCL APLLSCAVLL TALLVLPHSL AAAAAVPAAG RERILLDAGW RFALGHATDP ARDFGHGTGY FSYLAKTGFG DGAASPAFDD RTWRQLDLPH DWAVEVPFDP RGSHSHGYKA VGPRFPERSV GWYRRSFHVP ESDLGRRIRL DFDGVFRAAR VFVNGFFVGE EPSGYLGASY DVSEYLNYGG DNVIAVRVDA SMEEGWFYEG AGIYRHVWLV KTAPLHVALD GTWVRTDVSP KFATVTIETR VNNAGRAPAD YTLEQEIFGP DGKSLARSSA AAPAVAPGGV GVHRDSLRVN APQLWSLESP TMHRVVTTIR QANAIVDRYE TPFGIRTIRF DPNRGFFLNG QRVVLKGTNN HQDHAGVGAA IPDTLQEFRI RRLKEMGSNA YRASHNPSTP ELLDACDRLG MLVIEENRLM GINPYHLGQL ERMIRRARNH PSIILWSLGN EEWGIEGNIK GARITVPMQD FAHRLDPTRR TTVAISGGWG GISSTVEVAG VNYVRQANVD KQHAEYPEQI IVGTEETTTQ QTRGIYFTDR ERAHLAPLED GSSGGNCEFG WRYYVARPWA AGLFYWTGFD YRGEPTPFGY PAIASQFGIL DTCGFPKDSF YYLKSWWTTE PVLHVFPHWN WAGREGQPLE VRVHSNCGEV ELFLNGASLG RKTMEPNGHL AWAVNYTPGT LLARGFRDGK EIATTTVETT GAPVALALSA DRRELRADSR DVAVITVEAR DAEARLVPTG NVPVTFTLRG PGRIIGVGNG DPSSHEPDQF VASVRGINLG EWNAPDGSVK TGQNVFEATF DRPALGAGET MTLLLNALGT NQTATLNGEP LVRDAAPAQA KIELPLAADT LRPTGNVLRI EATRYEDWGT RDSLKQLWPA TLRIVTPAPA WQRSTFNGLA QVIVQTTGEP GAIELVATSD GLKSASVELT SR
|
| |