Gene Information |
Locus tag | Oter_3221 |
Symbol | |
ID | 6204389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Opitutus terrae PB90-1 |
Kingdom | Bacteria |
Replicon accession | NC_010571 |
Strand | - |
Start bp | 4204588 |
End bp | 4207575 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641692888 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001820101 |
Protein GI | 182415035 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
| |
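The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon of the CDS is a stop codon that is not counted in the protein length:

```python
# Values copied from the Gene Information fields above.
start_bp = 4204588
end_bp = 4207575
gene_length_bp = 2988
protein_length_aa = 995

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)  # prints: 2988 996 995
```

Under these assumptions the computed span matches the reported gene length, and the codon count minus one matches the reported protein length.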
Plasmid Coverage Information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.179721 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCTG AGCCGGCGTT CACCCCGCTC GCCGGGGAAG CCCGCCGGCA CCGACGCCTC GCTGTGACCT CCACATCCCG CACGAAATTC GTACTGCTCA TCGGGTGCCT GTTGCGCCTC GGCACCGCGC TGCACGCCGC CGCCGGCGTT GACGCAGCCG GCCAGCTGGT GCCGGTCGTA TCCCAACACG AAAACGGCTT CGCTTTCGCG CGCCTCGACC GCGGCGTCCA GTTCCGCGTC GGCGACGTCA CCACCAACGT CCTGCTCTAC GGCCCGTCGA TCGTCCGCGT GAATGCGAAC CTCGGCCAGG CGCACACCAC GCAGCCCAGC CTCGCGGTGG TCGCCCAACC CGCCACGGTA ACCTTCACCG TGGAGGACTC GCCGGAGGCG CTCACGATCC GGACGGCAAA ACTGAGCGTC ATCGTCGCGA AAAAATCCGG CGCGCTGACC TTCCTCGGCT CCGACGGCCG GCTGCTCACC CGCGAACGCG CCGCCGCTCC GGACGAGATC AAGGAGGTCA CGATCTCTGG CGCGCCCACG TACGAAATCA GTCGCACGTT CACGCTCGCG CCGGACGAAT CGCTCTACGG TCTCGGCCAG TACAACCGGC CGTACATGGA TTATCGCGGC CAGGAAGTCC TCCTCGTTCA GACAAACATC GGGATCGTCG TTCCGTTCCT GATCTCCACC CAACGCTATG GGGTCCTCTG GGACATCTAC TCGAAGATGA CCTTCAAGGA CGACGTCTCC GGTGCCACGC TCTGGGCGGA AAGCGCGCCG GCGGGCGTCG ACTACTATTT CATCGCCGGC GACACGATGG ACGGCGTGAT CGCCGGTTAC CGCACGCTCA CCGGCGCCGC GCCGATGCTT CCCAAGCAGG CGTTCGGCCT GTTCATGAGC AAGGAGCGCT ATCCCACGCA GGAGCGGCTG CTCGAGGTGG CGCGCACCTT CCGCCAGGAA GGGTTCCCCC TCGATTACAT CGTGCAGGAT TGGCAATACT GGGGCGGCGC CGATGGCACG TGGAGCGGCA TGACGTGGAA CCAGGAGCGG TTCCCGGATC CCGCGGCGCT CACGAAAACG CTGCACGAGG AGCTGCGCCT GAAGCTCATG GTCTCCATCT GGCCGTCGGT CGGCAACAAC ACGGCCCTGG CGCGCGAGCT GGACGCGAAG GGGCTGCGCT TTGCCCCGCT GCACTGGATT TCGAAGAACG CGCGCGTCTA CGACGCGTTC GGCCCCGAGG GCCGTGCGAT CTACTTCAAG CACATCAAGC AAGGACTGCT CGACGTCGGC GTCGACGCAC TCTGGATGGA CGGCACCGAA GTCGAGGTCG GCACCGCCTG CCACGATCCC GCCGCCGTCG AGCGTGACAT CAAGAACCTC GGCCGCAACG CGCTTGGCGA CTTCACGCGC TACCTGAATC CCTACAGCCT CGAAACCACC CGCGGAACTT ACGAGGGCCA GCGCGGAACG AGCGACCAGC GCGTCTTCAC GCTCACCCGC TCCGCCTGGG CCGGCCAGCA GCGTTACGCC GCGCTGCCCT GGTCGGGCGA CACGACCGCG AGTTGGGAAA CGCTCCGCCA TCAGATCGCC GGCGGCATCA ACATCGCCTT CGCCGGCCTG CCTTACTGGA CGCAGGACAC CGGCGGCTTC TTCGTCAACT ATCCGGACGG CGAGCGAAAC CCGGAATATC AAGAGCTCTA CGCCCGCTGG AACCAGTTCG CGATCTTCAA CCCGATCTAT CGGATCCACG GGACCAATAT CGAGCGCGAG CCGTATCGCT TCAAAGCCTT CGCGCCCGCG 
ATCTACGACT CACTTCTCTC CGCGGTCCAG CTCCGCTACG CCCTGCTGCC CTACCTCTAC TCGCTCGCGT GGCAGACGAC GGCGCACGAC TACACGATGA TGCGCGGGCT GCCGATGGAT TTCCCGGACG ATCCGGCGGT GCGGAAAACC GACGATGCCT TCATGTTCGG TCCCGCGTTT CTCGTGCATC CGATCACGCA CGCGATGTAT CACGTCAGCG CGCCGCCCGC CGCCACGATT CCGGCCGAGG CGCTCCGCAC GCCCGACGAC CAGCCGGGGC TCGCCGTGCA GTATTTCGCC GGCGTTGATT TTGGCCGCGC CGTCAGCACG AGCGTCGATG AAAAGGTCGA GCACGCCTGG CCCGGTCCGC CGCTTGCCAA TCCGCCGTCC GGTCTCGACG ACTTCGACAA CTTCTCCGCG CGCTGGACCG GCACCGTGAC CGCGCCCGAG GCTGGCGACT ACGAATTCGG CGTCGAATAC GACGACGGCG CGCGCCTTTA CCTCGACGGC AAACTGCTCG TCGATGACTG GAGCTACGGC GCCAAGCGCT ACCGCAGCGC TCGCCTCACC CTCGCGGCCG GTCAGCAGGT CGCCGTGAAG GCGGAGTTTC ACCAGGGCGG ACAGGAGCGC TACTTCCGGC TCGGCTGGCG CACGCCGAGC GAGAGTCGCG CCCTCGCCGC AGCCAGGAAG GAACTCGACA ACACAATGTC GACCTATCTG CCCGCTGGCG CAGCGTGGTA TGACTTCTGG ACCAACGAGC GCTTCGCGGG TGGCGCCACG GTCACGAAGG CCTGTCCGCT CGACACGTTC CCGGTTTACG TTCGCGCCGG CTCGATCGTG CCGATGGGCC CGGCCGATCT GCAATATGCG ACCGAGCGGC CTGACGCGCC CTACACGATC CGCATTTATC CCGGCGCGAA CGCGACCTTC ACGCTCTACG AGGACGACAA CGAAACCTAT GCCTACGAGC GCGGTGAGCG CGCGACCTAC GAGCTCACCT GGGACGATGC CGCCCAGACG CTCCATCTCG GCGGTCGCCA GGGTTCGTTC CCCGGTCTGG TCGCGCAACG CCAACTCGAG CTCGTTCTCA TCGGAGCGAA AACCCCGTCT GCTCCGACGA TCATCACCTA CACCGGCCAC CCGATGGCCG TTAGCCTCGC CTTCAATGAA AGCCTCCTCG CTCAATAA
|
Protein sequence | MTPEPAFTPL AGEARRHRRL AVTSTSRTKF VLLIGCLLRL GTALHAAAGV DAAGQLVPVV SQHENGFAFA RLDRGVQFRV GDVTTNVLLY GPSIVRVNAN LGQAHTTQPS LAVVAQPATV TFTVEDSPEA LTIRTAKLSV IVAKKSGALT FLGSDGRLLT RERAAAPDEI KEVTISGAPT YEISRTFTLA PDESLYGLGQ YNRPYMDYRG QEVLLVQTNI GIVVPFLIST QRYGVLWDIY SKMTFKDDVS GATLWAESAP AGVDYYFIAG DTMDGVIAGY RTLTGAAPML PKQAFGLFMS KERYPTQERL LEVARTFRQE GFPLDYIVQD WQYWGGADGT WSGMTWNQER FPDPAALTKT LHEELRLKLM VSIWPSVGNN TALARELDAK GLRFAPLHWI SKNARVYDAF GPEGRAIYFK HIKQGLLDVG VDALWMDGTE VEVGTACHDP AAVERDIKNL GRNALGDFTR YLNPYSLETT RGTYEGQRGT SDQRVFTLTR SAWAGQQRYA ALPWSGDTTA SWETLRHQIA GGINIAFAGL PYWTQDTGGF FVNYPDGERN PEYQELYARW NQFAIFNPIY RIHGTNIERE PYRFKAFAPA IYDSLLSAVQ LRYALLPYLY SLAWQTTAHD YTMMRGLPMD FPDDPAVRKT DDAFMFGPAF LVHPITHAMY HVSAPPAATI PAEALRTPDD QPGLAVQYFA GVDFGRAVST SVDEKVEHAW PGPPLANPPS GLDDFDNFSA RWTGTVTAPE AGDYEFGVEY DDGARLYLDG KLLVDDWSYG AKRYRSARLT LAAGQQVAVK AEFHQGGQER YFRLGWRTPS ESRALAAARK ELDNTMSTYL PAGAAWYDFW TNERFAGGAT VTKACPLDTF PVYVRAGSIV PMGPADLQYA TERPDAPYTI RIYPGANATF TLYEDDNETY AYERGERATY ELTWDDAAQT LHLGGRQGSF PGLVAQRQLE LVLIGAKTPS APTIITYTGH PMAVSLAFNE SLLAQ
|
| |
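The sequences above are displayed in space-separated blocks of ten residues, so the grouping has to be stripped before any analysis. The following is a minimal sketch of that step together with a GC-fraction calculation. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers introduced for illustration, and the sample is only the first three blocks of the gene sequence, not the full gene:

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence shown above.
sample = clean_sequence("ATGACCCCTG AGCCGGCGTT CACCCCGCTC")

print(len(sample))          # number of bases after removing the grouping
print(gc_fraction(sample))  # GC fraction of this 30-base sample only
```

Applying the same two helpers to the complete gene sequence would give a value that can be compared with the 66% GC content reported in the record; the 30-base sample is too short to be representative.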