Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3867 |
Symbol | |
ID | 4074930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008042 |
Strand | - |
Start bp | 119769 |
End bp | 122123 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 638004524 |
Product | Alpha-glucosidase |
Protein accession | YP_611259 |
Protein GI | 99078000 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.191857 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTGTC TTAAAATCTG GGCTTTGGAA GCGGAAACCA CCACAGGTGT GATTTTGCGC GTCGAAGGCC GCCACCTACT GCATATATCT GTACTGGAGG AAAACAGGTT TCGCGTATCT CTGCAAAAAG ACGCAGAGTG GCGCTTGGAG CGAACATGGA CTGTGGCGCC TGCCGGGGAT GCCCCATGGG AGGGCCGTCG GCGCGAAGAC ATCTCTGGCT TTTCCTGCCC GCCCCCGCAG CTCTCGCAAA ACGAGACTGA ACTCGTTTTG TCGACTGAGA CAATGCGGCT TCGTATTTTG GCACCGCTAC AGATGGTGTG GGAGGCCAAG GTGAACGGAG CATGGAAAGC CTTCGCCGAG GATCGCCCAA CCGGAGGCAT TCACTTGGGC CTGCGGGATC ATGCCCATGC GCATTTTCTA TCCCGACACC CGCAAGAACC GGTTTTTGGA CTTGGCGAAA AGACAGGCCC GCTTAACCGT GTTGGTCAGC GCTACGAAAT GCGCAACTTA GATGCGATGG GTTACGATGC TGAGCGCACT GACCCACTCT ACAAGCACGT GCCCTTCACT CTGACGCGGA CACAAACTGC CGGGTGTTGG TCCATCTTTT ACGATAACTT GGCTAGTTGC TGGTTCGATT TCGGCAATGA ATTGGACAAC TACCACGCCC CTTATCGCGC CTATCGTGCC GAGGATGGCG ATCTCGATTT TTATATGACA TGGGCGCCTG AGTTGCTCGA CTTGGTCAAA CAACAAGAGC GATTGACCGG CGGCACCGCC TTTCCGCCGC GTTGGAGCCT TGGGTACTCT GGGTCAACAA TGTCCTACAC AGATGCACCG GACGCACAGG CGCAGCTGGA AGGTTTTCTA ACTAAGATTG CAGAGTATCG TATCCCTTGT GACAGTTTTC AGATGTCGTC GGGCTACACC TCGATAGGGC CGAAGCGGTA TGTGTTCAAC TGGAATGATG AGAAGGTGCC CGACCCTGCG GCTATGGCGG CAAAATTTGC CGATAAGGGG GTGCATCTCA TTGCCAATAT CAAACCTTGC CTGCTGCAGG ACCATCCGCG TTACGCCGAA GTCGCCAACG CAGGGTTGTT CGTGCAGGCG AGCGCAGAAG CGGACACGAA CTGCGGACCA GAACGCTCGG TCTTTTGGGA CGACGAAGGC TCGCATCTCG ACTTTACTAA CCAAGCGACT GTGGATTGGT GGAAGCAAAA TATCGGCGAG GCGCTGTTGC AGCGCGGTAT AGGCTCAACC TGGAACGATA ACAACGAATA CGAAATCTGG GACCGACACG CTCAATGCGC GGGTTTCGGT AAGGCAATCG ACATGTCGCT AATGCGTCCT GTAATGCCCA TACTTATGAC GCGCGCATCA ATGGAGGCGC AGGAGGCCCA TGCACCGGAG AAGCGGCCCT ACCTAATCAG TCGATCAGGC GCCCCGGGAT TGCAGCGTTA TGCCCAGACT TGGAGCGGAG ACAATCGCAC AGATTGGAAG ACGCTGCGCT GGAACCAGCG GATGGGCCTC GGCATGAGCA TGTCTGGCTT TTATAACATT GGACATGACA TCGGTGGCTT CTCTGGGCCG CGGCCAGAAC CGGAGCTGTT TGTGCGCTGG GTCCAAAATG GTGTGTTCCA TCCTCGTTTT ACCATTCACT CATGGAATGA TGATGCGACA GTAAACGAGC CTTGGATGTA TCCTGAGGTG ACGGACCACA TCCGCGCGGC GATAGAATTG CGGTACCAAC TTCTGCCTTA TCTCTACACC TGCCTGTGGC AGGCGGCCGA GCGCAGTGAG CCGATGCTAC GACCTTTGTT CCTTGATTTT GGTGCTGACC CTCAAGCTTG GGAGGAGAGC GATAGCTTTC TTCTCGGGCG GGATCTCTTA GTGGCGACAG TCTTGGACAA GGGTGTTGAT GCGATCTCGG TCTATTTACC GCGGCATTCC GGGGGCTGGT GGGATTTCCA CACTGGTCTT TGGCACGAAG GCGGACAATG GCTCACCGTG CCTGTCGCTC TTGATACCAT TCCGCTGTTT ATCCGCGGGG GCGCCGTGGT TCCGATGGGC CAAGGGGCAG ACCGCGCAGC ACCCGAGATG GAGATCGCGC GACTTTTGGC TGTGTTCCCC GCGCAAGGCG TACAGGAAAC GACCAGCCTC CTCTATGAGG ATGACGGTGT GACAAAGACG GGTAAGTGGT GCCTGTCTCA TCTCGTCTTG AACAGCACCA ATGACCACAT CAGCCTGATC TCGAAACGTG AGGGTAATGG CGCGCCGGTT TTACCAGAGG CACTGGTGGT CCTGCCGGCT GGTGAACGAC GCATTCTGAA GACGGGAACG AGGATCGACT TATGA
|
Protein sequence | MKCLKIWALE AETTTGVILR VEGRHLLHIS VLEENRFRVS LQKDAEWRLE RTWTVAPAGD APWEGRRRED ISGFSCPPPQ LSQNETELVL STETMRLRIL APLQMVWEAK VNGAWKAFAE DRPTGGIHLG LRDHAHAHFL SRHPQEPVFG LGEKTGPLNR VGQRYEMRNL DAMGYDAERT DPLYKHVPFT LTRTQTAGCW SIFYDNLASC WFDFGNELDN YHAPYRAYRA EDGDLDFYMT WAPELLDLVK QQERLTGGTA FPPRWSLGYS GSTMSYTDAP DAQAQLEGFL TKIAEYRIPC DSFQMSSGYT SIGPKRYVFN WNDEKVPDPA AMAAKFADKG VHLIANIKPC LLQDHPRYAE VANAGLFVQA SAEADTNCGP ERSVFWDDEG SHLDFTNQAT VDWWKQNIGE ALLQRGIGST WNDNNEYEIW DRHAQCAGFG KAIDMSLMRP VMPILMTRAS MEAQEAHAPE KRPYLISRSG APGLQRYAQT WSGDNRTDWK TLRWNQRMGL GMSMSGFYNI GHDIGGFSGP RPEPELFVRW VQNGVFHPRF TIHSWNDDAT VNEPWMYPEV TDHIRAAIEL RYQLLPYLYT CLWQAAERSE PMLRPLFLDF GADPQAWEES DSFLLGRDLL VATVLDKGVD AISVYLPRHS GGWWDFHTGL WHEGGQWLTV PVALDTIPLF IRGGAVVPMG QGADRAAPEM EIARLLAVFP AQGVQETTSL LYEDDGVTKT GKWCLSHLVL NSTNDHISLI SKREGNGAPV LPEALVVLPA GERRILKTGT RIDL
|
| |