Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtur_0321 |
Symbol | |
ID | 7083144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dictyoglomus turgidum DSM 6724 |
Kingdom | Bacteria |
Replicon accession | NC_011661 |
Strand | - |
Start bp | 330218 |
End bp | 333001 |
Gene Length | 2784 bp |
Protein Length | 927 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643457428 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_002352255 |
Protein GI | 217966749 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATTG AAGGAAAAGT AAAAGAGCTT GTATCTCGCT TAACCCTTGA AGAAAAGATA AAACTTCTTC CCACGAGACA AGCAGAAATT CCAAGACTTA ACATTCACGA ATTTTATATT GGAGGAGAGG CAGCTCACGG GGTTGCTTGG CTTGGAAAAG CCACAGTTTT TCCTCAGCCT ATTGGACTTT CCTCCTCCTT TGATAGAGAT CTTATGAGAA AGATAGGAGA GGTCGTATCT CAAGAAGCAA GGGCCTATTA TTACATGAGG GGTAAAATCG GAGGACTTAT GCTTTGGGCT CCAACAGTAG ACATGGGGAG AGATCCTCGT TGGGGAAGAA CAGAGGAATG CTATGGAGAG GATCCTTTTC TTGCCTCAGA GATGGCAGGT GCATATATAC AAAGTATGCA AGGAGAGGAC CCCGTATATC TTAAAACTGC TATGACTCCA AAACACTTTT TTGCCAATAA CAATGAAAAG GATAGAGATA AATTCTCAGC CAATATAGAT CCGAGAAATA TGTATGAGTA CTATCTTGAG GTATTTAGAA GGGTTATTGA AAAATACAAA GCCCAATGCA TCATGACAGC ATACAATGCG GTAAATGGAA TCCCCTGTAT TATAAATCCT ATAGTAAAAG AGGTGGTAAA AGAAAAATTT GGCCTTGAAG GATGCGTAGT TACCGATGCG GCAGACTTTA GCCAAACAGT TACCTCTCAT AAAACCTTTG AGAATCACTA TGAAACCCTT GCCTATGCCT TAAAAGCTGG AATTGACGCC TTCACCGATG ATCCTAATCT GGTAATAGAA TCTGCATGGC AAGCCTTAGA AAAAGGATTA ATAACTGAGG AAGATATAGA TAAAGCCATA TCAAACTCTC TCAAAGTAAG ATTTAGATTA GGAGAGTTTG ATGAGGAGAT AAGTAAGAAA TTCTATGTTC CTCCTACAAA GATATGTGAT AAGGAACATT CTCAACTTTC CTATCTGGCG GAATTAAAAT CTGTGGTACT TCTTAAAAAC GAAAACAAAT TTTTACCCAT CAAAAAAGAT AAAATAAATA AAATTGCTGT AATTGGACCT CTTGCCCACA AAAACTATAA TGATTGGTAT AGTGGAACTT ATCCATATAA GGTATCCATA TTACAAGGAA TTATAAATAG ACTCTATGAC AAAGAAATTC TATACCATGA CTCCTACGAT ACTGTGGCAA TAAAATCGGC TAAAAGCAAT AGATATCTAA GAGTTATAGA ATCAGAGATA AGCCCTATTT GGGCAATAAG CGATAAAATA ACCGAAAGAG AAAGTTTTAA ATACATAGAT TGGGGATGGG GAAAGAAAAG CCTTCAAGGA TTAGCAAATA GGAAATACTT AACCTGTGAT GATAGCACTT CAGCCATATT TTCTGCATCA GAAGAAGTAT TTGGATGGTT TGTAAAGGAA CTTTTTAACA TTGATCCTCT TGGAGATGGA ACTTACCTCA TTAAAACCTG GAATGGAAAA TATGCCTATA TAGATGAGAG AAAAGGAAAT GTACTAAAGT TTAAAGATAC CTTTGAAAAT CTGCCCGAAG AAAAATTCAT TATTGAGAAG ATAGAAAAAG GTATAGAGAA ATCTTGTGAG ATTGCTCAAA ATTCAGATCT GGTAATTTTA TGCGTGGGCA ATAATCCCAT GGTAAATGGA AGAGAAGATG AAGATAGAAT TGACATAGTC CTTCCTGAAC ATCAGGAAAA TTTAGTAAAA GAAGTATTCA GGGTAAATCC CAATGTAGTA TTACTTATAA TAAGTAGTTA TCCTTATGCC ATTTCTTGGG AAAAGGATCA TATTCCTGCC ATTCTCTGGT CTTCCCATGG AGGGCAAGAA ATGGGAAATG CCATAGCGGA TATTCTCTTG GGAAACTTCT CCCCTGCAGG AAGACTAAGT ATGACCTGGT ATAAATCTAT TCATGACATT CCACCTATTA CTGACTATGA TATTATCAGG GGTAAGAGAA CTTATATGTA TTTCGACAAA GAGTCTCTCT TCCCCTTTGG ACATGGTTTA ACTTATACAG AGTTTACATA CAAAAATCTA GTTTTAAACT CAAAAAACTT TAAATTAAAC GAGGAGATAA AAATCAGTTT TGAAATTGAG AATACTGGCG ATATGGACTC CGATGAAGTA CCACAAGTAT ATGTGAGAGC CCTAAATTCA AAAGTCAAAA GACCTAATAT ACAACTTAAA GGATTTACAA GAGTCTTTAT TCCAAAAGGA GAAAAAAGAA AAATTGAGAT AACTATTCCT GTATCAGATC TTTTCATATG GGACGTAAGG CAAAAAAGAT ATCTTGTAGA AAAAGGAGAA TATGAAATTC TTATTGGAGC ATCATCTAAG GATATTAAAT TAAGGGATAA AATTTATGTG GAAGGAGAAG AGATTAAAAA TAGAGACCCT TTTAATAAAA CAAAAGCCTT TAATTTTGAC GATTGTTATA ATGTATCTTT TAATACAAAG GGAAATTTCA AAGAAACTTA TGTGATCTTT AATGATAACA ACTCTTATAT TCTCTTCAGA GATCTGGAGT TTTACAGTAG GCCTAAAAGG CTTGTCATGG AAATTTCTTC AAACTATTCC AAAATAATTT TAAATTTTAG CAAAATAGAT AAAGAATTTT CTTTTGAAAC TTTGGATACT AATAATGAAT GGAGAGAAAT CGTTTATCTT TTGGATGAAA ATATTGAGGG TGTTCATGAC CTATACATAA GGGGAGAAAA AGGATTAAAA ATAAACTGGT TTAGATTTGA ATAG
|
Protein sequence | MNIEGKVKEL VSRLTLEEKI KLLPTRQAEI PRLNIHEFYI GGEAAHGVAW LGKATVFPQP IGLSSSFDRD LMRKIGEVVS QEARAYYYMR GKIGGLMLWA PTVDMGRDPR WGRTEECYGE DPFLASEMAG AYIQSMQGED PVYLKTAMTP KHFFANNNEK DRDKFSANID PRNMYEYYLE VFRRVIEKYK AQCIMTAYNA VNGIPCIINP IVKEVVKEKF GLEGCVVTDA ADFSQTVTSH KTFENHYETL AYALKAGIDA FTDDPNLVIE SAWQALEKGL ITEEDIDKAI SNSLKVRFRL GEFDEEISKK FYVPPTKICD KEHSQLSYLA ELKSVVLLKN ENKFLPIKKD KINKIAVIGP LAHKNYNDWY SGTYPYKVSI LQGIINRLYD KEILYHDSYD TVAIKSAKSN RYLRVIESEI SPIWAISDKI TERESFKYID WGWGKKSLQG LANRKYLTCD DSTSAIFSAS EEVFGWFVKE LFNIDPLGDG TYLIKTWNGK YAYIDERKGN VLKFKDTFEN LPEEKFIIEK IEKGIEKSCE IAQNSDLVIL CVGNNPMVNG REDEDRIDIV LPEHQENLVK EVFRVNPNVV LLIISSYPYA ISWEKDHIPA ILWSSHGGQE MGNAIADILL GNFSPAGRLS MTWYKSIHDI PPITDYDIIR GKRTYMYFDK ESLFPFGHGL TYTEFTYKNL VLNSKNFKLN EEIKISFEIE NTGDMDSDEV PQVYVRALNS KVKRPNIQLK GFTRVFIPKG EKRKIEITIP VSDLFIWDVR QKRYLVEKGE YEILIGASSK DIKLRDKIYV EGEEIKNRDP FNKTKAFNFD DCYNVSFNTK GNFKETYVIF NDNNSYILFR DLEFYSRPKR LVMEISSNYS KIILNFSKID KEFSFETLDT NNEWREIVYL LDENIEGVHD LYIRGEKGLK INWFRFE
|
| |