Gene Information |
Locus tag | Dtur_1799 |
Symbol | |
ID | 7082978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dictyoglomus turgidum DSM 6724 |
Kingdom | Bacteria |
Replicon accession | NC_011661 |
Strand | - |
Start bp | 1837005 |
End bp | 1838345 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643458909 |
Product | beta-galactosidase |
Protein accession | YP_002353685 |
Protein GI | 217968179 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
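
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1837005
end_bp = 1838345

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1341 bp) and Protein Length (446 aa).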
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000188458 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTAAGT TGGTTTTTCC TAAGGACTTT TTATGGGGAA CAGCGACAGC ATCTTATCAG ATAGAAGGGG CTTGGAATGA GGATGGTAAA GGAGAAAGTA CTTGGGATAG ATTTTCCCAT ACTCCTGGAG CAATATATCA AAATCAAAAT GGAGATGTAG CATGTGATCA TTATCACCGC TATGAGGAAG ATGTAAAGCT CATGGCTGAA ATAGGACTTA AGGCTTATAG GTTTTCAATT TCTTGGCCCA GAATATTTCC CGAAGGAAGA GGAAAGATTA ATCCTAAGGG TGTCTCCTTT TATGAAAGAT TAATTAATAA ACTTCTTGAG AAAAATATTA AGCCAGCTAT AACTTTGTAT CATTGGGATC TTCCTCAAGC TCTTGAAGAT AAAGGGGGAT GGCTAAATAG GGATACCGCA AAGTACTTCT CAGAATATGC AAGCTTTATT TTTTATAAAT TTGGGGATAT GGTGCCTATA TGGATCACCT TAAACGAGCC CTTTGTTAAT GCTTTTCTTG GTTATGCATG GGGGTGGCAT GCTCCAGGTA AAAAAGATCT TAAGGGTGCT TTTGTGGCTG GGCATAATCT TCTTCTTGCT CATGGTCTTG CAGTTCAGGC ATATAAAGAG GGAGGATATA ATGGAAATAT TGGAATTACC ATAAATGTTG CAGCAGTTTA TCCTTATACT AATTCTGAGG AAGATTTGAG GGCAGTACAA GTGCAAGATG CTTTTGAGAA TAGATGGTTT ATTGAGCCTA TTTTTAGGAA GAAATATCCA GAAGTAATAT GGAAGATCTT AGAGAAAAAT TATTTGAGCT TTGATTTTCC TATCTCCGAT TTTGATATTA TATCCTCTCC TATAGATTTT TTGGGTATAA ACTATTACAC TAGAAACATT GTGGCTCATG ACGAGAGTAA TAAATTTTTA GGTCTAAAAA GAATAGAGGG GCCCAATGAA CGTACAGAGA TGGGATGGGA AATATATCCT GATGGGCTAT ATGACATTCT TATTCAGCTT TATAGGGATT ATAAAATTCC TATTTATATC ACTGAGAATG GAGCAGCTTA TAATGATAAA TTAGAGAATG GAAAGGTAGA GGATAATAAG AGGATAGAGT ACTTAAGAGA ACATATTAAA AGGGCATATT TTGCTATTAG GGATGGAGTA GATTTAAGAG GATATTTTAT ATGGTCCCTT ATGGATAATT TTGAGTGGGC TCATGGGTAT AGTAAGAGAT TTGGAATTAT ATATGTAGAT TATGATACTC AAAAAAGAAT ACTTAAAGAT AGTGCCTACT TTTATAAAAA AGTTATCGAG GAAAACGGAA TAGAGGAGTA G
|
Protein sequence | MVKLVFPKDF LWGTATASYQ IEGAWNEDGK GESTWDRFSH TPGAIYQNQN GDVACDHYHR YEEDVKLMAE IGLKAYRFSI SWPRIFPEGR GKINPKGVSF YERLINKLLE KNIKPAITLY HWDLPQALED KGGWLNRDTA KYFSEYASFI FYKFGDMVPI WITLNEPFVN AFLGYAWGWH APGKKDLKGA FVAGHNLLLA HGLAVQAYKE GGYNGNIGIT INVAAVYPYT NSEEDLRAVQ VQDAFENRWF IEPIFRKKYP EVIWKILEKN YLSFDFPISD FDIISSPIDF LGINYYTRNI VAHDESNKFL GLKRIEGPNE RTEMGWEIYP DGLYDILIQL YRDYKIPIYI TENGAAYNDK LENGKVEDNK RIEYLREHIK RAYFAIRDGV DLRGYFIWSL MDNFEWAHGY SKRFGIIYVD YDTQKRILKD SAYFYKKVIE ENGIEE
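
The relationship between the two sequences above can be illustrated by translating the first 30 bases of the gene sequence. This is a minimal sketch, not the tool used to produce the record: the names `CODON_TABLE` and `translate` are hypothetical, and it relies on translation table 11 sharing its amino acid assignments with the standard genetic code (the two differ in permitted start codons). Although the record lists the strand as "-", the gene sequence shown starts with ATG and ends with TAG, so the sketch assumes it is already given in coding orientation.

```python
from itertools import product

# Amino acid assignments for codons in TCAG order
# (shared by the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # The record groups bases in blocks of ten, so strip whitespace first.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGGTTAAGT TGGTTTTTCC TAAGGACTTT"
print(translate(prefix))  # MVKLVFPKDF
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function would be expected to stop at the terminal TAG codon.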