Gene Dtur_1799 details

Gene Information

Locus tag: Dtur_1799
Symbol:
ID: 7082978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dictyoglomus turgidum DSM 6724
Kingdom: Bacteria
Replicon accession: NC_011661
Strand:
Start bp: 1837005
End bp: 1838345
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 35%
IMG OID: 643458909
Product: beta-galactosidase
Protein accession: YP_002353685
Protein GI: 217968179
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
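
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence whose final codon is a stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1837005
END_BP = 1838345
GENE_LENGTH_BP = 1341
PROTEIN_LENGTH_AA = 446


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1838345 - 1837005 + 1 gives 1341 bp, and 1341 / 3 - 1 gives 446 aa, which agrees with the listed Gene Length and Protein Length.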


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000188458
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTAAGT TGGTTTTTCC TAAGGACTTT TTATGGGGAA CAGCGACAGC ATCTTATCAG 
ATAGAAGGGG CTTGGAATGA GGATGGTAAA GGAGAAAGTA CTTGGGATAG ATTTTCCCAT
ACTCCTGGAG CAATATATCA AAATCAAAAT GGAGATGTAG CATGTGATCA TTATCACCGC
TATGAGGAAG ATGTAAAGCT CATGGCTGAA ATAGGACTTA AGGCTTATAG GTTTTCAATT
TCTTGGCCCA GAATATTTCC CGAAGGAAGA GGAAAGATTA ATCCTAAGGG TGTCTCCTTT
TATGAAAGAT TAATTAATAA ACTTCTTGAG AAAAATATTA AGCCAGCTAT AACTTTGTAT
CATTGGGATC TTCCTCAAGC TCTTGAAGAT AAAGGGGGAT GGCTAAATAG GGATACCGCA
AAGTACTTCT CAGAATATGC AAGCTTTATT TTTTATAAAT TTGGGGATAT GGTGCCTATA
TGGATCACCT TAAACGAGCC CTTTGTTAAT GCTTTTCTTG GTTATGCATG GGGGTGGCAT
GCTCCAGGTA AAAAAGATCT TAAGGGTGCT TTTGTGGCTG GGCATAATCT TCTTCTTGCT
CATGGTCTTG CAGTTCAGGC ATATAAAGAG GGAGGATATA ATGGAAATAT TGGAATTACC
ATAAATGTTG CAGCAGTTTA TCCTTATACT AATTCTGAGG AAGATTTGAG GGCAGTACAA
GTGCAAGATG CTTTTGAGAA TAGATGGTTT ATTGAGCCTA TTTTTAGGAA GAAATATCCA
GAAGTAATAT GGAAGATCTT AGAGAAAAAT TATTTGAGCT TTGATTTTCC TATCTCCGAT
TTTGATATTA TATCCTCTCC TATAGATTTT TTGGGTATAA ACTATTACAC TAGAAACATT
GTGGCTCATG ACGAGAGTAA TAAATTTTTA GGTCTAAAAA GAATAGAGGG GCCCAATGAA
CGTACAGAGA TGGGATGGGA AATATATCCT GATGGGCTAT ATGACATTCT TATTCAGCTT
TATAGGGATT ATAAAATTCC TATTTATATC ACTGAGAATG GAGCAGCTTA TAATGATAAA
TTAGAGAATG GAAAGGTAGA GGATAATAAG AGGATAGAGT ACTTAAGAGA ACATATTAAA
AGGGCATATT TTGCTATTAG GGATGGAGTA GATTTAAGAG GATATTTTAT ATGGTCCCTT
ATGGATAATT TTGAGTGGGC TCATGGGTAT AGTAAGAGAT TTGGAATTAT ATATGTAGAT
TATGATACTC AAAAAAGAAT ACTTAAAGAT AGTGCCTACT TTTATAAAAA AGTTATCGAG
GAAAACGGAA TAGAGGAGTA G
 
Protein sequence
MVKLVFPKDF LWGTATASYQ IEGAWNEDGK GESTWDRFSH TPGAIYQNQN GDVACDHYHR 
YEEDVKLMAE IGLKAYRFSI SWPRIFPEGR GKINPKGVSF YERLINKLLE KNIKPAITLY
HWDLPQALED KGGWLNRDTA KYFSEYASFI FYKFGDMVPI WITLNEPFVN AFLGYAWGWH
APGKKDLKGA FVAGHNLLLA HGLAVQAYKE GGYNGNIGIT INVAAVYPYT NSEEDLRAVQ
VQDAFENRWF IEPIFRKKYP EVIWKILEKN YLSFDFPISD FDIISSPIDF LGINYYTRNI
VAHDESNKFL GLKRIEGPNE RTEMGWEIYP DGLYDILIQL YRDYKIPIYI TENGAAYNDK
LENGKVEDNK RIEYLREHIK RAYFAIRDGV DLRGYFIWSL MDNFEWAHGY SKRFGIIYVD
YDTQKRILKD SAYFYKKVIE ENGIEE
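
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_fraction` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG order; codon-to-residue assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate frame 1 of a DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as displayed in the record.
    print(translate("ATGGTTAAGT TGGTTTTTCC TAAGGACTTT"))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) gives a direct consistency check, and `gc_fraction` can be compared with the listed GC content.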