Gene Dret_0037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0037 
Symbol 
ID8417839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp44682 
End bp46526 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content61% 
IMG OID645036600 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_003196917 
Protein GI258404175 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR02402] malto-oligosyltrehalose trehalohydrolase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTGG TTCAGGAATT GGGCGCCTGG TATGCGCCGG AAGCGACCCG ATTTACGGTC 
TGGGCGCCGT TGCGGTCATC CGTTGAGCTG CGGCTTTTCG ACCCTGAGGA GCGTCGCGTG
GAAATGGCGT TTGCCGCCGG AGACGGCTGC TGGCGGGCCG AGGTGCCCGA TGTGGCTCCC
GGTACACGGT ACTCCTTTGT CCTCGACGGG GACCTCGAAC GGCCGGATCC GGCCTCTTTC
GCCCAGCCCG ACGGGGTTCA CGGTCCCTCG GTCGTCGTGG ACCATTTCTC GGAGACACGG
AGTGCGGCAA GTTGGGCCGG TCCGCAATGG GAGGAGTCGG TTTTTTACGA ACTCCATGTC
GGGACATTTA CGGAGCAGGG GACCTTTGAA GCCATTATCG GCCGACTTCC GTCGCTCAAG
GACCTGGGAG TGACCTGTCT CAGTCTCATG CCGGTGGCCC ACTTCCCCGG TCAGCGCAAC
TGGGGGTATG ACGGTGTCTA TCCCTTCGCA GCCCATACCA CATATGGCGG GGTGCAGGGA
TTGAAACGCT TGGCGGACGC CTGCCACAGT CTCGGCCTGG GGATTGTTCT CGATGTCGTC
TACAACCATT TCGGACCCGA GGGGAATTAT TTGCGCGATT TCGGGCCCTA TTTCACGGAT
GTCTACCACA CGCCCTGGGG GGAGGCGGTT AATTTTGACG GCCCTTACAG CGATGGGGTG
CGCAAATATT GTATTGAAAA CGCGTTGTAC TGGCTTAAGA CCTGTGATCT GGACGGATTA
CGCCTGGACG CGGTTCACGC CATTTATGAT GCTGCGGCCG TGCCTTTTCT CGAAGAGCTC
ACTCGCGAGG TTGACGCCCT TTCGCTGGAC AGCGGTCGCC AGCGGTGGCT CATCGCGGAA
TCCGACCGGA ATGATGCGCG TTTTCTTCGC CCTTCCGAGC AGGGTGGCCT CGGCCTGCAT
GCCCAATGGA ACGATGATTT CCACCATGCC CTGCACGCCT TGGTGACCGG GGAGAGGCAT
GGCTATTATC AAGATTTCGG GCGGGTGGAG GATCTGGGCC AAGCTTGGCG GCAAAATTAT
GTCTACAGCG GGCAGTATTC GCCGTTTCGT AAGCGCCGCC ACGGTAACTG CGCTGTGGAC
CGGGCCAAAT CGCAATTTGT GGTTTGCAGC CAGAATCACG ATCAGGTCGG CAATCGGTTG
CGCGGTGACC GTCTGAGCAC GATGGTCTCC TTCGAGACCC TCAAGGTGGT AGCCGCGGCC
CTGTGTCTGA GCCCCTTTCA GCCTATGCTG TTCATGGGCC AGGAATATGG GGAAAAGCGG
CCGTTTCCCT ATTTCATCGA CCATACCGAT CCAAAGCTTG TGCAGGCTGT TCGCCAGGGC
CGGAAAAAAG AATTTCAGGC CTTTTGGCGT GGCACGCCCC TTGATCCCAA GGCCGAGACA
ACCTTTACCA GCGCGGTTCT CAACTGGGAG ACCCAAGACG AGCGGGCCCG GCAATTGTGG
GCCTGGTACA AGCGCCTCCT TGCCTTGCGC CGGTCGCATC CGGTCCTGGG GCCGGATATG
GCCGTGGCCC GAGAAGTCCG CGAAAGTGCG ACGCCGGCGT GCTTGTGGGT CCATGCCCGG
CGGGCAGAGA CTGAAGTGGT GCTGTACTTC GCCTTTGGGG ACCAGCCGGA AGCGCCGGCG
CCCTGGGAGC TGCCTGAAGG AACGTGGCAG CGGTTGGAGG ACAGCAGCGC CCGTCGTTGG
GGCGGTCCGG GAACGCACGG TCCGGAAGAA CTTTGCCACC CCCAGGCCCC GCAATTCCAC
GCCCGGCAGG TCCAGCTCTG GGAGAGAGTA GCGAGGAAAG AATAG
 
Protein sequence
MNVVQELGAW YAPEATRFTV WAPLRSSVEL RLFDPEERRV EMAFAAGDGC WRAEVPDVAP 
GTRYSFVLDG DLERPDPASF AQPDGVHGPS VVVDHFSETR SAASWAGPQW EESVFYELHV
GTFTEQGTFE AIIGRLPSLK DLGVTCLSLM PVAHFPGQRN WGYDGVYPFA AHTTYGGVQG
LKRLADACHS LGLGIVLDVV YNHFGPEGNY LRDFGPYFTD VYHTPWGEAV NFDGPYSDGV
RKYCIENALY WLKTCDLDGL RLDAVHAIYD AAAVPFLEEL TREVDALSLD SGRQRWLIAE
SDRNDARFLR PSEQGGLGLH AQWNDDFHHA LHALVTGERH GYYQDFGRVE DLGQAWRQNY
VYSGQYSPFR KRRHGNCAVD RAKSQFVVCS QNHDQVGNRL RGDRLSTMVS FETLKVVAAA
LCLSPFQPML FMGQEYGEKR PFPYFIDHTD PKLVQAVRQG RKKEFQAFWR GTPLDPKAET
TFTSAVLNWE TQDERARQLW AWYKRLLALR RSHPVLGPDM AVAREVRESA TPACLWVHAR
RAETEVVLYF AFGDQPEAPA PWELPEGTWQ RLEDSSARRW GGPGTHGPEE LCHPQAPQFH
ARQVQLWERV ARKE