Gene Dret_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0041 
Symbol 
ID8417843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp52682 
End bp54190 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content64% 
IMG OID645036604 
Product4-alpha-glucanotransferase 
Protein accessionYP_003196921 
Protein GI258404179 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1640] 4-alpha-glucanotransferase 
TIGRFAM ID[TIGR00217] 4-alpha-glucanotransferase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACCG ATCGCGCCAG CGGTATCCTG CTCCATATCA CCTCCCTGCC CGGCACCTAT 
GGCGTGGGCG ATGTCGGACC GCAGGCCAGG CGGTTTGTGG ATTTTCTCGC CGCCGCCGGC
CAGCGCTATT GGCAGGTTCT CCCCGTGCAC CCGACCCAGG ACGGGGCCGG TCACTCCCCG
TACAGCAGCA CCTCCGCCCA TGCCGGCAAC GAACTGCTCA TCAGCCCCGA GGATCTGGTC
GCGGAGGGGC TGCTCGGGGA AAGCCACATC CGGCCTTGGC GGCGCCAGGC CGGGGAATCT
CAGGCGCATT ATGACTGGGC TCAGGCCGCC AAACGCGAAT GCTGTGCTGT GGTCCAGGAT
CGCCTCCATC GTCGCCTGCT TCCCGCCTTT GAGGCCGAAT TCCAGGACTT CTGCCGCACC
CAGCGCGGCT GGCTGGACGA CTTTGCGTTG TTCAGCGCTG CGAAAAACCA TTTTGGTCCC
CGGCGGTCCT GGCGGTATTG GCCTGAGGAG ATCCGCCTCA GGCAGCCACG AGCGTTGCAG
TGGATGCGCC AGGAACTCGC CTTGGCCATT GAACGTGAAA AGCTCGGTCA ATGTCTGTTT
TTCCGCCAAT GGCGCCGCCT GCAGGCCCGT TGCCGCGCCA AGGAGGTGGC CTTGATGGGC
GATGTTCCCA TCTATGTCGA CTATGAAAGC GCTGATGTCT GGAGTCATCC GGAATATTTC
AAGCTCGATG CAACGCTCCG TCCGCCTGTG GTCGCCGGCG TGCCGCCCGA CTATTTCAGC
GCTACCGGCC AGCGGTGGGG CAATCCGGTC TATGATTGGC GGAAGTTGTG TCAGGACGGC
TTCCATTGGT GGGTGGATCG CCTGGAGGGC GAACTGCAGC TGTGCGACGT CCTGCGCTTG
GACCATTTCC GGGGCTTTTC CGCCTGCTGG GAGGTGCCGG TCGAACATGA GACCGCTGAG
AACGGCTCTT GGGTCCACGT CCCGGGCGAG GCGTTGTTCG CCTTGTTGCA GGAGCATTGG
GGCCGACTGC CACTTGTGGC CGAGGATCTG GGATATATTA CCGACGAGGT CCGGCATCTC
AAGCAACGCT TCGATTTGCC CGGCATGGCG CTGCTGGTCT TTGCCTTTGA TGGGGATCCC
CAGAACGCTT TTTTGCCGGA GCATCACGAC CCCAATCTGG TCGTTTATAC CGGCACGCAC
GATACGAACA CCGTTCGGGG ATGGTACGAA GAAGAAATCA CTGCGGAGCA GCGGTCTCAG
CTCCGCCGTC ATCTCGGTCA TGCCCCCGCT GCTTCGCACG TGGCCCAGGA TCTCGTCCGC
CTGGCCCTGG AGAGCCGGGC GCGGACGGCG ATCCTCCCGC TACAGGATGT GCTGGGGCTC
GGCAGCGAGG CCCGGATGAA CGTACCCGCG GCCCCCGAGG GCAATTGGAT CTGGCGGACT
TCCTCAGAAG CACTGCACGA AGACGTGGCT GCCTGGCTGG CTGAAGTGAC GGCTGCCAGC
GGCCGGTAG
 
Protein sequence
MLTDRASGIL LHITSLPGTY GVGDVGPQAR RFVDFLAAAG QRYWQVLPVH PTQDGAGHSP 
YSSTSAHAGN ELLISPEDLV AEGLLGESHI RPWRRQAGES QAHYDWAQAA KRECCAVVQD
RLHRRLLPAF EAEFQDFCRT QRGWLDDFAL FSAAKNHFGP RRSWRYWPEE IRLRQPRALQ
WMRQELALAI EREKLGQCLF FRQWRRLQAR CRAKEVALMG DVPIYVDYES ADVWSHPEYF
KLDATLRPPV VAGVPPDYFS ATGQRWGNPV YDWRKLCQDG FHWWVDRLEG ELQLCDVLRL
DHFRGFSACW EVPVEHETAE NGSWVHVPGE ALFALLQEHW GRLPLVAEDL GYITDEVRHL
KQRFDLPGMA LLVFAFDGDP QNAFLPEHHD PNLVVYTGTH DTNTVRGWYE EEITAEQRSQ
LRRHLGHAPA ASHVAQDLVR LALESRARTA ILPLQDVLGL GSEARMNVPA APEGNWIWRT
SSEALHEDVA AWLAEVTAAS GR