Gene Dret_2127 details


Gene Information

Locus tag: Dret_2127
Symbol:
ID: 8419977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2418542
End bp: 2420800
Gene Length: 2259 bp
Protein Length: 752 aa
Translation table: 11
GC content: 62%
IMG OID: 645038720
Product: hypothetical protein
Protein accession: YP_003198989
Protein GI: 258406247
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3387] Glucoamylase and related glycosyl hydrolases
TIGRFAM ID:
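
The coordinates and lengths listed in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2418542
END_BP = 2420800
GENE_LENGTH_BP = 2259
PROTEIN_LENGTH_AA = 752


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Both checks hold for the values in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the 2259 bp span corresponds to 753 codons, of which 752 encode amino acids.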


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.655892
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAAAC GTTGGATCCC CTGGAATTTC ATCATCAAAC GCGCCGCCAA GGCCTATGGC 
ATCATGGACC CCCTGACCTT TCTGGCGCGA TTGCGCCGCT TTGCCCAGCC GTCAGAAGTC
CAGGAACCGA TCGAGCTCCT GCGTGCGGGA ATCATTTTCC ACGCCCGGGG CCTGATCAAC
ACCAAAGCGA TCCAGCACAA TTTGGACTGG GTCTGGCCGT ATTGGGTCCA AAAACAGTTC
AATCCCCGCG ATGTCTCCTT TGTCCCCCGG GCCTTTTCCT TCAGCCACAT CAATCTCACC
CACCGCAACT GGACCGGTGT CGGGCGGCCG GACGTTGCCT TGTATCCCAT CGTCGATCCC
CGTGGTCTGG TGACTGTGTT GCACGAAGGA TGGTCCCTGG ATTTCTGGAT TCTCCGCCCC
GATGGCTCCC TGGTGACGCC CTCGGTCCAC AGTGAAACCG AACAGAGCTG GGACCTGGAC
GACCACCTGG CAGTGACCAC AACCAGTCGG GACAGTGGCG ATGTCCTCAC CAGCAGGGTC
TTCATGGAAC GCGAACCCGG AAGTGGCCCG GTAACCAAGG TCCGCGTCCA CGCCTCCGGC
CCGCCTGGGG GCTGGCTCGT GCTGGCCTTA CGCCCCTACA ACCCCGAAGG GGTCCAGTTT
ATTGAACGCG TGCGCTACAA AAACGAGGAA CGCCTGTTGC GGGTCAACGG TCGCACCGAT
CTCTATCTCC ATCCCAGACC GGGAAAAACC GTCTTCGCGG ACTACCATGA AGGCGACGTC
TGCCACGCCC TGCAACGCAA AGCCCATATG GAGCAAATCA ATTGTTCAGT GGGCATGGCG
ACCGGTGCGG CCCTGTTCCC CCTCGAGCAG GGCACAGCAG AACTCAAGAT CGATATCCCC
ATGGCTCGGG AACTGCGCCG CGGGCACCAC AAGCCACTGC CTCGGGAGGA CTGGAAAGAC
GCCCTGGCCC CGGCCGCATC TTTGGAAGTC CCTGACCCGC AAATCGATTT TCTGTACCGA
GCCGCTCTGC GTTCCCTGGT CCTGCTGTCC GCAGAGGAGA TCGTTCCCGG CCCCTTTACC
TACAAGCGAT TCTGGTTTCG CGACGCCTGC CTCATGCTCC ACGCCTTGCT GTGTGCCGGC
TTGACCGATC GCAGCGAGCG GATTCTCTCC ACTTTTCCAG GGCGGCAGAA AATAAGCGGG
TACTTCCAGT CCCAGGAAGG GGAATGGGAC TCCAACGGGC AGGTCCTCTG GCTCGCGCAA
CGCTTCCGGG AATTGACCGG TCGCGAACTC GGAGAGCAGT GGACCGACGC GGTCCGCAAA
GCCACGGACT GGATTGAGCG CAAGCGGCTG ACCAAGACCG AGAACGCCCC GCACGAAGGA
TTGCTGCCCG CCGGATTCAG CGCCGAGCAC TTCGGCCCCA ACGACTATTA TTATTGGGAC
GATTTCTGGG CTCAGGCTGG TCTCCAGGCC GGCACGAAAA TCCTCGGCGA CCACGGCCAG
GACAAGGCTG CCGCCACCTG TGCCCAGCGC GCCGAGTCGT TGGGTCAAGC CATCCGCAAC
AGCCTCGACT CCATCGCCAG GGAACGAAAA CAGGGCGCTG TACCAGCTTC TCCCTACCGG
CGGCTGGACT CCGGGGCCAT CGGTTCTCTG GTCGCCGACT ACCCGCTTGG TCTGACCGGG
CCCAACGATC CGGAAATCGA GGCCACCGCC GAATACCTTT TTCAAAATTG CCTCTTGAAC
GGCGGATTTT TTCAGGACAT GATCCACTCT GGAATCAACG CCTATCTGAC CTTGTGTCTG
GCCCAGACCC TGCTGCGGCG CGGTGACGGC CGGTTCCAGG AACTCGTTCG CGCCGTGGCT
GATCTGGCTA CCTCCACGGG CCAATGGCCG GAGGCGATCC ATCCGCTGAC CCTCGGCGGG
TGCATGGGCG ACGGGCACCA CGGCTGGGCG GCAGCGGAGT GGGTCATGAT GATCCGCAAT
ATGTTCGTGC GCGAGGAAGA AAACACCCTC ATTCTCGGTT CTGGTATTTT TCCCGAATGG
TTTGAGGCCA GGGCCCCCAT TGGCTTCGGG CCCACGCCAA CCCCGTATGG CCCCGTCCAG
GTGCGGTTTG AACCCTCTGA TGCCGGGTGG CAGGCCCAGG TCCAGGCTGA CTGGCACGGC
GAAAAGCGCC CGGCTGTGGA AATCCGCGCG CCAGGTTTTC AGCCCTGTAT CCTGACCGAC
CTGTCGCAGC CCGTGACGCT CATTGCGGAG ACGACATGA
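
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned up and the GC fraction computed for comparison with the GC content listed in the Gene Information table. The helper names are hypothetical, and the short input string is an illustrative example, not the gene itself.

```python
def clean_sequence(text: str) -> str:
    """Join space- and newline-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# Illustrative input only; paste the full gene sequence here to check the record.
example = clean_sequence("ATGC GGCC")
print(len(example), gc_fraction(example))
```

Applied to the full sequence, `len(...)` can be compared against the listed gene length and `gc_fraction(...)` against the listed GC content.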
 
Protein sequence
MIKRWIPWNF IIKRAAKAYG IMDPLTFLAR LRRFAQPSEV QEPIELLRAG IIFHARGLIN 
TKAIQHNLDW VWPYWVQKQF NPRDVSFVPR AFSFSHINLT HRNWTGVGRP DVALYPIVDP
RGLVTVLHEG WSLDFWILRP DGSLVTPSVH SETEQSWDLD DHLAVTTTSR DSGDVLTSRV
FMEREPGSGP VTKVRVHASG PPGGWLVLAL RPYNPEGVQF IERVRYKNEE RLLRVNGRTD
LYLHPRPGKT VFADYHEGDV CHALQRKAHM EQINCSVGMA TGAALFPLEQ GTAELKIDIP
MARELRRGHH KPLPREDWKD ALAPAASLEV PDPQIDFLYR AALRSLVLLS AEEIVPGPFT
YKRFWFRDAC LMLHALLCAG LTDRSERILS TFPGRQKISG YFQSQEGEWD SNGQVLWLAQ
RFRELTGREL GEQWTDAVRK ATDWIERKRL TKTENAPHEG LLPAGFSAEH FGPNDYYYWD
DFWAQAGLQA GTKILGDHGQ DKAAATCAQR AESLGQAIRN SLDSIARERK QGAVPASPYR
RLDSGAIGSL VADYPLGLTG PNDPEIEATA EYLFQNCLLN GGFFQDMIHS GINAYLTLCL
AQTLLRRGDG RFQELVRAVA DLATSTGQWP EAIHPLTLGG CMGDGHHGWA AAEWVMMIRN
MFVREEENTL ILGSGIFPEW FEARAPIGFG PTPTPYGPVQ VRFEPSDAGW QAQVQADWHG
EKRPAVEIRA PGFQPCILTD LSQPVTLIAE TT
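
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons. Because this gene begins with ATG, no special handling of alternative start codons is included, which is a simplifying assumption. The `translate` function is a hypothetical helper, not part of the source page.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    sequence = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence shown on this page.
print(translate("ATGATTAAAC GTTGGATCCC CTGGAATTTC"))  # MIKRWIPWNF
```

The output matches the first ten residues of the protein sequence, and the final three codons of the gene (ACG ACA TGA) translate to the trailing `TT` followed by a stop.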