Gene Dret_1256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1256 
Symbol 
ID8419084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1471685 
End bp1475146 
Gene Length3462 bp 
Protein Length1153 aa 
Translation table11 
GC content64% 
IMG OID645037831 
Producthypothetical protein 
Protein accessionYP_003198122 
Protein GI258405380 
COG category[S] Function unknown 
COG ID[COG4717] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.537962 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.251467 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCCC GTTTACGCCT GACCCGTCTG TCGATCCAGG ATTTCCCCGG GCTCACCGGG 
CCACTCGACC TCTCCCCCCA TCTTGGAGCG GAGTTGACAC TTGTCTCCGG CTCCAACGCC
TCTGGCAAGA CCGTCACGGC CAGGGCCATT GCCGCTGCTC TCTGGCCCAA GTCCGCGCTG
GCCTCGCAAG CCACGTTGCT GGCCCACTGG GCCAATAGCG AGCAAGAGTG GCGGGTGCGG
CTGCGCGGGG GCAGACTCCA GACCCAGTGT AACGGCCGTG ACTGCGCTCC CCCGTATTTC
GCGCCGGAAA CGGTCATGGA CCGCTATTTC CTGGACCTGC GCGACCTCCT GACTGCCGAT
GATGCGGAAT TGGCCAGAAT CGTTCAGCGG GAATGCTCCG GGGGCTTTGA TATCCAGTCC
GCCAAAGCAG CATTGGGATA TAAATTCGTG CGGCCGCGGC GCAACCACTC CAGCGAGGCC
CTGCAACGGG CGCAGCGGGA GGTCCAGACC CTGCGAACCA CCTTGGCCGA TCTCAAACGC
GAAGAAGACC GCTTGCCCGA ATTGCGGGCC CAACTCGATC AGCAGGACGA ACAGGCCCGG
AGTGCCTGGC AATGGCAGCG CGGACTGGCC GCCCGCCAGC GTCAACGCGA GCGGGACCGT
TTGGCCCGGG AATTGGCGGC CTATCCCGAC GCCGTTGCCA AGTGCACTGG AAACGAAGCC
GCCGAACTCG ACCGGTTGCG AACCGCTTTA CAACAGGAAA AAGAACGTTG TGCCGCGGCA
AACGACAAAA AGCGCCGGGC CCTGCAAACC ATGGCCGGGA CAGGCCTGGG CAGCGAGGGC
GTCGCCCAGG GACTCCTCGA GACCGCTAAG GCTGAAGTCG AGACATTGGA GGAGGCGGAA
CAAGCGCTGC AGGCGGCCCA AACCTCCCTG GCCCGGGCCC GAGCCGAACG TGACGCGGTC
GCCAGACATC TCGGCACCGT TGCGGACGAC CCCGGCCCGG CTCCTGATGT CGCGGCTGTG
CACGAATTGA CCCGGCTCGT GAGCGAGGGG CAAGAGCTCG AAGGACGGAT CTCGGGGCTG
CGAGCCATCC AATCCTGGAT GGAAAGCCAT GACGACGACG CTTGGGCCCC CACTTCAGAG
CAATTGGACG ATGCGGTGAA GGCATTGTGG CAATGGTTGG AAGCAGCCCC GGCGGCAAAC
GGTGGCACGG TCCTGTGGCT GGCCTCGGCA GGATTCATGG TCGGCGCAGG GGCGCTAGCG
GCCACCATAG TGCTGCCTGT CCCGGACGGA AGGGGAGTGG TTGCCAGCGT GCTGGCTGCG
CTGGGCCTGG GAATCGGGGG GTGGTGTGTT CGTCAGCGGC AGATTGCGGC CCGGAACCGG
GCTCGGGCAA AGGCAGTGCT GGATCGAAAT ACCGCGAGCG CCCCCGCGGC CTGGACTTTG
GCTGAAGTTT TTGAGCACCT CCGCCGCCTC GGGGAACAAC GGGAGCGGGC CGTGGCCAGA
CGCCGCCGCT TGGAGCGGTG GAGGGACCTC CAGCACCGGC TTCAGGAATG CGAGGCCCAA
TGGGCCGCCT TTCAGGACCG GGTCCGGACT TGGACCTGCC ACCACGGATT GCAACCATCG
GAATCCGTTT CCTTTCTGCG GCATCTCGCC TGGGAATTGC GGGCCTGGTC CGATGCCGAT
CTCGCTGTCG GACAAGCGCA GGGAGAGGTG GATGACGCCG CTGAGCGGTG CGAGCGAACC
CTGAAGGCGG TGCAAGGCCG TTTAGCGGTC TTTATTGCGC CGCCCGACAC GCTGCCGGAA
GCCAAAGCGG CGCTGAAGGT CCTTCAGCAA CGCCAAACAC AGTGGCAAGA CGCGTCGACC
GCCCTGGAAC ACGCGCAGGA AACACTGGAT CTCTCGCGCC AGCGATTGGC GGAAATCCAA
AAGGAATATG CGGCCCTGTT CACTGATCGC GCCCTGGAGG CCGGTGACGA ACAGGGGCTG
CTGCATTGTG TCCAACAGCG GGAGTCCTAT CTGGAGCTTT CTCAGAAACT TCACGCCATC
CAGACCGCTC TGGAGAGCGA CGTCGCCATG CTTCCGGACA CGGTCTGGAC CTTCGATGAC
GACTCCCTGC AACGCTTCCA GGAGCAGGCC TTGGCCGCTG AGCGGCAACG TACCGAAACG
GCCAATGCGA TCGCCGTCCT GGAGGACCGT ATTGCCCGAG CCCGGACCAG CCGAGACATG
GAAGAGGCCC AACAGCGGGT CTTGACTGCC AAAGACCAAC TCCTTCAAGA ACGGGACGAA
GCGGCTGCGA CCACCGCAGG GTACCTGCTC TGCGAGCACC TCCAGCGCCA CTGCCGAGAC
GAACAGGTGC CCGGGGTCAT GGGCCAGGCG AACACCCTGT TTGTCCAAAT GACCAATGGA
GCCTTCCAAT TGGCGTTCGA CCCTCAGACG CCCGCCTTTC GGGCCCGCCA GAGCGCCAGC
GGTCAAGAAC TGGAACTCGA CCAGCTTTCG GCAGCGACCC GAATCCAACT CCTGCTGGCG
GTCCGCCTCG GCTTTGTCCA GACCCAGGAA AACCACGCCC GATTGCCCTT GGTATTGGAT
GAAACCCTGG CCAATACCGA CGCCCTGCGG GCCGAATCGG TCATCCGCTC CCTTCTGGAG
ATCGGGGGCA GCGACCGGCA AATATTGTAT TTCACCGCCC AGGACGAAGA AATCCAGAAA
TGGCAGACAC TGGCCGAAAA GGCCGGAATC AGCTGGGCCC ATATCGATCT CGACACCGCC
GGCCAGGAGG CGGAAAAGCG CTGCAAACCG CGACTATCTG CGGCACCTTC CCGCCGCCGC
CACGTTCCGC CCGCCGGCGA TCTGGACCAT GCCGCATACG GAGAGCGCCT GGATGTCGCC
GGTATCGATC CCCGCCGGGA TAGTGCGGAA TCCCTGCACA TCTGGTATGT GCTGGACGAT
CCACCACTCG TGCAGCGGAT ACTTGAGATG GGCGTCACGC GGTGGGGCAG TCTGCAACAG
TTGCTCGAGA CAAGGGCCTT ACGGCTCGTG TCGCCGGACG CTCCGGAAAT CCGCCGGGCA
CGGGCTCTGG CACGGGCCTA CGAAGCCGCC TTGCGGGGGT GGCGTATCGG CCGGGGACGG
CCGGTGGGAC AAGAGACCCT CGAAGCGGCC GGGGTCAAGA ATACCTTTCT CGGCGAGGTC
AGTGCCATCC TGGACGAAGT CGACGGGAAT GCCGCTGCGC TCCTGGCTGC CCTGGAACAG
GGCCGGGTCA AACGATTTCG CAAGGAAACC CGTCTGGCCA TGCGCGACTA CTTCGAGGCC
CACGGCTTCC TGGACCAGCG TCTCCCCTTG GATGCCGCGG ACCTCCGGGA GCAGGTCATG
ATTGCAACGG CCGAGGATCT GGAACAGGGG TGTCTGACAG TTGCGGATAT TGAGACGGTC
TTGAAGCGAA CCGGGGGACA AGGCTTCGCC GGGGAAGACT GA
 
Protein sequence
MSSRLRLTRL SIQDFPGLTG PLDLSPHLGA ELTLVSGSNA SGKTVTARAI AAALWPKSAL 
ASQATLLAHW ANSEQEWRVR LRGGRLQTQC NGRDCAPPYF APETVMDRYF LDLRDLLTAD
DAELARIVQR ECSGGFDIQS AKAALGYKFV RPRRNHSSEA LQRAQREVQT LRTTLADLKR
EEDRLPELRA QLDQQDEQAR SAWQWQRGLA ARQRQRERDR LARELAAYPD AVAKCTGNEA
AELDRLRTAL QQEKERCAAA NDKKRRALQT MAGTGLGSEG VAQGLLETAK AEVETLEEAE
QALQAAQTSL ARARAERDAV ARHLGTVADD PGPAPDVAAV HELTRLVSEG QELEGRISGL
RAIQSWMESH DDDAWAPTSE QLDDAVKALW QWLEAAPAAN GGTVLWLASA GFMVGAGALA
ATIVLPVPDG RGVVASVLAA LGLGIGGWCV RQRQIAARNR ARAKAVLDRN TASAPAAWTL
AEVFEHLRRL GEQRERAVAR RRRLERWRDL QHRLQECEAQ WAAFQDRVRT WTCHHGLQPS
ESVSFLRHLA WELRAWSDAD LAVGQAQGEV DDAAERCERT LKAVQGRLAV FIAPPDTLPE
AKAALKVLQQ RQTQWQDAST ALEHAQETLD LSRQRLAEIQ KEYAALFTDR ALEAGDEQGL
LHCVQQRESY LELSQKLHAI QTALESDVAM LPDTVWTFDD DSLQRFQEQA LAAERQRTET
ANAIAVLEDR IARARTSRDM EEAQQRVLTA KDQLLQERDE AAATTAGYLL CEHLQRHCRD
EQVPGVMGQA NTLFVQMTNG AFQLAFDPQT PAFRARQSAS GQELELDQLS AATRIQLLLA
VRLGFVQTQE NHARLPLVLD ETLANTDALR AESVIRSLLE IGGSDRQILY FTAQDEEIQK
WQTLAEKAGI SWAHIDLDTA GQEAEKRCKP RLSAAPSRRR HVPPAGDLDH AAYGERLDVA
GIDPRRDSAE SLHIWYVLDD PPLVQRILEM GVTRWGSLQQ LLETRALRLV SPDAPEIRRA
RALARAYEAA LRGWRIGRGR PVGQETLEAA GVKNTFLGEV SAILDEVDGN AAALLAALEQ
GRVKRFRKET RLAMRDYFEA HGFLDQRLPL DAADLREQVM IATAEDLEQG CLTVADIETV
LKRTGGQGFA GED