Gene Dret_2544 details


Gene Information

Locus tag: Dret_2544
Symbol:
ID: 8420401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013224
Strand:
Start bp: 35877
End bp: 39185
Gene Length: 3309 bp
Protein Length: 1102 aa
Translation table: 11
GC content: 55%
IMG OID: 645039141
Product: hypothetical protein
Protein accession: YP_003199398
Protein GI: 258406657
COG category:
COG ID:
TIGRFAM ID:
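The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = feature_length(35877, 39185)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)  # 3309 1102
```

Under these assumptions the computed values agree with the Gene Length (3309 bp) and Protein Length (1102 aa) fields of this record.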


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 113
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGTTT CAGGCCTTGT CGTCCAATTC AAGACCGCCT ATTCTTCCAA AAAAAAGAGT 
TGGCAGACGT TGTCCCTCGA TAAAGCCGAA CAATCTATCT TCGGTTCCGT GGGGATTGAT
AAGCTGGCCT CGGGGGAAGC CGGAGTATGG CTCACACCCC GAGCACACGA CAGCGCCGGA
GACACCTGGC ACATGGCGTG GTGGGACGTG GAACACCCCG ACGAACACCA CACCAGTATC
GAGGCCAATA CCCGTACCGC CCAGGACCTT TTCATCCAAC TGGATGGACT CGGCCTGGCG
CATGGACTGT CTGTCGTTTT ATCCGGAAAG GGGTTCCGTT TCCTCTGGCC GTTTGTAATC
CCATCTGACT ACACCAAGGC CTACCGGGCC ATGATTACCG ACAAGGGCCA ATGGGTAGGA
CTCGATCCCT CGCCGCACAT GGCTCCAAAC CGGTGGTTCC GCTTTCTTGG GTACCGGGGG
CACCGCAAGC AAGACACCAA CCCCAAAGAT CGCCATATCC ACCTTTTGGA GCACCCTGCC
CACCTCCTGG ATCTCACAGA GACAACGTAT CTGGAGTTAG TGCAAGGAAA ACCGGATCCG
GCCACATTTC GCCCTTGGAT GCGGAGGCTT TTGCCCCACA CCACAGAGCC GCCACCGGAA
TGGGTGGAGT TACTGAAAAA ATACAACGAC ATTCTGCGGC TACGATCGCA TATCGTGAAG
CTGAATTTCC CCAAGAAGCC GAAACCTAGA GGAGTGGACT GGGCGCAAAT AGAAACCTTC
CTTACCCAAA AGGGGATCCG TACCTGGGAC ATGCAGGACA ACGGGGAAAT CTTCTATCGC
TTAACCGAAT GCCCCATGTG TGGCCGGAGG GATGGTAATC CCTGGATGAC GCAAGCCGGC
CGGCTCAAAT GCTTCCACGC AAACACCTGC CCGGCTGGAG AAGAACACAC CGACCTGCAG
GGGCAGACCT TCAAAAAAGG CTTGCCGCCG GAAAAGTGGG TCGAAGGATA CCAGGAGATA
GAAGTAAGCC CTCCCGTGCA AGAAGACCAA CGGGAGAAGA CGGACGTCAA AACCGCCAGA
GAACGCATCC GGGACGCTCT GCGCTCTGAC GAAGACGTAT TGATTCGGGC TGCCCCAGGG
GTGGGCAAGA CACACACCAC CTTGGAAGAG ATCCTGCCTC AATGCCGGGA TCGACTCGTT
TTGTTCACAG TTCCCAAAGG GGAGAACGTC GCCGAGATAT ACGAAAAGGC ATTGAGCCTG
GCGCCGGAAG GCGTTGAAAT CCGCAAGATC AGGGGACGCA GGCGAGAAGA AAACGGTTCA
GGAACTTTGG ATTTCAACCC TCCACCGGAG GGGATCTGCT ACAACATGGA CTATGTAGAA
GAAGTGGCCA ATTGGGGCTA CTCTCCAGGG TTGATTTGCT GCACCGGGTG CGAGCACCAA
AAAAATTGCC CCTACCAGGA ACAATTCAAG TCCCTCCCGA AAACCGGACT TGTCATTGCA
GCGCATGAAA GCGCTGTTTC CCTGCCTAAA AAACGCCATT TTGATCTCTG GGTAATCGAT
GAAAACCCTG TGGCGTCTCT TCTTCAAACC AAGACCGTTT CGCCCGGGGC TCTTTCACAA
ATCAGGGCCA AACTACCCCG GAGGTCTGAA TTGCCTTTGG ATACAATCAA GGCTCAGGGC
GAAGGCCTCT TGAAGTATCT CGCAGGAAAC CAACACGAAG GCCGGATCTA TGCCACTACG
CCGCCAGCCG AATGGAAGAA CACGGAAAGT GTCTGGGAGC TCGGAGGAAT CGAAAGCCAT
AAAACACCGT TTGCCGAAGA TTTGAGCTGC TTCGACCAAC TCGAAGAAGA AAACCTCAAG
CAGTGGCAGA AGCGGCTGTA CTACAGCGAA AAGGTAAATT TTACCGCTTT GGAATGGCTC
TGGACCGCCA CAGGACAACA GGCAGGGGTG GCCTATATCA AGGCCCGGGC AGACCGGAAG
CACCCCATTT CCTATGTCCT GCACCAAACC AAAGCCCCAG GCATGAGGCG GGCAAACCAG
GATGGCAGCG AAACCAAAAC CCGAATCGTG GCTCTGGACG GGACCGGCAA CAAACAAGAA
CTGGAAGCGC TCTTCCCAAA TCGTTCCTTT GCCGAGGTGT CGGCTGATGT GGATCTTCCG
GGCCGCAGGG TCCACCTTGA ATACAACCTG AGCAAAACCA CAGTCTGTGG GGCAGAGAAG
TACAAAAAAA CCCCGATGGC GCCGCAGCAC GTCAAAGCCA AGCTCAAAGA GGGGCTCAAA
TGCCTTCGTA CCGAGGAAAA GCGTGTGCTG TTGGTGACGT TCAAGGACGC CAAGGAAACA
GTGTTGCGGG CAGCGCAGAA CCTCGATCCA GGTCGCACCT TTGAGGTGAC ACATTTTTGG
GGCAACCGGG GGCTGAACTG CTTCCAGGAA TGCGACGCCG TTATCTGCTT CGGCTCCCCT
CGGGTGGCAC CGCACAGCGT CAAAGACATG GCCTCCAGTC TGTTCGACGA TACGGAGCAG
CAGAAGGCCT GGACCGAACA ACAGGGGCAC CGGGACGTGG TGCAGTCCAT TCACCGGATC
CGGCCAATTT ACTCCAGAAA GTCCGTGATC GTGATGGGCG ATTTTTGGCC GGAGCAGTTA
GGGACACCAC AATTCCGGAT TCGGGCTTAT CAAAAGAATG GCGCCTTTGA CCTGGCCTTG
GAACGGCTCA AGCTCGTAGC GCAGACCTAC GGCTTTGTGA CCCGGGAGCT TGCCTGTTTA
CACGGGGTGT TCTGCCGGGT GGATACCCAG AGCATAGCTA AGTGGATGGA ACTCCAGAAG
CGCTTCCGGG AAAAGCTCGA AGAAAGTCCA TCTGAATTTG TGTTCTTTCC TATAAATATA
TTTCTTATAG GAAACCACAC AAATAAAAAT GGACTTTTGG ACCCGATCAA GCTGAAGGAC
ACTCACGCTT GGAATGACCT GGTGTCAGCC CTCGAGCTTG AGCTTGGCTT TCCGTCTCTG
ACTGAGCGGC AAAGTACTGG GGCAGGAAGA CCGAGCCGTG GAGTGGGGAC TGTCAGCGCT
GCACGGCGGT TTTACCACGC TCTGGGTGTG GTCGTTTTTG ATGAATCCCT CTGGTCCGGC
CAGGAGCATT TTTTCGAGAT CCCCGTAGGG AAACTCAAAA GGCGGGTGCT GCCCGGGCAG
GGCCTTTTTG AAGCGCGGAG GGCCAAGTCC GTACTGTTGG GATTCACCTC AGAAATCCGA
CACCGGCAAC AAAAGCCACC GATAGAGGTT TTCAGAGATC AACAGTCGTC CTTGGCAGCT
GTGGTGTAA
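
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. As a minimal sketch, the helpers below (hypothetical names, not taken from any library) strip the whitespace and compute the GC percentage, which could be compared with the GC content field of this record once the full sequence is pasted in.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage with the first two blocks of the gene sequence:
first_blocks = "ATGCGGGTTT CAGGCCTTGT"
print(len(clean_sequence(first_blocks)), gc_percent(first_blocks))
```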
 
Protein sequence
MRVSGLVVQF KTAYSSKKKS WQTLSLDKAE QSIFGSVGID KLASGEAGVW LTPRAHDSAG 
DTWHMAWWDV EHPDEHHTSI EANTRTAQDL FIQLDGLGLA HGLSVVLSGK GFRFLWPFVI
PSDYTKAYRA MITDKGQWVG LDPSPHMAPN RWFRFLGYRG HRKQDTNPKD RHIHLLEHPA
HLLDLTETTY LELVQGKPDP ATFRPWMRRL LPHTTEPPPE WVELLKKYND ILRLRSHIVK
LNFPKKPKPR GVDWAQIETF LTQKGIRTWD MQDNGEIFYR LTECPMCGRR DGNPWMTQAG
RLKCFHANTC PAGEEHTDLQ GQTFKKGLPP EKWVEGYQEI EVSPPVQEDQ REKTDVKTAR
ERIRDALRSD EDVLIRAAPG VGKTHTTLEE ILPQCRDRLV LFTVPKGENV AEIYEKALSL
APEGVEIRKI RGRRREENGS GTLDFNPPPE GICYNMDYVE EVANWGYSPG LICCTGCEHQ
KNCPYQEQFK SLPKTGLVIA AHESAVSLPK KRHFDLWVID ENPVASLLQT KTVSPGALSQ
IRAKLPRRSE LPLDTIKAQG EGLLKYLAGN QHEGRIYATT PPAEWKNTES VWELGGIESH
KTPFAEDLSC FDQLEEENLK QWQKRLYYSE KVNFTALEWL WTATGQQAGV AYIKARADRK
HPISYVLHQT KAPGMRRANQ DGSETKTRIV ALDGTGNKQE LEALFPNRSF AEVSADVDLP
GRRVHLEYNL SKTTVCGAEK YKKTPMAPQH VKAKLKEGLK CLRTEEKRVL LVTFKDAKET
VLRAAQNLDP GRTFEVTHFW GNRGLNCFQE CDAVICFGSP RVAPHSVKDM ASSLFDDTEQ
QKAWTEQQGH RDVVQSIHRI RPIYSRKSVI VMGDFWPEQL GTPQFRIRAY QKNGAFDLAL
ERLKLVAQTY GFVTRELACL HGVFCRVDTQ SIAKWMELQK RFREKLEESP SEFVFFPINI
FLIGNHTNKN GLLDPIKLKD THAWNDLVSA LELELGFPSL TERQSTGAGR PSRGVGTVSA
ARRFYHALGV VVFDESLWSG QEHFFEIPVG KLKRRVLPGQ GLFEARRAKS VLLGFTSEIR
HRQQKPPIEV FRDQQSSLAA VV
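
The relationship between the two sequences can be illustrated by translating the start and end of the gene sequence. This is a sketch only: it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in the set of permitted start codons), and it stops at the first stop codon. The `translate` function is an illustrative helper, not a library call.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (standard assignments, shared by table 11)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(text: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: end of translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence shown above
print(translate("ATGCGGGTTT CAGGCCTTGT CGTCCAATTC"))  # MRVSGLVVQF
print(translate("GTGGTGTAA"))  # VV
```

The two outputs match the first block (MRVSGLVVQF) and the final residues (VV) of the protein sequence listed above.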