Gene Dret_2507 details

Gene Information

Locus tag: Dret_2507
Symbol:
ID: 8420369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013224
Strand:
Start bp: 3657
End bp: 5123
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 44%
IMG OID: 645039109
Product: hypothetical protein
Protein accession: YP_003199366
Protein GI: 258406625
COG category:
COG ID:
TIGRFAM ID:
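
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3657
END_BP = 5123


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1467, matching "Gene Length"
print(protein_length(length_bp))  # 488, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.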


Plasmid Coverage information

Num covering plasmid clones: 67
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 124
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAGA AAAAAAGACG CTCCCTCTAC GACACATTAT CCGCACGGGA TAGCGGACAG 
GACGATGAGC TTCGAGAGAC ATTTAAACGT GTACGTGAAA TCAATACAAA ACAGTCCACT
GAAACCACCC CTCCATCTGA GAACTCCAAT AAAAAAGTCC AAGCCCAAGA CAAGAGCCCA
TCTGGACCGC TCATTCATAC CGCCCAAACA AACCGCTCAA ACATACCGAT TACTCATTCA
GATCAACAAA GTAATAATAA AGAAAATAAC AAATCAAAAC AAGAATTTAA CAAAGATAGC
CTTGGTGACA CAAACCGCTC ACCTAAAGCG CCCACAAATA CCGCTCAAAA CAACCGCTCA
AGCAAAGCGC TCGCTAATAC CGCTCAGGTA GACCGCCCAA ACAAACCGCT CGAAAACAAC
CATACCGCTC ATTTAAACAG CTCAAATGAA CCGTTTACTC GAGCGCTTCG AGCGCAATCA
TTGAGCGGTA CTAATGAGCG GTATGCCCAA GCGGTACAAA TGAGCGGTTC AGAGGAGTCA
CATGAGCCCA CTGAAGACCC TCAAAATATC CTCCTAAAAC CCAAATCTTC CATCAGATCT
AGAAACCAAA AAAAGATTTT TGACTACCTT CAGCGGATTG GCTCCCAAAC GACCACCCTT
ACATATATAT CTAATATTAC TGGTGTTCCG TACAGCACAA CGAGACGAAT AATAAGCAAA
TTTAAAGCTG AAGGGCTTAT TTATTACAGA ACGCTGTTCG TTAAGGACGT AGGCTGGTGC
GCAAAAATCT GGATAATAAA TTCAGAAGGC GAGACACCCA ACCGAGCGGT ACAAATGAGC
GGCATAAATG GGCGGTATGA ATGGGCTGTA TCAAACGCCT CTAAGATAGA TAGAGAATCT
ATCTATCTAA AAGAGGGGGG TGTGGGGGGA GATGGGCAAG ATCAACCTGA CAACTCATCC
TCTTCTAGTG AGGCTCGTCT CAACCAGCTC ACCGACGAAG ATATTGCTTT CTTTTGGCCA
AAGCTTCACC AATCTGGTTT TGGAGCTCAC CAGGTCCAGC AAATCGTTCA AAGACTTTCC
AAGGTGGATA AAAAAGCCGA CAAAGTTATC CAGGGGCTTG ATCACGCTGA ATGGGAACTA
GATCAGGGAA AGATGACTGA CAAAGAAGGT AATCCCGTAG GGAATCCATG TTCTTACGTG
TTCAGTTCTC TAGCCAGAGA AGGCTATTAT CGTCGTCCAT CTGGATATAT TTCCCCGGAA
GAACAAGCTG AGCTGGATGC CAAGGAAGAA GCTGATAGGC TGCAACAACT GGGGAAAGAG
AAAAAGGAAT CTCAGTTTAA AGCATGGAAG GCAAATTTGT CGGAAGAGGA GTTAAATCAA
ATTTTGGCTT GTAAAACTCA CAAAGGACCA ACGGATCCGT GGCTTAGGCA ATATTGGGAA
AAGACTATTT ATTATACAAC AAAGTAA
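
A figure such as the GC content field can be recomputed from a sequence printed in blocks of ten bases, as above. This is a minimal sketch, not the method used by the source database: the helper names are hypothetical, and the rounding the database applies is unknown.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used here only as a small example.
print(gc_fraction("ATGGCCAAGA AAAAAAGACG"))  # 0.4
```

Passing the full gene sequence text to `gc_fraction` gives the value to compare against the reported percentage.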
 
Protein sequence
MAKKKRRSLY DTLSARDSGQ DDELRETFKR VREINTKQST ETTPPSENSN KKVQAQDKSP 
SGPLIHTAQT NRSNIPITHS DQQSNNKENN KSKQEFNKDS LGDTNRSPKA PTNTAQNNRS
SKALANTAQV DRPNKPLENN HTAHLNSSNE PFTRALRAQS LSGTNERYAQ AVQMSGSEES
HEPTEDPQNI LLKPKSSIRS RNQKKIFDYL QRIGSQTTTL TYISNITGVP YSTTRRIISK
FKAEGLIYYR TLFVKDVGWC AKIWIINSEG ETPNRAVQMS GINGRYEWAV SNASKIDRES
IYLKEGGVGG DGQDQPDNSS SSSEARLNQL TDEDIAFFWP KLHQSGFGAH QVQQIVQRLS
KVDKKADKVI QGLDHAEWEL DQGKMTDKEG NPVGNPCSYV FSSLAREGYY RRPSGYISPE
EQAELDAKEE ADRLQQLGKE KKESQFKAWK ANLSEEELNQ ILACKTHKGP TDPWLRQYWE
KTIYYTTK
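
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below uses only the Python standard library and builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), so the standard assignments are used. The `translate` function is illustrative and is not the tool that produced this record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGGCCAAGA AAAAAAGACG CTCCCTCTAC"))  # MAKKKRRSLY
```

The output matches the first ten residues of the protein sequence; translating the complete gene sequence the same way allows a full comparison.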