Gene Dret_2039 details

Gene Information

Locus tag: Dret_2039
Symbol:
ID: 8419884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2339706
End bp: 2340749
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 52%
IMG OID: 645038627
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_003198901
Protein GI: 258406159
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
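
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS is complete with a single terminal stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a complete CDS: codon count minus one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 2339706, 2340749

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1044
print(protein_length_aa(length))  # 347
```

Both results agree with the Gene Length and Protein Length fields reported for this gene.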


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0253649
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000243746
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATCGTCA AAGACGGTGA CAAACTTGTC AATTGCCGTA ATTGGTCGAC GCTGGTTCAT 
CCAGAGACTC TTGAGCGGGA TGAAGACAGC ACGGAGAACT ATGGCAAGTT TTCCTGCGAA
CCTTTGGAGC GGGGTTTTGG AACGACCTTA GGCAATGCCC TGCGCCGGGT GCTGCTGTCC
TCTCTGCAAG GGGCGGCCAT TGTAGCGGTG CGCATCAAGG GTATCCAGCA CGAGTTCACG
ACGATCCCCG GGGTAATGGA GGACATTACT GATATAGTCC TCAATTTGAA GCAGTTGCGT
TTGCGGATGA ATACGGACGA GCCTCAGCGT ATTGAACTCA ATGTCAATAC CAAGGGTGCC
GTGACGGCTT CCGCGTTTCA GACAACGCAA AACCTCGAGA TCCTGAATCC GGACTTACAT
ATCGCCACCT TGTCGGAGGA TATCGAATTC GGCATTGAGG CTGAAGTGCG GATGGGAAAA
GGGTATGTTC CGGCGGAGAT GCATGAAGGA TTGGAAGAGG AAATCGGGCT GATTTCCATG
GATGCCAGTT TCTCTCCGAT CCGCAAAGTC GCGTATCGTG TGGAACAGGC CCGCGTGGGC
CAGATGACCA ACTATGACAA ACTGGTCATG GAAGTCTGGA CTGACGGGTC CGTGTTGCCC
GAGGACGCTG TGGCTTATAG CGCCAAAATC CTGAAAGAAC AGCTGGCCGT GTTCATTAAT
TTCAATGAAG ACAGTGCCAA TGTCTGTGAA TCCAAGGGTG CCGGCACCGA GTCATTGAAC
TCGAACCTCT TCAAACATAT TGATGACCTC GAACTCCCCG TCCGGGCCAG CAATTGTCTG
AAAAGCGCCA ACATCAATCT TGTTGGTGAA CTGGTCCAGA AGACCGAGGG CGAGATGTTG
AAGACCAAGA ATTTTGGTCG CAAATCGCTT GAGGATATCC GCAAGGTGAT CCATGAACTC
GGTCTCGACT TCGGAATGAA GCTGGACGGC TTCGAGGAAC AATACAAGAA ATGGCGAGAG
AGGAACCAGC AAGATGAGGC ATAA
 
Protein sequence
MIVKDGDKLV NCRNWSTLVH PETLERDEDS TENYGKFSCE PLERGFGTTL GNALRRVLLS 
SLQGAAIVAV RIKGIQHEFT TIPGVMEDIT DIVLNLKQLR LRMNTDEPQR IELNVNTKGA
VTASAFQTTQ NLEILNPDLH IATLSEDIEF GIEAEVRMGK GYVPAEMHEG LEEEIGLISM
DASFSPIRKV AYRVEQARVG QMTNYDKLVM EVWTDGSVLP EDAVAYSAKI LKEQLAVFIN
FNEDSANVCE SKGAGTESLN SNLFKHIDDL ELPVRASNCL KSANINLVGE LVQKTEGEML
KTKNFGRKSL EDIRKVIHEL GLDFGMKLDG FEEQYKKWRE RNQQDEA
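
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the pipeline used by the source database: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose amino-acid assignments are shared by translation table 11 (table 11 differs only in the permitted start codons). Only the first and last rows of the gene sequence are used here to keep the example short.

```python
# Standard codon table, built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last rows of the gene sequence above.
first_row = "ATGATCGTCA AAGACGGTGA CAAACTTGTC AATTGCCGTA ATTGGTCGAC GCTGGTTCAT"
last_row = "AGGAACCAGC AAGATGAGGC ATAA"

print(translate(first_row))  # MIVKDGDKLVNCRNWSTLVH
print(translate(last_row))   # RNQQDEA
```

The two outputs match the first 20 and last 7 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` in the same way gives a value that can be compared with the reported GC content.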