Gene Dret_2105 details

Gene Information

Locus tag: Dret_2105
Symbol:
ID: 8419955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2395353
End bp: 2396270
Gene Length: 918 bp
Protein Length: 305 aa
Translation table: 11
GC content: 56%
IMG OID: 645038698
Product: hypothetical protein
Protein accession: YP_003198967
Protein GI: 258406225
COG category: [R] General function prediction only
COG ID: [COG1611] Predicted Rossmann fold nucleotide-binding protein
TIGRFAM ID: [TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1
            [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2
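
The start, end, and length values in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes a single stop codon; the function names are illustrative and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2395353, 2396270)
print(length, protein_length(length))  # 918 305
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.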


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0622096
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCTCA AAAAATACGG CCGCGGCCCA TTTCCTTCAG CCACAGAAGA CGCCCGGGCC 
TCCCAATGCC CCCCCGATTC CCCTCAATGC CAATCCCCGT CGTACCGGTT GGCCTTTCAG
GACCCCGAAT TCATGCTCCG CGGCGAATTA CGGCCGGTAC GCCTGCAATT GGAACTCCTC
AAGCCGGAGT TGATTCTTCA GGAGGAGGGA ATCGAATCGA CTATCGTCAT CTACGGCTCA
GCGCGGATCC CTGACCCGGA GACGGCCCGA GCCCATTTGG AGACCATCCA ACAAGCCTAT
GAACACAACC CGGAAGACCG GGAACTGCAG AACCAGTTGG CCATCGCTCA CAATGCCGTG
GCCACAAGCC ACTATTATGA TGAATCCCGC AAACTTGGCC GGCTCATTTC CCAATCGACC
CCGGACAACC AGCTCGTTGT CATTACCGGT GGCGGAGCCG GCATCATGGG CGCGGCCAAT
CACGGCGCGC ATGACATTGG AGCCAAAAAT ATCGGCCTCA ATATCGTTTT GCCCCATGAG
CAGGCCCCCA ATCAATACAT CACCCCCAAT CTCGCCTTCC AATTCCACTA CTTTGCTATT
CGCAAAATGC ATTTTCTGCT CCGGGCAAAG GGATTGGTGG TGTTCCCGGG GGGCTTTGGC
ACGTTGGACG AATTGTTCGA AGCCTTGACT CTGCTCCAGA CCCGCAAAAT CAAACCGATC
CCGGTCCTGC TGTTTTGCGA ACGGTTCTGG CGTCGGATCA TCAATTTCGA TGCCCTGGTG
GACGAAGGAA CGATTTCCCA GGACGATCTC GACTTTTTCC AATTTGTGGA AACCGCTGAA
GAGGCCTGGG AGATACTAGC CCAACACAAC GGGGTCGCAA ACGGCTCCAC TCCAGCGAAC
GACGCTTCGG CTGATTGA
 
Protein sequence
MPLKKYGRGP FPSATEDARA SQCPPDSPQC QSPSYRLAFQ DPEFMLRGEL RPVRLQLELL 
KPELILQEEG IESTIVIYGS ARIPDPETAR AHLETIQQAY EHNPEDRELQ NQLAIAHNAV
ATSHYYDESR KLGRLISQST PDNQLVVITG GGAGIMGAAN HGAHDIGAKN IGLNIVLPHE
QAPNQYITPN LAFQFHYFAI RKMHFLLRAK GLVVFPGGFG TLDELFEALT LLQTRKIKPI
PVLLFCERFW RRIINFDALV DEGTISQDDL DFFQFVETAE EAWEILAQHN GVANGSTPAN
DASAD
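
The two sequences above are displayed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ only in permitted start codons); the helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
first_blocks = "ATGCCCCTCA AAAAATACGG CCGCGGCCCA"
print(translate(first_blocks))    # MPLKKYGRGP
print(gc_percent("ATGCCCCTCA"))   # 60.0
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would allow the listed GC content and protein length to be checked in the same way.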