Gene Dret_1986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1986 
Symbol 
ID8419831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2279869 
End bp2281074 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content60% 
IMG OID645038574 
Productprotein of unknown function UPF0052 and CofD 
Protein accessionYP_003198848 
Protein GI258406106 
COG category[S] Function unknown 
COG ID[COG0391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01826] conserved hypothetical protein, cofD-related 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCAA CCGGCACCCG CATCACTGTA ACCAAAAGCG CCCTGCTCCC GGACCCGGTC 
AAGCTCGCCC GGTACGCCAA GGCGCCGGAA TTGGGTCCCA AAATCCTTTT TTTCAGCGGC
GGCACCGCCT TGCGGCCCTT GAGCCAAAAG CTCATCGAAT TCACGCACAA CTCCATCCAT
TTCATCACCC CGTTTGACTC AGGGGGCAGC TCTGCTGTCC TTCGCAAGGC GTTTGCCATG
CCGGCTATCG GCGATATCCG CAACCGGCTC ATGGCTCTGG CTGACCAAAG CCTGCACGGT
GCCCCTGAGA TCTATGAACT CTTTGCACTG AGGCTGCCCA AAGAGGCCGA TCCCGGTGCC
CTGAACGATC TTTTGCAATC CCTGATCCGG GGCAAACATC CGCTGGTGGC CGCCATACCG
GATCCAATGC GCAAGATCAT CCGCAACCAC CTTGGACGCT TCGCCGAGGC CATGCCCACA
GATTTCGATT TGCGCGGCGC GAGTATCGGC AATCTCATCC TCACCGCCGG GTATCTCGAT
TATCGCCGCC AACTCGACCC GGTCATCTTT CTCTTCGCCA ATCTCGTCCG GGTCCGCGGT
GTCGTCCGTC CGGTCCTCAA CAAAGATCTC CAACTGGCCG TCCGCCTTGA CGACGGGAGC
ACGGTCGTCG GGCAGCACCG CATCACCGGC AAGGAAACCG CTCCACTGAA CACCAAAATC
CGTTCGGCCT GGATTTGCGC CAGTTCCGAA GAGCCGGCCC CATTGCGGGT TCCGGTCCGC
AACAAGGTCA TGGAACAGAT CCAACAGGCT GAGCTCATCT GCTATCCCAT CGGCAGTTTC
TATTCCAGTC TGATCGCCAA CCTCCTGCCC GGGGGCATAG GACGGGCCAT TGCCTCTACT
CCCTGCCCCA AAGTTTTCAT TCCCAACACG AGCGGGGATC CTGAACTCCA CGGCCAGGAC
GTCATGGAGC AGGTTCGGAC CATCCTTTAT ACCTTGCAAC GGGATTTTCC TGAACCACTG
CCGACGCGCG ACCTGCTGAA CTTTGTGCTC ATCGATCACG ATTTGACGCT CTATCCCGGC
GGCGTTCCGC GGCACTCCCT GGAAAAAATG GGGTTGACCG TCATCGCCGG CGACCTGACC
ACAGAACAAA GCCGCCCGCT TCTCGATGCC ACACGCCTGA CAGAGGCGCT CTTATCGCTG
ACCTGA
 
Protein sequence
MSATGTRITV TKSALLPDPV KLARYAKAPE LGPKILFFSG GTALRPLSQK LIEFTHNSIH 
FITPFDSGGS SAVLRKAFAM PAIGDIRNRL MALADQSLHG APEIYELFAL RLPKEADPGA
LNDLLQSLIR GKHPLVAAIP DPMRKIIRNH LGRFAEAMPT DFDLRGASIG NLILTAGYLD
YRRQLDPVIF LFANLVRVRG VVRPVLNKDL QLAVRLDDGS TVVGQHRITG KETAPLNTKI
RSAWICASSE EPAPLRVPVR NKVMEQIQQA ELICYPIGSF YSSLIANLLP GGIGRAIAST
PCPKVFIPNT SGDPELHGQD VMEQVRTILY TLQRDFPEPL PTRDLLNFVL IDHDLTLYPG
GVPRHSLEKM GLTVIAGDLT TEQSRPLLDA TRLTEALLSL T