Gene Dret_0333 details

Gene Information

Locus tag: Dret_0333
Symbol:
ID: 8418137
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 417251
End bp: 418681
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 63%
IMG OID: 645036898
Product: protein of unknown function UPF0027
Protein accession: YP_003197213
Protein GI: 258404471
COG category: [S] Function unknown
COG ID: [COG1690] Uncharacterized conserved protein
TIGRFAM ID:
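
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 417251
end_bp = 418681
gene_length_bp = 1431
protein_length_aa = 476

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 1431 477 476
```

Under these assumptions the span matches the reported gene length, and 477 codons minus one stop codon gives the reported 476 amino acids.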


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACACAA GTCTTTTGCG GCGGATCAAT GCCTATTGTT GGGAAGTTCC CCAACAGGGC 
GCCATGCGGG TTCCAGGGCG TCTGTTCGGC AGCGGAGCCC TTATCCGCGA CCTCGACGAC
ATGGTTCTGG AGCAGGTCAG CCAGGTCGCA GCCCTGCCAG GGATTGTCCG TGCTTCGCTG
GCCATGCCCG ACGCCCACTG GGGGTACGGA TTCCCTATCG GGGGCGTGGC GGCCTTTGAT
CCCGACCGTG GAGGCATTAT TTCCGTGGGC GGGGTGGGGT ATGATATTTC CTGCGGTGTG
CGCACCTTGC GGACAGGATT GTCCAAAGAG GACGTCCTTT CTGTCTTACC GGAACTGGTG
GATCTTTTGG CTACGGTGAT TCCTGCGGGG GTCGGACGCG GCGGACAGCT CAGGCTGTCC
GGCTCCGAAC TCGACGATGT CCTGCGTCTG GGCGCGCGCT GGGCCGTGGC CCAAGGATAC
GGAGAATCGC GGGATCTGGA ATATATCGAG GATGGCGGTT GCTTGAGCGG GGCGAATCCG
GAGGTGGTTT CAGAAACGGC GAAAAAGCGG CAGCAGGATC AGGTCGGCAC CCTGGGCTCC
GGCAACCACT ATCTGGAAGT CCAGTATGTG GATGCGATCT ATGACCAAGC AGCGGCCTTT
GCCTTCGGGC TCAAAGAGGG CGGTGTCGTG GTTTCACTCC ATTGCGGCTC ACGGGCTCTG
GGGCACCAGA TCGGCACCGA TTATATCCAA ATCCTGGGGC GGGCGGCGCA AAAACGCAGC
CTGCACCTGC CCTCGCGGGA CCTGGTCTGC GCGCCCATCG ATTCCAAAGA GGGCCGCGAT
TATTACCAGG CTATGGCCTG CGGTGTGAAT TGCGCCCTGG CCAACCGGCA GGTGCTGGGG
CACCTGGTCC GGCAGGCCTT TGCCGAAATG TTTCCGTTGG CCCGTCTGGA ATTGCTCTAC
GATGTGAGCC ACAACACCTG CAAGGTCGAA GACCACGATG TCGATGGACT GCGCAAGTCC
CTGTATGTGC ACCGCAAGGG GGCGACCCGC TCTTTTGGGC CGGGGCGTCA GGAACTTCCA
GCCGCCTACC GCGGCGTGGG ACAACCGGTG CTGATCGGTG GGACTATGGG GACGGCGTCG
TATATTCTGG CCGGTACGGT GGAGAGCGAG GCCATGGCCT GGGGCTCGGC CTGCCACGGC
GCCGGACGGG CCATGAGCCG CAAGCAGGCG ACGAAGCGGT GGAAAGGCAA GAGTGTACTC
CACGAACTGG AACACCGCGG TATCCTGGTG CGTGCGGCCA GTCAGCGTGG TGTGGCTGAA
GAGGCCCCGG GGGCCTATAA AGACGTCGAC GAGGTCGTGG AATCTACCCA CCATGCCGGG
TTGGCGCGCA AGGTGGGCCG CTTGCGACCC CTGGCCTGTA TCAAGGGGTG A
 
Protein sequence
MDTSLLRRIN AYCWEVPQQG AMRVPGRLFG SGALIRDLDD MVLEQVSQVA ALPGIVRASL 
AMPDAHWGYG FPIGGVAAFD PDRGGIISVG GVGYDISCGV RTLRTGLSKE DVLSVLPELV
DLLATVIPAG VGRGGQLRLS GSELDDVLRL GARWAVAQGY GESRDLEYIE DGGCLSGANP
EVVSETAKKR QQDQVGTLGS GNHYLEVQYV DAIYDQAAAF AFGLKEGGVV VSLHCGSRAL
GHQIGTDYIQ ILGRAAQKRS LHLPSRDLVC APIDSKEGRD YYQAMACGVN CALANRQVLG
HLVRQAFAEM FPLARLELLY DVSHNTCKVE DHDVDGLRKS LYVHRKGATR SFGPGRQELP
AAYRGVGQPV LIGGTMGTAS YILAGTVESE AMAWGSACHG AGRAMSRKQA TKRWKGKSVL
HELEHRGILV RAASQRGVAE EAPGAYKDVD EVVESTHHAG LARKVGRLRP LACIKG
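
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first line of the gene sequence. This is a minimal sketch, not the tool that produced the record: the `translate` helper and the table construction are illustrative. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which this sketch ignores).

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 60 bases of the gene sequence, spaced as in the record.
first_bases = "ATGGACACAA GTCTTTTGCG GCGGATCAAT GCCTATTGTT GGGAAGTTCC CCAACAGGGC"
print(translate(first_bases))  # MDTSLLRRINAYCWEVPQQG
```

The result corresponds to the first two 10-residue groups of the protein sequence (`MDTSLLRRIN AYCWEVPQQG`). The same helper can be applied to the full gene sequence once its lines are joined.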