Gene Dgeo_0518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_0518 
Symbol 
ID4057754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp545766 
End bp547097 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content68% 
IMG OID641229530 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_603989 
Protein GI94984625 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.4212 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCCA ATACGCTGCA CTTCGAGACG CTCCAGGTCC ACGCCGGACA GCATCCCGAC 
CCTGCGACCG GTGCCCAGGC CGTGCCGATC TACGCCACCA ACGCCTATGT CTTTGAGTCG
CCGGAACACG CCGCCGACCT GTTTGGCCTG CGGGCCTTCG GCAACATCTA CAGCCGGATC
ATGAATCCCA CCAACGCCGT GCTGGAGGAA CGCATTGCGG CGCTAGAGGG CGGCGTAGGA
GCGCTGGCGG TGGCGAGCGG GCACGCCGCG CAGTTCTTAG CCATCACCAC TGTCGCGCAG
GCGGGGGACA ACATCGTCTC CACGCCCAAC CTCTACGGTG GTACGGTCAA CCAGTTCCGC
GTTACCCTGC GGCGGCTGGG CATCGAGGTC CGCTTTACCA GCAAGGACGA GCGCCCGGAG
GAATTCGCGG CGCTGATCGA CGACCGCACG CGCGCCGTGT ATCTGGAAAC ACTCGGCAAC
CCGGCGCTGA ATGTCCCCGA TTTTGAGGGC ATCGCGGAGG TGGCCCACGC GCGGGGGGTG
GCCGTGTTCG TGGACAACAC CTTCGGGGCG GGCGGGTACT ACTGCCAGCC CCTCCGCCAC
GGCGCGGACG TGGTGCTGCA TTCGGCAAGC AAGTGGATCG GTGGGCACGG CAACGGCATC
GGTGGTCTTC TCGTGGACGG CGGAACCTTT GACTGGGGCA ATGGCCGCTA TCCCCTTCTC
ACCGAACCCA GCCCCTCCTA CCACGGCCTG AGCTTCTGGG AGGCGTTTGG CGAGGGGAAC
GCGCTGGGCC TGCCCAACAT CGCCTTTATC ACCCGCGCCC GCACTGAGGG GCTGCGCGAC
CTGGGGCCAA CGCTCGCGCC GCAGCAGGCC TGGCAGTTCC TGCAAGGGGT GGAAACCCTC
TCGCTGCGCG CCGAGCGGCA CGCGCAAAAC GCGCTCGCGC TGGCCTCCTG GCTAAGTGGC
CACCCGGACG TGTCACGCGT CACCTATCCG GGCCTGAGCA ACCACCCGCA CTACGACCGC
GCCCAGACGT ATCTGCCGCG CGGGGCGGGG GCCGTGCTGA CCTTTGAGCT GCGCGGGGGA
CGGGCGGCGG GCGAGGCATT TATTGGCGCG GTGCGGCTCG CACAACATGT CGCCAATGTG
GGCGACACCC GCACGCTGGT GATTCACCCC GCCAGCACCA CCCACTCCCA GCTGGACGAG
GCGGCGCAGG CGGCCGCGGG CGTGACGCCG GGACTGGTGC GCGTGTCGGT GGGGATCGAG
CACATTGACG ACATCCGCGA GGACTTTGCG CAGGCACTGG CCACCGCGCT GGTGGACGCG
GAGGGCGCAT GA
 
Protein sequence
MASNTLHFET LQVHAGQHPD PATGAQAVPI YATNAYVFES PEHAADLFGL RAFGNIYSRI 
MNPTNAVLEE RIAALEGGVG ALAVASGHAA QFLAITTVAQ AGDNIVSTPN LYGGTVNQFR
VTLRRLGIEV RFTSKDERPE EFAALIDDRT RAVYLETLGN PALNVPDFEG IAEVAHARGV
AVFVDNTFGA GGYYCQPLRH GADVVLHSAS KWIGGHGNGI GGLLVDGGTF DWGNGRYPLL
TEPSPSYHGL SFWEAFGEGN ALGLPNIAFI TRARTEGLRD LGPTLAPQQA WQFLQGVETL
SLRAERHAQN ALALASWLSG HPDVSRVTYP GLSNHPHYDR AQTYLPRGAG AVLTFELRGG
RAAGEAFIGA VRLAQHVANV GDTRTLVIHP ASTTHSQLDE AAQAAAGVTP GLVRVSVGIE
HIDDIREDFA QALATALVDA EGA