Gene Dret_2239 details


Gene Information

Locus tag: Dret_2239
Symbol:
ID: 8420097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2542621
End bp: 2543799
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 63%
IMG OID: 645038840
Product: S-adenosylhomocysteine deaminase
Protein accession: YP_003199101
Protein GI: 258406359
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
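
The start, end, gene length, and protein length reported above are mutually consistent, and the arithmetic can be checked directly. The sketch below assumes inclusive, 1-based coordinates and a CDS span that includes the stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids encoded by a CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


print(gene_length(2542621, 2543799))  # 1179
print(protein_length(1179))           # 392
```

Both values match the record: 1179 bp and 392 aa.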


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.975396
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTTAT CCCGAACATC CTCTGCAGAG ACACCGATCT TGCGGCGCTC GCGCACCTTG 
CTCACACGGA ATCCGGCCCG GCCGGTGATC GACAATGGAG CGATCCTGGA AGCCAACGGG
ATGATTTGCG CTGTCGGTAA ATACGGTGAG CTATCCCAGA CGCACCACGC TCGGCTGGTC
GACGAAGGAG AGACGGTGTT ACTGCCCGGG CTGATAAACG CCCACACCCA TACCGAACTC
AGCCACTTGC GGGACCGCAT CCAGCCCGGC GGCGGATTCG AGGACTGGGT CGCCCAATTG
CTTGCGCTGC CGGCCCGGGA TCTGGACACC AAGGCGGTCT CCAAGGCCAT AGACGAAATG
GCCGCCGGTC AGATCTGTGC CGTGGGAGAC ATCAGCGGGA ACAATCCGCA GGCCATGGCC
TCGTTATGGC GCCAAAGCGA TCTCCATGCC CTCCTTTGGG TCGAGCAGAT CGGGTTCGCC
CCTTTGCCCC AGGGCCAGCC GCGAGGTCTG CCTGATGCCG GGGCAACTCC AAAGCTGCGC
GTCGCTCCTG CCGGACACGC CCTGTATTCA ACAAGCCCGG AGACTTTGCG CCAGACCAAG
GCCTGGTGTC GCTCCCACGA ACTCCCCTTC AGCCTCCATC TCGCCGAACA TTCCGGTGAA
ATAGAATTTT TGACCACCGG CCGCGGCCCG TTTGGCGCCA TGCTGACCAA ACGGCTGGTC
CCCAAATCCT TTTCTCCGCC GGGCCTGCAC CCGGTTGCCT ATGCCGATTC GTTGGGACTT
CTGGATTCCT CCACCCTGGT GGTGCACGCT GTTCACCTCG ACCCGGGCCA CCCGCACCTT
ATCGCGCACC GCGGCTCTAC TGTCTGTCTG TGCCCGCGCA GCAACGATTT CATCGGCGTG
GGGCGCGCTT TGTGGGAAGC CCTGGACGCC GCAGGTGTTC CCTTGTGTCT GGGTACGGAC
AGTCTGGCCT CAAATTGGGA CCTCGATTTA TGGCAGGAGG CCTGGTATAT CGCCCAACAT
TGGTCCGGCA CACTGACCCT GGACAAGCTG GTCAGCTTCA TGACCACGAC CCCGGCACGC
ATCCTGGGCC TGCCACGTCT GGGCCGCCTT GCCCCCGGCA AACGGGCTGT CTATGCCCGG
CTGTCTCTGG AGAAGGCGAA CCGCCTTCCT CTTGCCTGA
 
Protein sequence
MPLSRTSSAE TPILRRSRTL LTRNPARPVI DNGAILEANG MICAVGKYGE LSQTHHARLV 
DEGETVLLPG LINAHTHTEL SHLRDRIQPG GGFEDWVAQL LALPARDLDT KAVSKAIDEM
AAGQICAVGD ISGNNPQAMA SLWRQSDLHA LLWVEQIGFA PLPQGQPRGL PDAGATPKLR
VAPAGHALYS TSPETLRQTK AWCRSHELPF SLHLAEHSGE IEFLTTGRGP FGAMLTKRLV
PKSFSPPGLH PVAYADSLGL LDSSTLVVHA VHLDPGHPHL IAHRGSTVCL CPRSNDFIGV
GRALWEALDA AGVPLCLGTD SLASNWDLDL WQEAWYIAQH WSGTLTLDKL VSFMTTTPAR
ILGLPRLGRL APGKRAVYAR LSLEKANRLP LA
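
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the code below strips the whitespace, computes GC content, and translates the DNA codon by codon. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which codons may act as start codons), and all helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq):
    """Percentage of bases that are G or C."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCTTTAT CCCGAACATC CTCTGCAGAG ACACCGATCT TGCGGCGCTC GCGCACCTTG"
dna = clean(first_line)
print(len(dna), translate(dna))  # 60 MPLSRTSSAETPILRRSRTL
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` gives a value that can be compared with the GC content reported in the record.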