Gene Dret_0482 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0482 
Symbol 
ID8418288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp586982 
End bp588232 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content61% 
IMG OID645037044 
Productamidohydrolase 
Protein accessionYP_003197357 
Protein GI258404615 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.338769 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTTC TTCTCCGCAA CGCCGTGCTC GGCGAAACCC CTACTGACCT GTTCATCGAC 
CGCGGCGTCT TTCAACGCAT CGGCCCAGAT CTGGACATCT CGGCAGACAA AACGATCGAC
GCCGCTGGCA AGGCCATTGT GCCTCCGCTG GTCAACGGCC ACACCCACGC GGCCATGACC
CTGCTCCGCG GCTACGCGGA CGACATGGAA TTGCACACCT GGCTCACGGA ACATATCTGG
CCCCTGGAGG CCCGGCTGAG CGAGGAGGAT GTCTATGTCG GATCACTCCT GGCCTGCCTG
GAGATGATCA AGTCTGGAAC GCTGTTTTTC AATGATATGT ACTGGCATTT CGAGGGCACC
GCCAGGGCGG TGACGGAGAT GGGTCTGCGG GCGGCCCTTT CGTCGGTCTT CATTGATTTC
GGGGATGCCC GGACGGCTGA GGACAAACAG CGACGCTGCC TGGACCTCCT CGCGACCTAC
AAAGAAGTTG ACCCGCGCCT GCAATGCGCT CTCGGCCCCC ACGCCGTGTA CACGGTGAGC
CGGAAATCCT TGGAATGGAT CAGAGATATA GCTGAGGAAC ACGATCTGCT GATCCATATG
CATGTGGCCG AGACCCGCAA AGAGGTCGAG GACTGCATGG CCGAGCACGG CAAACGCCCG
GTCGCCTATC TGGACGAACT CGGCCTGCTT TCCCCGCGTC TTGTCGCTTG CCATGCAGTC
TGGCTCACCC CTGAAGAAAT GGAGCTCCTG GCCAAGCGCG GCGTGAACAT CGTGCACAAT
CCGGTCTCGA ACATGAAGCT CTGCTCCGGA ACCAGTCCCG TGGAATCCAT GCGCCAACAT
GGCCTGCGGA TCGGACTCGG CACGGACGGC TGCTCATCCA ACAACGCCCT GGACATGTTC
AGCGAGATGA AATCCGCGGC GTTGGCAGCC AAAGTGGCGA CCGGATCACC CAAAGCCCTC
CCGGCTGATG CCGTGTGGGA GATGGCCACT GCCCAGGGGG CTGCCATATT CAATCTCAAC
CACGGGATAA CCGAAGGGGC CTGGGCCGAC TGCCTGCTGG TGGACCTCGA TCAACCGGCA
ATGGTGCCCT GCTACAACCT GACCTCCAAT CTGGTCTATG CCGCTTCCGG AGGCTGCGTG
GACACAGCCA TCTGTAACGG CGAAGTTCTG ATGCAAAAGC GCCATATCCC AGGAGAAGAG
GAAATTATCG CCCGGGCCCG GGCCTGTGCC CGGCGTCTTG TTGCTGGGTG A
 
Protein sequence
MSLLLRNAVL GETPTDLFID RGVFQRIGPD LDISADKTID AAGKAIVPPL VNGHTHAAMT 
LLRGYADDME LHTWLTEHIW PLEARLSEED VYVGSLLACL EMIKSGTLFF NDMYWHFEGT
ARAVTEMGLR AALSSVFIDF GDARTAEDKQ RRCLDLLATY KEVDPRLQCA LGPHAVYTVS
RKSLEWIRDI AEEHDLLIHM HVAETRKEVE DCMAEHGKRP VAYLDELGLL SPRLVACHAV
WLTPEEMELL AKRGVNIVHN PVSNMKLCSG TSPVESMRQH GLRIGLGTDG CSSNNALDMF
SEMKSAALAA KVATGSPKAL PADAVWEMAT AQGAAIFNLN HGITEGAWAD CLLVDLDQPA
MVPCYNLTSN LVYAASGGCV DTAICNGEVL MQKRHIPGEE EIIARARACA RRLVAG