Gene Rcas_0159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0159 
Symbol 
ID5537619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp192826 
End bp194088 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content59% 
IMG OID640892323 
ProductS-adenosyl-L-homocysteine hydrolase 
Protein accessionYP_001430312 
Protein GI156740183 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0499] S-adenosylhomocysteine hydrolase 
TIGRFAM ID[TIGR00936] adenosylhomocysteinase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAAGA ATTTCGATAT TAAGGATGCC AGCCTGGCAG AGTATGGTCG CCGCCGGATC 
GATTGGGCTG AAGCCGAAAT GCCGGTGCTG CGCCAGATTC GTGAGCGCTT TGCCAGCGAG
CGTCCACTCA AGGGCGTGCG TCTGTCTGCC TGCCTCCACG TGACGGCAGA AACTGCCAAT
CTGATGCGAA CGCTCGCCGC CGGCGGCGCC GACGTGGTGC TCTGCGCGTC CAATCCGCTC
TCCACGCAGG ACGATGTCGC CGCTTCGCTC GTCGTGCACG ACGAAATCCC GGTCTACGCT
ATCAAGGGCG AAGACCACGA AACGTATTAC CAACATATCA CCGCCGCACT CGACCACAAT
CCGCACATCA CGATGGACGA CGGCGCCGAC CTGGTGACCG GCGTACTCAA ACAGCGTCCT
GACTTAATCC CCGCGATGCT GGGCAGCACC GAGGAAACGA CAACCGGCGT CATCCGCCTC
AAGGCGATGG CAGCCGATGG CGTGCTGAAG TTTCCGGTCA TCGCCGTCAA CGATAGCGAT
ACCAAGCATC TGTTCGATAA TCGTTACGGC ACCGGGCAGA GCACGCTCGA TGGCATTCTG
CGCGCTACGA ATGTGCTACT TGCCGGAAAA ACCTTTGTGG TCGGCGGCTA TGGCTACTGC
TCACGCGGCA TTGCCGAACG CGCACGGGGC ATGGGCGCGA ATGTCATTGT GACGGAAGTC
AATCCAATCC GGGCACTGGA AGCCGCGATG GATGGCTACC GCGTCATGCC GATGCGTGAA
GCCGTGAAGG TTGCCGATTT CGTCGTCACG GCGACCGGCA ACAAGAATGT CCTCGATGCG
GAAGACTTCG CCGTCATGAA AGACGGGTGC ATTGTCGCCA ACTCGGGGCA TTTCAACGTC
GAGATCAATA TTCCCGATCT GGAAGCGCTG TCGGTTGAGA AGCGCCAACC ACGCGCATTT
GTCGATCAGT ACATCCTCAA GGATGGACGC CGGATCAACC TGCTCGGTGA AGGGCGCCTG
ATCAATCTGG CAAGCGCCGA AGGGCACCCC AGCGCCGTTA TGGATATGTC GTTCGCCAAT
CAAGCCCTGG CAAGCGAGTA TCTGTTGACC CACAAAGGGA AACTGCCGAA CGGCGTGCAT
GCGCTGCCGA AAGAAGTGGA TCGCGAAATT GCGTCGCTCA AACTCAAGTC GATGGGTATC
ATGATCGATA CGCTCACGCC TGAACAGGCG AAATACCTGG CTTCCTGGGA AGAAGGAACG
TAA
 
Protein sequence
MGKNFDIKDA SLAEYGRRRI DWAEAEMPVL RQIRERFASE RPLKGVRLSA CLHVTAETAN 
LMRTLAAGGA DVVLCASNPL STQDDVAASL VVHDEIPVYA IKGEDHETYY QHITAALDHN
PHITMDDGAD LVTGVLKQRP DLIPAMLGST EETTTGVIRL KAMAADGVLK FPVIAVNDSD
TKHLFDNRYG TGQSTLDGIL RATNVLLAGK TFVVGGYGYC SRGIAERARG MGANVIVTEV
NPIRALEAAM DGYRVMPMRE AVKVADFVVT ATGNKNVLDA EDFAVMKDGC IVANSGHFNV
EINIPDLEAL SVEKRQPRAF VDQYILKDGR RINLLGEGRL INLASAEGHP SAVMDMSFAN
QALASEYLLT HKGKLPNGVH ALPKEVDREI ASLKLKSMGI MIDTLTPEQA KYLASWEEGT