Gene RoseRS_1741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1741 
Symbol 
ID5208698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2148601 
End bp2149788 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content63% 
IMG OID640595347 
Productamidohydrolase 
Protein accessionYP_001276081 
Protein GI148655876 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.452444 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGACA AAGCTCAAGC CATCGCTCCT GAAATCATTC GTCTGCGTCG TGAAATCCAC 
GCGCATCCAG AGCTTGCGTT CCAGGAAGTG CGAACCGCGC AACTGGTTGT CGAAACATTG
CGCGAAATCG GCGGCATCGA CATTCGTACC GGCGTCGGCA AAACCGGTGT CGTGGGACAA
CTCGGTGACG GCAACGGACC GACGATCGGC ATCCGTGCCG ATATGGATGC CCTGCCGATC
GATGAGGCGA CCGGTTTGCC GTTTGCTTCC CGGAATCCAG GGGTGATGCA CGCATGTGGT
CACGATGCCC ACACCGCGAT CCTGCTCGGC GTGGCGCACC TGCTCCGGCA GGAGTTTGCC
GCCGGCAACC TGCACGGCAC CGTGCGCTTC CTTTTTCAGC CAGCCGAAGA AGCGCAGGAC
GACGAGGGTC TCAGCGGTGC GCCGCGCATG ATCAACGACG GCGCGCTCGA TGGGGTCGAT
CACGTGATTG CCCTGCACGT CGACTCGGGA CTGCCGGTTG GGAAAATCAC CATCCGTGAC
GGAGCGAGTT CGGCAGCGGT CGATACCTTT CGCGGGTGGA TCACGGGGAG TGGCGGGCAT
GGCGCCTACC CGCACCTGGG CACGGATCCG CTCTGGATGC TGTTGCCGGT GATGCAGGCG
TTGCACGGGA TTGTTGCGCG CCGGATCAAC CCGATGCACC CGGCGGTCGT CAGCCTTGGC
ATTGTGCGGG GCGGCACAGC GTCGAACGTT CTTCCCGCCG GGGTGTATCT GGAAGGAACA
TTGCGCAGTT TCGATCCGCA GGTGCGCGAG CAGTTGATCG TCGAAGTCGA ACGCGCATTC
GCGGTTGCAC GCGCCGTAGG CGGCGATTAT CGGCTGGAGA TCGAGCGCGG CTATCCTGCC
GGGCACAACG ATGCCACCGT CAGCGAATGG ATCGCCGCCA CGACCGCCGA TCTGATCGGC
GCCGATGCAA TCGACCGGAG TCGCAGCGGG ATGGGGGCGG AAGATTTCGC CTATATGACC
CAGAAAGCGC CTGGTGCGAT GTTCATGCTC GGCGCTGCGA TTGACGATGG TGTGAATCGC
GGACATCACA CCCCGATCTT CGACATCGAT GAGCGCGCGC TGCCGATCGG CGCGGCAATT
CTCGCCGAAA CAGCGCGACG CTATCTGGCA GGCGAAGTGA AACGCTAG
 
Protein sequence
MLDKAQAIAP EIIRLRREIH AHPELAFQEV RTAQLVVETL REIGGIDIRT GVGKTGVVGQ 
LGDGNGPTIG IRADMDALPI DEATGLPFAS RNPGVMHACG HDAHTAILLG VAHLLRQEFA
AGNLHGTVRF LFQPAEEAQD DEGLSGAPRM INDGALDGVD HVIALHVDSG LPVGKITIRD
GASSAAVDTF RGWITGSGGH GAYPHLGTDP LWMLLPVMQA LHGIVARRIN PMHPAVVSLG
IVRGGTASNV LPAGVYLEGT LRSFDPQVRE QLIVEVERAF AVARAVGGDY RLEIERGYPA
GHNDATVSEW IAATTADLIG ADAIDRSRSG MGAEDFAYMT QKAPGAMFML GAAIDDGVNR
GHHTPIFDID ERALPIGAAI LAETARRYLA GEVKR