Gene RoseRS_1549 details


Gene Information

Locus tag: RoseRS_1549
Symbol:
ID: 5208504
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 1894197
End bp: 1895738
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 60%
IMG OID: 640595155
Product: carboxypeptidase Taq
Protein accession: YP_001275891
Protein GI: 148655686
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2317] Zn-dependent carboxypeptidase
TIGRFAM ID:
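
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1894197
END_BP = 1895738
GENE_LENGTH_BP = 1542
PROTEIN_LENGTH_AA = 513


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codons in the CDS minus one terminal stop codon (assumed to be present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the listed gene length (1542 bp) and protein length (513 aa) agree with the listed coordinates.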


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.901357
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATATAC CTCCGCAACT AACCGAACTC AAGACGCGTC TCCACGAAAT CTACGATCTG 
GAAATGGCTG CCGCGCTGCT GAACTGGGAC CAGACGACCT ACATGCCGCC AGGTGGCGCG
ACTGCGCGTG GACGTCAACT GGCAACCCTG GGACGGATCG CGCACGAGAA GCGAACTGAC
GCGCAGATCG GGCGTTTGCT CGATGCGTTG CGCCCCTATG AAGAGTCGCT CCCGCCCGAT
TCGCCCGATG CGGCGCTCAT TCGGGTTGCA CGGCGCGACT ATGAGCGCGC GACCCGGATT
CCTGCCGCAT TCACGGCTGA ACTGTACCAG CACATGGCTG TCAGTTACGA TGTCTGGTCG
CGCGCCCGAC CGGCGAATGA TATTGCGGCG GTGCTGCCCT ACCTGGAACG CACGCTCGAT
CTCAGTCGGC GCTTCGCGGA ATTCTTTCCA GGCTACGACC ATATCGCCGA TCCGCTGATC
GATATGGCGG ATTATGGCAT GCGCACTGCG ACGATCAAAC AGGTTTTTGC CGGACTCCGT
CAGGGATTGT TGCCGCTGGT CGAGCAGGTG ACCGCTCAAC CGCCTGTCGA TGATTCATGC
CTGCGCCAGT TCTTTCCCGA AGCGCAACAA CTGGCATTCG GCGTTGAGGT CATTACCGCC
CTGGGGTACG ACTTCACCCG TGGACGACAG GATAAGACGC TGCATCCGTT CATGACCAAA
TTCTCGCTGA ACGATGTGCG GATCACGACC CGCTTCGATG AGTACGATCT CGGATCGGCG
CTGTTCAGCA CCATTCACGA AGCAGGGCAC GCGATGTACG AGCAGGGGAT CGCGCTCGAA
TTCGAGGGTA CGCCCCTCGC TTCCGGCACA TCTGCCGGGA TGCACGAAAG TCAGTCGCGC
CTGTGGGAGA ATATCGTCGG GCGCAGCCTG CCGTTCTGGG AGCACTTCTA TCCGCGTCTC
CAGGCGACAT TCCCCGACCA GTTGGGGCAC GTGTCGCTTG AAACATTCTA CCGCGCCATC
AATAAAGTGC AGCGTTCGCT CATTCGCACC GAAGCGGACG AAGTCACCTA CAACCTGCAC
GTCATTCTGC GCTTCGACCT GGAACTGGCG CTGCTGGAAG GAACGCTCGC TGTGCGCGAC
CTGCCCGAAG CCTGGCACGA ACGCTATCGC AGCGATCTGG GTGTGACGCC GCCGGATGAC
CGCGACGGGG TGTTGCAGGA TGTTCACTGG TACGGCGGTC TGATTGGCGG GGCATTCCAG
GGGTATACCC TGGGGAATAT CATGAGTGCG CAATTGTATG AAGCGGCATT GCGTGACCAT
CCCGACATCC CGCAGCAGAT CGGGCGCGGT GAGTTCGGTA CGCTGCGTGA ATGGATGCGC
GAACACGTCT ACCGCTATGG GCGCGCTCTC GACGCTGATG ACATCCTGCG CCGCGCGACC
GGCAGGTCGC TCGATGTGCA GCCGTACCTG GCGTATCTGT GGCGCAAATA TGGCGAGATA
TACGGCATCG CGTACATTCC CTTCCATCAG GTGGTCGCAT AG
 
Protein sequence
MHIPPQLTEL KTRLHEIYDL EMAAALLNWD QTTYMPPGGA TARGRQLATL GRIAHEKRTD 
AQIGRLLDAL RPYEESLPPD SPDAALIRVA RRDYERATRI PAAFTAELYQ HMAVSYDVWS
RARPANDIAA VLPYLERTLD LSRRFAEFFP GYDHIADPLI DMADYGMRTA TIKQVFAGLR
QGLLPLVEQV TAQPPVDDSC LRQFFPEAQQ LAFGVEVITA LGYDFTRGRQ DKTLHPFMTK
FSLNDVRITT RFDEYDLGSA LFSTIHEAGH AMYEQGIALE FEGTPLASGT SAGMHESQSR
LWENIVGRSL PFWEHFYPRL QATFPDQLGH VSLETFYRAI NKVQRSLIRT EADEVTYNLH
VILRFDLELA LLEGTLAVRD LPEAWHERYR SDLGVTPPDD RDGVLQDVHW YGGLIGGAFQ
GYTLGNIMSA QLYEAALRDH PDIPQQIGRG EFGTLREWMR EHVYRYGRAL DADDILRRAT
GRSLDVQPYL AYLWRKYGEI YGIAYIPFHQ VVA
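
The gene and protein listings above are printed in blocks of ten characters. As a rough sketch (not part of the original record), the snippet below joins such blocks, translates the DNA, and computes a GC fraction. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code; the two tables differ in which codons may act as start codons, which this sketch ignores. The helper names `join_blocks`, `translate`, and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
import itertools

# Codon-to-amino-acid assignments in TCAG order. Table 11 uses the same
# assignments as the standard code; alternative start codons are not handled.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence listed above.
    first_line = ("ATGCATATAC CTCCGCAACT AACCGAACTC "
                  "AAGACGCGTC TCCACGAAAT CTACGATCTG")
    print(translate(join_blocks(first_line)))  # MHIPPQLTELKTRLHEIYDL
```

The 20 residues printed match the first two blocks of the protein sequence listed above. Running the same functions on the full gene sequence would be the natural next check.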