Gene RoseRS_2053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2053 
Symbol 
ID5209015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2543890 
End bp2545824 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content61% 
IMG OID640595658 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_001276387 
Protein GI148656182 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.197732 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCATT CGCAGATCGC GCCATACGGT TCGTGGCGCT CGCCGATCAC TGCTGCGCTG 
GTGGCAACAT CGGGCGTTTC GTTGAACGAC GTTGCTCTCG ACGGCGACGA CATCTACTGG
CTCGAAGGGC GACCGGCTGA AGGCGGTCGG GTGGTCATTG TGCGGCGCGC CGCCAATGGA
ACGATTGCCG ATGTGACCCC GCCCGGCTTC AACGTGCGCA CCCGCGTCCA CGAATATGGC
GGCGCACCAT ACACAGTTGA TCAGGGGGTC GTCTATTTCA GTAACTTTGC CGATCACCGG
GTCTACCGCC AGCAAGCGGG TGAAACGCCA ACCCCCGTTA CTCCTGAAGC GCCGTTGCGC
TATGCGGATA TGACTGTCGA TCGCGGGCGG AACCGTCTGA TCTGCGTGCG CGAGGATCAT
TCCGGCGATG GCGAGGCGGT CAATGCGATT GTTGCCGTTC CACTCGACGG AACAACCGGG
CAGCAGGTTC TGGTCGCAGG TTCGGATTTC TATGCCCATC CGCGTCTCAG CCCGGATGGA
ACCTGGCTGG CGTGGCTCTC CTGGAACCAC CCGAACATGC CGTGGGACGC TGCCGAACTG
TGGGTTGCGC CATTGCGCGA GGATGGCTCA CCGGGAACTG CCGAACGGAT CGCGGGTGGA
CCGGACGATG CCGCTTTTCA GCCTGAGTGG GGACCCGACG GCGCGCTCTA TTTCGTTGCC
GAACGTACCG GATGGTGGAA CCTCTACCGC TGGCACGCTG GCAATGTCGA GGCGCTCTGC
CCGATGGAAG CCGAGTTCGG TCTGCCGCTC TGGGTCTTCA GTGCACGCAC CTATGCCGTC
GAATCGGCGG GACGACTGGT ATGCACGTAT ATCGAGCGCG GTGAGCAGAA GATGGCGACG
CTCGACACGC AGAGCGGAAT GCTGACGCCG CTCACTCTGC CATTCAGCGA CTTTGGTTTC
AGCGGTCCTC GCGCCGCCAA TGGCAGGGTT GTCTTCATCG GCGCTTCATC GAACACGCCC
TCGACGCTGG TAATGCTCGA TCTGGCGAGC GATGCACTGA TGACTATTCG CCGTTCGATG
GATATCCAGA TCGATCCCGG CTATATCTCG ACGCCGCAGG TGGTGGAATT TCCCACCGAA
GGCGGCGTGA CTGCGTTCGG CTTCTATTAT CCGCCGCACA ACCGCGACTT TCGTGCGCCG
GAAGGTGAAA AGCCGCCTTT GCTCGTTTTG AGCCATGGCG GACCGACCGG CGCAACATCG
GCGTCGTTCG ATGTCGGCAT TCAGTTCTGG ACGAGTCGCG GCATTGCGGT AATGGATGTC
AACTACGGCG GCAGCACCGG GTTCGGTCGT GCGTACCGTC AGCGTCTCGA CGGACGCTGG
GGAGTCGTTG ATGTGGATGA TTGCTGCAAC GCAGCAACAT ACCTGGCGGC GCAGGGTCTC
GCCGATCCAG CGCGATTGAT CATTGCTGGC GGCAGCGCCG GCGGCTACAC GACCCTGGCG
GCGCTCACCT TCCGCCGGGT GTTCAAGGCT GGCGCCAGTT ACTACGGCGT CAGCGACCTG
GAGGCGCTGG CGCGTGATAC GCACAAATTC GAGTCGCGTT ACCTCGACCG CCTGATCGGA
CCGTATCCTG AACGCATCGA TCTCTACCAC GCGCGTTCGC CGATCCACCA TATCGAACAA
CTCAACTGCC CGGTCATCTT TCTGCAAGGA CTGGAAGACA GAGTCGTGCC GCCGGATCAG
TCCGAACGAA TGGCGGCAGC CCTGCGCACA AAAGGGATAC CGGTTGCATA CCTGGCATTC
GAAGGCGAAC AGCATGGCTT CCGCAAGGCA GAGACGATCA TCCGGGCGCT GGAAGCCGAG
TTGTACTTCT ACGCGCGCAT CCTGGGATTT GAACCCGCCG ATCCGGTCGA ACCGATTCAG
ATCGACAATC TTTAG
 
Protein sequence
MSHSQIAPYG SWRSPITAAL VATSGVSLND VALDGDDIYW LEGRPAEGGR VVIVRRAANG 
TIADVTPPGF NVRTRVHEYG GAPYTVDQGV VYFSNFADHR VYRQQAGETP TPVTPEAPLR
YADMTVDRGR NRLICVREDH SGDGEAVNAI VAVPLDGTTG QQVLVAGSDF YAHPRLSPDG
TWLAWLSWNH PNMPWDAAEL WVAPLREDGS PGTAERIAGG PDDAAFQPEW GPDGALYFVA
ERTGWWNLYR WHAGNVEALC PMEAEFGLPL WVFSARTYAV ESAGRLVCTY IERGEQKMAT
LDTQSGMLTP LTLPFSDFGF SGPRAANGRV VFIGASSNTP STLVMLDLAS DALMTIRRSM
DIQIDPGYIS TPQVVEFPTE GGVTAFGFYY PPHNRDFRAP EGEKPPLLVL SHGGPTGATS
ASFDVGIQFW TSRGIAVMDV NYGGSTGFGR AYRQRLDGRW GVVDVDDCCN AATYLAAQGL
ADPARLIIAG GSAGGYTTLA ALTFRRVFKA GASYYGVSDL EALARDTHKF ESRYLDRLIG
PYPERIDLYH ARSPIHHIEQ LNCPVIFLQG LEDRVVPPDQ SERMAAALRT KGIPVAYLAF
EGEQHGFRKA ETIIRALEAE LYFYARILGF EPADPVEPIQ IDNL