Gene RoseRS_1998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1998 
Symbol 
ID5208960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2476726 
End bp2478537 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content61% 
IMG OID640595605 
ProductpepF/M3 family oligoendopeptidase 
Protein accessionYP_001276334 
Protein GI148656129 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0556432 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTAT CAGTCAGTGA ACCACTGCCT CATTGGGATA TGACGGTCGT CTATCCCGCG 
CTCGATTCGC CGGAGTTCAA CCACGACCTT ACGGCGACAC GTCAGGCGTT CGACACGATG
ACCTCGCTGT TCGACCGTCT GGGAATTAAC AGGCGCGACC GGCAATCGAC CGATGACGCT
GCTGTCGCCG CATTCGAAAC GGTTGTTCCG GCGCTGAACG AACTGCTCGA ACAGTTCTCC
ACCCAGCGCG CCTATGTCTA CAGTTTCGTG GCAACCGACT CACGGAACGA TCAGGCGCAG
GCAACGTTCA GCATGCTGAT GCGGGAAGGG GTTCGGCTAA CAAAACTGTT GCGGCGATTG
ACCGCCTGGC TTGGCGGGTT GGATGTCGAA ACGTTGATCC AGCGCTCGAC TATCGCGCGT
GATCACGCCT ATCTGGTGCG CCGCGCCGCC GAAGAGGCGC GTCACCTGAT GTCGCCTGCC
GAAGAGGAAC TTGCCGCAGA ACTGGACCTG TCGGGCGGTA TTGCCTGGGC ACGTATGTAC
CAGAATCTGA CGTCGCAGAT ACTTGTGCCC ATCGAACGCG AGGGGCAGAC AGTCGAACTG
CCGATGAGTC AGGTGCGCAA TCTGGCGCGC GACCCGGACC GGGCAGTGCG CCGCAGCGCG
CACGAGGCGG AACTCGCAGC CTGGGAACGC GCAGCGTTGC CGCTCGCATC CGCGCTCAAC
AGCATCAAAG GGCAGGTGCT CACCCTCAGT CGCCGTCGTC GGTGGGAATC GCCGCTCGAG
GCATCGCTGT TCGACAATGG CATCGACCGC GCCACGCTCG ATGCCATGAT GACCACAGCG
CGCGAGTTCT TTCCCGATTT TCGGCGCTAC CTGCGCGCCA AAGCCAGGCT CCTCGGTCTT
GAACGCCTCG CCTGGTACGA TCTCTTCGCC CCGGTCGGCA GCGGTGGACG CAGCTGGCGC
TTCAGCGATG CAGAGGCGTT TATCGTGGCG CAGTTCACGC GCTACTCGAC GCGCATGGGC
GATTTTGCTG CGCGCGCATT CCGCGAACGC TGGATCGACG CCGAACCGCG CGCAGGAAAA
GTCGGCGGCG CGTTCTGTAT GTCGCTCCGC CGCGATGAGT CGCGCATTCT GCTGAACCAC
GATCCCACAG CGGACAGCAT GTTTACGCTG GCGCACGAAC TCGGGCATGG CTACCACAAC
CTCAACCTGG CGCAGCAGAC GATGCTCAAC CGTGATACGC CGATGACCCT GGCAGAAACG
GCGAGCATTT TCTGTGAGAC GATTGTGCGC AATGCGGCGC TCCAGGACGC CAGCCGCGAT
GAGACGCTCG AAATCCTCGA GGCGTTCCTC AGCGGCGCGT GTCAGGTGGT GGTTGATATT
ACGTCACGCT TCCTGTTTGA AACCGCGCTG TTCGAACAAC GCGCCACGCG CGATCTGTCG
GTCGCCGAGT TGTGCGTGCT GATGATCGAC GCGCAGAAAC AAACGTATGG CGATGCGCTC
GATGAACAAA CATTGCATCC ATTTATGTGG GCGGTCAAGG GGCACTACTA CAGCAGCGGC
TTTTCCTTCT ACAATTACCC TTACATGTTC GGCTTGCTGT TCGGGTTGGG GCTGTATGCC
GCCTATCAGC GTGCGCCCGA CGCTTTTCAG GCGCGCTACG ACGATCTGCT GGCTTCGACC
GGGCTGGCAA GCCCGCTCGA ACTGGCAGCG CGCATGGAGA TCGATCTGCG CTCACCCGCA
TTCTGGCGCG CCAGTCTCGA GGTCATTCGC TACGATATTG ATCGCTTCGA GTCGCTGGCA
GTTGCGACAT GA
 
Protein sequence
MTLSVSEPLP HWDMTVVYPA LDSPEFNHDL TATRQAFDTM TSLFDRLGIN RRDRQSTDDA 
AVAAFETVVP ALNELLEQFS TQRAYVYSFV ATDSRNDQAQ ATFSMLMREG VRLTKLLRRL
TAWLGGLDVE TLIQRSTIAR DHAYLVRRAA EEARHLMSPA EEELAAELDL SGGIAWARMY
QNLTSQILVP IEREGQTVEL PMSQVRNLAR DPDRAVRRSA HEAELAAWER AALPLASALN
SIKGQVLTLS RRRRWESPLE ASLFDNGIDR ATLDAMMTTA REFFPDFRRY LRAKARLLGL
ERLAWYDLFA PVGSGGRSWR FSDAEAFIVA QFTRYSTRMG DFAARAFRER WIDAEPRAGK
VGGAFCMSLR RDESRILLNH DPTADSMFTL AHELGHGYHN LNLAQQTMLN RDTPMTLAET
ASIFCETIVR NAALQDASRD ETLEILEAFL SGACQVVVDI TSRFLFETAL FEQRATRDLS
VAELCVLMID AQKQTYGDAL DEQTLHPFMW AVKGHYYSSG FSFYNYPYMF GLLFGLGLYA
AYQRAPDAFQ ARYDDLLAST GLASPLELAA RMEIDLRSPA FWRASLEVIR YDIDRFESLA
VAT