Gene RoseRS_3207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3207 
Symbol 
ID5210178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4042932 
End bp4045010 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content61% 
IMG OID640596799 
Productpeptidase S41 
Protein accessionYP_001277518 
Protein GI148657313 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000881492 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0220331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACG GCTCTATGGG GCGCGCGGTA GCGCTGGCAG TTATCGCACT GGCGCTTGCC 
GCCTGTGGTG TCGCACCGAT GGGGTCGCCA CCGGCGGCTG CACTCCCGGT GGAAGGTGCA
ACCGCTGTCA GGCCGCCTGC CGTTTCACCG CCTGCGGCGA CGTCGGCGCC GGCGCCAACT
CCTTCGCCTG CGCCGCGCTC ATCGGCGGGC GGCGTCGAGG TGATCACCGG CGAATTCGAG
TATACCAACG ACATCATCAC CGTCTACTAC GTGGAACATG CGGTCGGGCT GGTCGATCTC
TACGGGTTCG TCACCCGCGA TGAGGAGTGG GAACTGCCGG TTGACAGTCA GGTGTTGGGA
CCTCTGACGA TCGATACGGA TCAACAGCGC GGTACATTCC GTCTGTTGCT TCCCGCGCGC
CCTGCCGGGA CGATGGTCGA TGTCGATAAT GACGGTCAGC GTAATGCTGG CGTGCAGGTG
TTTGTGGTGG CGTACTGGCC CAACCTGTAC GGCGGTCCGT TTTCCGAAGG GGATGATCGC
AGTTTTGGCT GGCCCGCCTA TCTTGCTTCG ACGATCAATG ATCCGGAGAA TAACGATGAA
GTGACCGGCG GCAAACTGGT GGTGTGGTCG CCGGACGATG CGCAGCAGTT TCCGACAGAT
TTCGGTCCTG ATGGGTTGCT GTTCACCGAT GACGATCCGG TTGGTTCGCT TCCCGCCGGG
TATTCGGTGA TCGATCTTGA TCGGCGACCG TTTGCCATCG AGCGCGACCG CGAGGTGCGG
ATCACGCTGT ATGAGCCGCC GGATGCAGCG ATCAAGGATT TCTCCGATCT GTCGTATACG
AAGGCGTTCG ATGCGATGTT CAACAGGGTG CGTGTGGAGT ACGCCTTCAA CGGCATTCCT
GGCAAAGCGC CGGATTGGGA TGCGCTCTAC GCCGAACTGG CGCCGCGCGT GGCTGAGGCG
GAGCGTCGGG CGGATCGCAG GGCGTTTTTT GACGTGCTGT TCGATTTTGC GTATGCCTTC
CGCGATGGGC ATGTCGGCGT CAGTTCGCCG CTGTCCGGCG CGCTGTTCCG TGAACGGGCG
TCCGGCGGGT ACGGTTTTGC CATTCGTGAA CTGGATGATG GTCGGGCGCT GGTGATGTTT
GTGACGCGCA ACGGTCCTGC CGATCGCGCC GGGATGCAGG TGGGGGCGGA ACTGCTGGCG
TTCAATGGCA CGCCGGTCAA AGAAGCGATT GCCGCAGTTG AGCCGCTGGG CGGTCCGTTT
TCGACCGATT TTGCGCTGCG CTATCAGCAG GCGCGCTACC TGTTGCGTGC GCCGGTTGGA
ACATCAGCGC AGGTGACCTT TGCCAATCCG CGCAGTGCGC CCCAGACGGT CACACTGCGC
GCCGTCGAAG AGCGCGATAG TTTTCTTGCA ACGTCGATCT ATGAGGAGCG CAATCCGGCG
GCGTTGCCGG TTGAGTTCGA GCAGCGACCG TCCGGTGTCG GGTATATTCG CATCAACGCC
AATTACGACG ATCTGAACCT GCTGATCAGG CTGTTTGAAC GCGCGCTGAA GACGTTCGAC
GACCTGGATG TTCCCGGCAT TATCATTGAT ATGCGCCAGA ACAACGGCGG TGCGCCGCTC
GGGCTTGCCG GTTTTCTGAC CGATCAGGAG ATCATTATCG GTCAGGATGA ATACTATAGT
GAACGCACCG GTCGTTTTGA GCCGGAAGGT CCGGTTGATA AAATTCTACC CAACCAGAAT
CAGTACCGTT TCGATAAAAT TGCGCTGCTG GTGGGTCAGG CATGCTTCAG CGCGTGCGAG
TATGAGTCGT ATGGGTTCAG CAGGGTTCCA GGCGTGATCG TGGTTGGTGA AACGCCAACT
GCGGGGGTGT ACGCCGAAGT GTCGCGCGGG CAGTATGTAT TGCCGGACGG TATCTTCCTG
CAAGTCCCGA CTGGTCGCAC GCTGTTGCCC GATGGAACGC CGCTGCTGGA GGGCGTGGGG
GTTGTGCCGA CTATTCGCGT GCCGGTCACT GCGGAGACGG TGCTTTCTGA TCGCGATGTG
GTGCTGGAGC GGGCGGAGCG CGAGATCGTC GGGCGCTGA
 
Protein sequence
MKHGSMGRAV ALAVIALALA ACGVAPMGSP PAAALPVEGA TAVRPPAVSP PAATSAPAPT 
PSPAPRSSAG GVEVITGEFE YTNDIITVYY VEHAVGLVDL YGFVTRDEEW ELPVDSQVLG
PLTIDTDQQR GTFRLLLPAR PAGTMVDVDN DGQRNAGVQV FVVAYWPNLY GGPFSEGDDR
SFGWPAYLAS TINDPENNDE VTGGKLVVWS PDDAQQFPTD FGPDGLLFTD DDPVGSLPAG
YSVIDLDRRP FAIERDREVR ITLYEPPDAA IKDFSDLSYT KAFDAMFNRV RVEYAFNGIP
GKAPDWDALY AELAPRVAEA ERRADRRAFF DVLFDFAYAF RDGHVGVSSP LSGALFRERA
SGGYGFAIRE LDDGRALVMF VTRNGPADRA GMQVGAELLA FNGTPVKEAI AAVEPLGGPF
STDFALRYQQ ARYLLRAPVG TSAQVTFANP RSAPQTVTLR AVEERDSFLA TSIYEERNPA
ALPVEFEQRP SGVGYIRINA NYDDLNLLIR LFERALKTFD DLDVPGIIID MRQNNGGAPL
GLAGFLTDQE IIIGQDEYYS ERTGRFEPEG PVDKILPNQN QYRFDKIALL VGQACFSACE
YESYGFSRVP GVIVVGETPT AGVYAEVSRG QYVLPDGIFL QVPTGRTLLP DGTPLLEGVG
VVPTIRVPVT AETVLSDRDV VLERAEREIV GR