Gene RoseRS_3985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3985 
Symbol 
ID5210968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4986642 
End bp4987757 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content65% 
IMG OID640597576 
Productmetalloendopeptidase glycoprotease family 
Protein accessionYP_001278282 
Protein GI148658077 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.765684 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.99435 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGGC AAGTCACCAT TCTGGCAATC GAGACGTCCT GCGACGAGAC GGCAGCGGCG 
GTTATTCGCG GCGGGCGCAC GATCATCTCC AATGTCGTGG CATCGCAGAT CGATGAGCAT
CGGCGCTACG GCGGCATCGT CCCTGAAGTC GCGTCACGCC AGCACATCCT GACGATTGAT
GCTGTGCTGC ACGAAGCGTT ACGCCCGCTG CCCAGCGGAT GGAACGACAT CCATGCGGTC
GCGGCGACGT ATGGACCGGG GCTGGCAGGC GCGCTGATGA CCGGGCTGAA TGTCGCCAAG
GCAATCGCCT GGATCCGCGA ATTACCGTTT GTCGGAGTCA ACCATATCGA GGCGCACATC
TACGCAAACT GGTTGTTGAC CGATGCACAA CCAGAAGCGC CAGCGCCGCA ATTCCCCGTC
GTTGCGCTGG TGGTCAGCGG CGGGCATACG CTGCTGGCGC TGCTCGAAGG GCACGGACGC
TACCGGTTGC TCGGACAGAC GCGCGATGAT GCTGCTGGCG AGGCGTTCGA CAAAGTTGCG
CGTTTGCTGG GGCTTGGCTT TCCTGGCGGA CCTGCCATTC AGCGTGCGGC GGAAGGCGCA
CCGGGCGGGG TGGTGCTGCC GCGCGCCTGG TTGCGCGACA GTTACGATTT TTCGTTCAGT
GGGCTGAAGA CGGCGGTCCT CCATCAGATC CGCGACTATC AGGCGCGTGA AGCAGCGTTG
CAACCCGGCA CGGGGAAGAG CGCTGGCAAA CGCGGCGTCG GCGCCCCCAG CACGCCGCCT
GAAGCGACCG CTACGCCCCA CCTGCCGCCA ACAGTCGTGG CGCGTCTTGC CCGCGCCTTC
CAGGAGTCCG TCGTCGATGT GCTGGTCACG AAGACGGTCG AAGCAGCGCG CGCATTCGGC
GCTGCCGAGA TTCTGCTGGC CGGGGGCGTG GCGGCAAATC TTCGCCTGCG CGAGGAACTC
AACCGGCGCG CCCCGGTTCC TGTGCGCGTC CCACCGGTCG CCCTGTGTAC CGATAATGCT
GCGATGATCG GCGCTGCCGC CTTCTATCGC TTCGATGCCG GCATTCAGCA CGGATGGGAC
CTGGATGTCC AGCCGAACCT TGCCCTGGAT GGGTAG
 
Protein sequence
MNRQVTILAI ETSCDETAAA VIRGGRTIIS NVVASQIDEH RRYGGIVPEV ASRQHILTID 
AVLHEALRPL PSGWNDIHAV AATYGPGLAG ALMTGLNVAK AIAWIRELPF VGVNHIEAHI
YANWLLTDAQ PEAPAPQFPV VALVVSGGHT LLALLEGHGR YRLLGQTRDD AAGEAFDKVA
RLLGLGFPGG PAIQRAAEGA PGGVVLPRAW LRDSYDFSFS GLKTAVLHQI RDYQAREAAL
QPGTGKSAGK RGVGAPSTPP EATATPHLPP TVVARLARAF QESVVDVLVT KTVEAARAFG
AAEILLAGGV AANLRLREEL NRRAPVPVRV PPVALCTDNA AMIGAAAFYR FDAGIQHGWD
LDVQPNLALD G