Gene RoseRS_4361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4361 
Symbol 
ID5211345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5479417 
End bp5481339 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content60% 
IMG OID640597942 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_001278646 
Protein GI148658441 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.193811 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGATA ATCGCTGGCT GAAAAATAGT TTCGTCTACC TCATCATTCT TGTTGCTGCG 
TTGGCGCTGT TCATCAATTA CTTCAACAAT GCGCAGGGTC AGCAAGAGGA ACGCGGCATC
TATCAGGTGC TCGCCGATGC AAAAGCTGGC AGAGTTGAGA AGATCGAAGC GCAGTCGGGC
AACTCCGAAA TCCTGGTGAC GTACCGCGAT ACCAGGGCTA AGGTGCGGTC ACGCATCGAG
TCGAACGATA GTATTACGAT GCTGCTTGTG CAGGCAGGCG TGCCGCTCGA CGCAGTGAAC
GTCGAGGTGC GCGCAGCGCC AGCATGGGGC GGTCTGCTGA ATGTCTTCAC GTTCCTGCTG
CCAGTCTTAC TGATGATCGG CTTCTTCATC TTCTTTATGC GTCAGGCGCA GGGGTCGAAC
AATCAGGCGC TCTCGTTCGG CAAGAGCCGG GCGCGGATGT TTTCCGGCGA TAAGCCGACG
GTAACATTTG CCGATGTTGC CGGTCAGGAA GAAGCCAAGC AGGATTTGAC AGAAGTCGTT
GAGTTTCTCA AGTTTCCCGA CAAGTTTGCC GCCCTTGGAG CGCGTATTCC GCGTGGTGTG
CTGATGGTAG GTCCGCCAGG AACCGGCAAG ACCCTCCTGT CACGCGCGGT TGCCGGTGAA
GCCGGGGTGC CGTTCTTCTC GATCTCCGGT TCAGAATTCG TCGAGATGTT CGTCGGCGTC
GGCGCCAGCC GTGTGCGCGA CCTGTTCGAC CAGGCGAAGC GTAATGCCCC CTGCATCGTC
TTCATTGACG AGATCGATGC CGTCGGGCGT CAGCGCGGCG CCGGGCTTGG CGGCTCCCAC
GATGAGCGTG AGCAGACCCT CAACCAGATT CTGGTCGAGA TGGACGGCTT TGATACGAAT
ACGAATGTCA TCGTGATTGC AGCCACGAAC CGCCCCGATG TGCTCGACCC GGCACTGGTG
CGCCCCGGTC GCTTCGACCG CCAGGTGGTG CTCGATGCTC CGGACGTGAA AGGGCGCATC
GAGGTGCTCA AGGTGCATAC CAAGGGCAAG CCGCTCGCCG ATGATGTGCA GTTCGATGTG
ATCGCGCGTC AGACCCCCGG TTTCTCCGGT GCGGACCTGG CGAATGCAGT GAACGAGGCG
GCAATCCTGG CGGCGCGCCG CTCGAAGAAG AAGATCGGCA TGGCAGAGTT GCAGGACGCG
ATTGAGCGCG TGGCGCTCGG TGGTCCGGAG CGCCGCAGTC GGGTGCTGAC CGAACGTGAG
AAATTGCTGA CTGCATACCA CGAATCGGGG CACGCCATCG CCGCCGCTGG TATGCCCAAA
GCTTTCCCGG TGCAGAAAGT GACGATCGTG CCGCGTGGAC GCGCTGGCGG GTATACGCTC
TATCTGCCGG AAGAAGATAG CATTCGCTAC ACTACCGCAT CGCAGTTCGC CGCACAACTC
GTGTCGGCGC TCGGCGGGCG CGTGGCGGAA GAGATCGTCT TCGGTCCTGA TGAGGTCTCG
ACCGGCGCCG CAGGTGACAT TCAGCAGGTG ACGCGCATTG CCCGCGCAAT GGTGACGCGC
TACGGTATGA GTCCGAAGCT CGGTCCGATT GCGTTCGGTG AGCGTGAGGA ACTGATCTTC
CTCGGGCGAG AGATCACCGA GCAACGCAAC TACAGCGACG ATGTCGCGCG CGAGATCGAT
AATGAAGTGC ATCGCATCGT TTCGGAAGCG TATGAGCGCA CACGCCTGAT CCTGACGCAT
AACCGCGAGG TGCTGAACGA TATGGCGAGT GCGCTGATCG AGTATGAAAC GCTCGATGGC
GAACGCCTGA GAGAATTGCT CAGCCGTGTG GTGAAGATCG ATGAGATCGA GAGTCGGGTG
AACGGCGGCA ACGGCATGCT GACCACGCCA TCGGGCATGA ACGTTCCGTC TGCACAGGCA
TAA
 
Protein sequence
MGDNRWLKNS FVYLIILVAA LALFINYFNN AQGQQEERGI YQVLADAKAG RVEKIEAQSG 
NSEILVTYRD TRAKVRSRIE SNDSITMLLV QAGVPLDAVN VEVRAAPAWG GLLNVFTFLL
PVLLMIGFFI FFMRQAQGSN NQALSFGKSR ARMFSGDKPT VTFADVAGQE EAKQDLTEVV
EFLKFPDKFA ALGARIPRGV LMVGPPGTGK TLLSRAVAGE AGVPFFSISG SEFVEMFVGV
GASRVRDLFD QAKRNAPCIV FIDEIDAVGR QRGAGLGGSH DEREQTLNQI LVEMDGFDTN
TNVIVIAATN RPDVLDPALV RPGRFDRQVV LDAPDVKGRI EVLKVHTKGK PLADDVQFDV
IARQTPGFSG ADLANAVNEA AILAARRSKK KIGMAELQDA IERVALGGPE RRSRVLTERE
KLLTAYHESG HAIAAAGMPK AFPVQKVTIV PRGRAGGYTL YLPEEDSIRY TTASQFAAQL
VSALGGRVAE EIVFGPDEVS TGAAGDIQQV TRIARAMVTR YGMSPKLGPI AFGEREELIF
LGREITEQRN YSDDVAREID NEVHRIVSEA YERTRLILTH NREVLNDMAS ALIEYETLDG
ERLRELLSRV VKIDEIESRV NGGNGMLTTP SGMNVPSAQA