Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_4361 |
Symbol | |
ID | 5211345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 5479417 |
End bp | 5481339 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640597942 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_001278646 |
Protein GI | 148658441 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.193811 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGATA ATCGCTGGCT GAAAAATAGT TTCGTCTACC TCATCATTCT TGTTGCTGCG TTGGCGCTGT TCATCAATTA CTTCAACAAT GCGCAGGGTC AGCAAGAGGA ACGCGGCATC TATCAGGTGC TCGCCGATGC AAAAGCTGGC AGAGTTGAGA AGATCGAAGC GCAGTCGGGC AACTCCGAAA TCCTGGTGAC GTACCGCGAT ACCAGGGCTA AGGTGCGGTC ACGCATCGAG TCGAACGATA GTATTACGAT GCTGCTTGTG CAGGCAGGCG TGCCGCTCGA CGCAGTGAAC GTCGAGGTGC GCGCAGCGCC AGCATGGGGC GGTCTGCTGA ATGTCTTCAC GTTCCTGCTG CCAGTCTTAC TGATGATCGG CTTCTTCATC TTCTTTATGC GTCAGGCGCA GGGGTCGAAC AATCAGGCGC TCTCGTTCGG CAAGAGCCGG GCGCGGATGT TTTCCGGCGA TAAGCCGACG GTAACATTTG CCGATGTTGC CGGTCAGGAA GAAGCCAAGC AGGATTTGAC AGAAGTCGTT GAGTTTCTCA AGTTTCCCGA CAAGTTTGCC GCCCTTGGAG CGCGTATTCC GCGTGGTGTG CTGATGGTAG GTCCGCCAGG AACCGGCAAG ACCCTCCTGT CACGCGCGGT TGCCGGTGAA GCCGGGGTGC CGTTCTTCTC GATCTCCGGT TCAGAATTCG TCGAGATGTT CGTCGGCGTC GGCGCCAGCC GTGTGCGCGA CCTGTTCGAC CAGGCGAAGC GTAATGCCCC CTGCATCGTC TTCATTGACG AGATCGATGC CGTCGGGCGT CAGCGCGGCG CCGGGCTTGG CGGCTCCCAC GATGAGCGTG AGCAGACCCT CAACCAGATT CTGGTCGAGA TGGACGGCTT TGATACGAAT ACGAATGTCA TCGTGATTGC AGCCACGAAC CGCCCCGATG TGCTCGACCC GGCACTGGTG CGCCCCGGTC GCTTCGACCG CCAGGTGGTG CTCGATGCTC CGGACGTGAA AGGGCGCATC GAGGTGCTCA AGGTGCATAC CAAGGGCAAG CCGCTCGCCG ATGATGTGCA GTTCGATGTG ATCGCGCGTC AGACCCCCGG TTTCTCCGGT GCGGACCTGG CGAATGCAGT GAACGAGGCG GCAATCCTGG CGGCGCGCCG CTCGAAGAAG AAGATCGGCA TGGCAGAGTT GCAGGACGCG ATTGAGCGCG TGGCGCTCGG TGGTCCGGAG CGCCGCAGTC GGGTGCTGAC CGAACGTGAG AAATTGCTGA CTGCATACCA CGAATCGGGG CACGCCATCG CCGCCGCTGG TATGCCCAAA GCTTTCCCGG TGCAGAAAGT GACGATCGTG CCGCGTGGAC GCGCTGGCGG GTATACGCTC TATCTGCCGG AAGAAGATAG CATTCGCTAC ACTACCGCAT CGCAGTTCGC CGCACAACTC GTGTCGGCGC TCGGCGGGCG CGTGGCGGAA GAGATCGTCT TCGGTCCTGA TGAGGTCTCG ACCGGCGCCG CAGGTGACAT TCAGCAGGTG ACGCGCATTG CCCGCGCAAT GGTGACGCGC TACGGTATGA GTCCGAAGCT CGGTCCGATT GCGTTCGGTG AGCGTGAGGA ACTGATCTTC CTCGGGCGAG AGATCACCGA GCAACGCAAC TACAGCGACG ATGTCGCGCG CGAGATCGAT AATGAAGTGC ATCGCATCGT TTCGGAAGCG TATGAGCGCA CACGCCTGAT CCTGACGCAT AACCGCGAGG TGCTGAACGA TATGGCGAGT GCGCTGATCG AGTATGAAAC GCTCGATGGC GAACGCCTGA GAGAATTGCT CAGCCGTGTG GTGAAGATCG ATGAGATCGA GAGTCGGGTG AACGGCGGCA ACGGCATGCT GACCACGCCA TCGGGCATGA ACGTTCCGTC TGCACAGGCA TAA
|
Protein sequence | MGDNRWLKNS FVYLIILVAA LALFINYFNN AQGQQEERGI YQVLADAKAG RVEKIEAQSG NSEILVTYRD TRAKVRSRIE SNDSITMLLV QAGVPLDAVN VEVRAAPAWG GLLNVFTFLL PVLLMIGFFI FFMRQAQGSN NQALSFGKSR ARMFSGDKPT VTFADVAGQE EAKQDLTEVV EFLKFPDKFA ALGARIPRGV LMVGPPGTGK TLLSRAVAGE AGVPFFSISG SEFVEMFVGV GASRVRDLFD QAKRNAPCIV FIDEIDAVGR QRGAGLGGSH DEREQTLNQI LVEMDGFDTN TNVIVIAATN RPDVLDPALV RPGRFDRQVV LDAPDVKGRI EVLKVHTKGK PLADDVQFDV IARQTPGFSG ADLANAVNEA AILAARRSKK KIGMAELQDA IERVALGGPE RRSRVLTERE KLLTAYHESG HAIAAAGMPK AFPVQKVTIV PRGRAGGYTL YLPEEDSIRY TTASQFAAQL VSALGGRVAE EIVFGPDEVS TGAAGDIQQV TRIARAMVTR YGMSPKLGPI AFGEREELIF LGREITEQRN YSDDVAREID NEVHRIVSEA YERTRLILTH NREVLNDMAS ALIEYETLDG ERLRELLSRV VKIDEIESRV NGGNGMLTTP SGMNVPSAQA
|
| |