Gene Rcas_0717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0717 
Symbol 
ID5538182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp939958 
End bp941874 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content59% 
IMG OID640892872 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_001430856 
Protein GI156740727 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGACA ATCGCTGGCT GAAGAATAGT TTCGTCTACC TCATCATCCT GGTCGCTGCA 
TTGGCGCTGT TCTTCAATTA TTTCAACAAT GCGCAGGGTC AGGCGGAGGA ACGGGGCATC
TATCAGGTGC TCGCAGATGC CAAAGCTGGC AGGGTTGAGA AGATCGAAGC GCAGTCGGGC
AACACCGAGA TTCTGGTGAC GTACCGCGAT ACCAGAACGA AAGTGCGGTC GCGCATTGAG
TCGAATGATA GTATCACGAT GCTGCTGGTG CAGGCTGGTG TGCCGCTCGA CGCGGTGAAT
GTCGAGGTGC GCGCGGCGCC TGCCTGGGGC GGGTTGCTGA ATGTTTTCAC CATCCTGTTG
CCGGTATTGT TGATGATCGG CTTTTTTGTC TTCTTCATGC GTCAGGCGCA AGGGTCGAAC
AATCAGGCGC TGTCGTTTGG CAAGAGCCGG GCGCGTATGT TCTCTGGCGA TAAGCCGACG
GTGACGTTTG CCGATGTCGC CGGTCAGGAA GAAGCCAAAC AGGATTTGAC CGAGGTTGTT
GAGTTTCTTA AGTTTCCTGA CAAGTTTGCG GCGCTTGGTG CGCGTATTCC GCGCGGCGTG
CTGATGGTTG GTCCTCCGGG AACCGGCAAG ACGCTCCTGT CGCGTGCGGT CGCCGGCGAG
GCAGGGGTGC CGTTCTTCTC AATCTCTGGT TCGGAGTTCG TTGAGATGTT CGTCGGTGTC
GGCGCCAGCC GTGTGCGCGA CCTGTTCGAC CAGGCGAAGC GGAATGCGCC CTGTATTGTG
TTCATCGACG AGATCGACGC GGTCGGTCGG CAGCGTGGCG CCGGGCTTGG CGGCTCGCAC
GACGAACGCG AGCAGACGCT CAATCAGATT CTGGTTGAGA TGGATGGCTT CGATACCAAT
ACAAACGTTA TCGTCATTGC CGCCACCAAC CGACCAGACG TGCTCGATCC GGCGCTGGTG
CGCCCCGGTC GCTTCGACCG CCAGGTGGTG CTCGATGCGC CGGATGTGAA AGGGCGTATT
GAGGTGCTCA GGGTGCATAC CAAGGGTAAG CCGCTTGCCG ATGATGTGCA ACTCGATGTC
ATCGCAAGGC AGACGCCCGG CTTCTCTGGG GCCGATCTGG CAAATGCGGT GAACGAGGCG
GCGATTCTGG CGGCGCGCCG TTCGAAGAAG AAGATTGGCA TGGCAGAGTT GCAGGACGCG
ATTGAGCGCG TGGCGCTCGG TGGTCCTGAG CGACGCAGCC GGGTGTTGAC CGAACGGGAA
AAATTGCTGA CGGCGTATCA CGAATCCGGT CACGCGATTG CGGCGGCGGG TATGCCGAAA
GCCTTTCCGG TGCAGAAGGT GACGATTGTG CCGCGTGGAC GAGCCGGCGG GTATACGCTC
TATCTGCCCG AAGAGGATAG TATTCGTTAC ACAACTGCGT CGCAGTTCGC AGCGCAACTC
GTCTCGGCGC TTGGCGGGCG CGTGGCAGAA GAGATTGTCT TCGGACCTGA TGAGGTCTCG
ACGGGGGCGG CGGGTGATAT TCAGCAGGTG ACACGCATTG CGCGCGCGAT GGTGACCCGC
TATGGCATGA GTGCGAAGCT TGGTCCGATT GCATTCGGTG AGCGCGAGGA GTTGATCTTC
CTGGGGCGCG AGATTACTGA GCAGCGCAAC TATAGCGATG CTGTGGCGCG CGAGATCGAT
AACGAAGTGC ATCGCATCGT TTCAGAAGCG TATGAGCGTA CTCGCCTGAT CCTGACGTAT
AACCGCGAGG TGCTGAACGA TATGGCCAGT GCGCTGATTG AGTATGAAAC GCTCGATGGT
GAACGCTTGA AAGAATTGAT CAGCCGTGTC GTGAAGATCG ATGAGATTGA GCGTCGCCCG
AACGGTGGCA ACGGGGTGCT GGATACTTCA TCAACGCTGA CTGCGCCGCA GGCATAG
 
Protein sequence
MGDNRWLKNS FVYLIILVAA LALFFNYFNN AQGQAEERGI YQVLADAKAG RVEKIEAQSG 
NTEILVTYRD TRTKVRSRIE SNDSITMLLV QAGVPLDAVN VEVRAAPAWG GLLNVFTILL
PVLLMIGFFV FFMRQAQGSN NQALSFGKSR ARMFSGDKPT VTFADVAGQE EAKQDLTEVV
EFLKFPDKFA ALGARIPRGV LMVGPPGTGK TLLSRAVAGE AGVPFFSISG SEFVEMFVGV
GASRVRDLFD QAKRNAPCIV FIDEIDAVGR QRGAGLGGSH DEREQTLNQI LVEMDGFDTN
TNVIVIAATN RPDVLDPALV RPGRFDRQVV LDAPDVKGRI EVLRVHTKGK PLADDVQLDV
IARQTPGFSG ADLANAVNEA AILAARRSKK KIGMAELQDA IERVALGGPE RRSRVLTERE
KLLTAYHESG HAIAAAGMPK AFPVQKVTIV PRGRAGGYTL YLPEEDSIRY TTASQFAAQL
VSALGGRVAE EIVFGPDEVS TGAAGDIQQV TRIARAMVTR YGMSAKLGPI AFGEREELIF
LGREITEQRN YSDAVAREID NEVHRIVSEA YERTRLILTY NREVLNDMAS ALIEYETLDG
ERLKELISRV VKIDEIERRP NGGNGVLDTS STLTAPQA