Gene Rcas_2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2107 
Symbol 
ID5539587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2706219 
End bp2708639 
Gene Length2421 bp 
Protein Length806 aa 
Translation table11 
GC content59% 
IMG OID640894241 
Productpeptidase S16 lon domain-containing protein 
Protein accessionYP_001432210 
Protein GI156742081 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.801534 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCAG AACTTCCTCC CGAACAGTTG CGCCGCACCT TCGATCCTGG GCAGATGGTC 
TTTCCTACCA CCGAAGAGCC GCCAGGAGAC GGCGGCATCA TTGGTCAGCA GCGCGCGGTC
GCCGCACTGC GCTTCGGTCT CAATATGGTG GACGGCGGCT TCAACATCTA TGCCGCCGGC
CCGCCCGGCA TCGGCAAGAT GACCGCTGTT CAGGCGTTTA TCGAAGAACT TGCACAGCGT
CGTCCAACGC CGTCAGACTG GTGTTATGTC AATGATTTCG ATGATCCGTA TCAACCGAAG
GCGCTGCGCC TTCCTCCTGG ACGCGGACGA CGCTTGCAGC AGGATGTCCA TCAGATGATT
GCGCATCTGC GCGCCGAGCT GCCGCGCGCA TTCGAGAGTG ATGAGTATGC AATGCGGCGC
GACGAGGTAT TGCACGAACT CAATTCCCAT CGTGAAGCCT TGCTCAGCCA GATCAGCGAA
CGCGCCGCGC AACAGGGATT CGTGCTACAG GCCGGTCCTG TCGGCATCAT GATCATCCCA
ATTCGCAACG GTGAACCGCT CAGTGACGCG GCATTCCAGG CAATGACCCT CGACCAGCGC
GAGGAGTTGC TGCGCCATCG CGCAATGTTG CAGGAAGAAC TCAAGAACGT GTTGAAACAG
GTGCGCGCAG CGGAACGGAT TGCGCGTCAG CGCATGGAAG AAATCGACCG CCAGGTCGTC
GAGTACATCG TCGGCGGACT GATCGACGAT CTTCAGGAAC AGTACGCCGA CCTGCCCGAT
GTCGTCGCCT TTCTCGAAGC GATGCAGAAA GATATCCAGG AAAATCCTGA CCCCTTTCGC
TCAGGCGGAC AGCAGCAACC TTCCGGTGAA GCGCAGGTCG ATCTGGCGTC GATCCCGTGG
CTCAGAGAAT TGCCGTTCCG TAAGTACCAG GTGAATGTCC TGATCGACAA CAGCCGTCAG
CAGGGTGCGC CGGTGGTGGT TGAGTACAAT CCGACTTATC CCAATCTGTT TGGGCGTATC
GAGAAGGAAA CGCACTTCGG CGCACTCTAT ACCGACTTCC TGATGATCAA GCCCGGCAGC
CTGCACCGCG CCAATGGCGG GTTCCTCGTC ATTGAAGCCG AAGACCTGCT CCGCGATTAT
TTCAGTTGGG ACGGGCTTAA ACGCGCTCTA CGCACGCGCG ACATTCAGAT CGAAGAACTG
GCTGACCGCC TGGGGCTGAC AACCGTCAAG AGTCTCCGTC CGCAACCAAT CCCGCTTGAA
CTCAAGGTTG TGCTCGTCGG ACCGCCGCCG CCATACTATC TCCTTGCCGC TTACGACGAT
GAGTTTTCGA CCCTTTTCAA GGTTAAAGCC GATTTCGACA TCAGTATGCC GCTGAATGAC
GAGAACCTGC GCGGGTCGTT GCATCTGTTT CGACGCTTCT GCGAGCGCGA AAAACTCCTG
CCGATCACCG AGGAAGCAGC GGCGCGCCTG CTGGAACACT CGCTCCGCCT CGCCGATGAC
CAGGAGCGCC TTTCGACGCA CTTCGGCGCG CTGACCGATG TGGTGCGCGA GGCGAACTAT
TGGGCAATCC AGGAGCAGTG CAATGCTATT CTGGGGCGGC ATGTGCTTCG CGCGCTCGAT
GAAAAGGTCT ATCGCTCGAA CATGATCCAG GCGCGCATCC AGGAATTGAT CGACCGCGGG
ATTATCCTGA TCGATACAGA AGGCGCAAAG ATCGGTCAGA TCAATGGGTT GTCGGTGCTG
AGCCTGGGGG ATTATATGTT TGGCAGACCA AGCCGTATCA GCGTCAGCGT CGGACCAGGG
CGCGGCGCCA TTCTCGACAT CGAACGCGAG GTAAAACTGG GAGGACCAAT CCACAGCAAG
GGAGTGCTCA TTCTCAGCGG ACACCTTGCG GAACGGTACG GGCAGGAACG TCCGCTGACC
CTCTCAGCGC GGTTGGTCTT CGAGCAGAGT TATGAAGGGG TTGAGGGGGA CAGTGCTTCG
GCAGCAGAGT TGTTCGCGCT GCTCTCGGCG CTTGCTGAAC TGCCGTTGCG CCAGAGTATC
GCCGTTACCG GGTCGGTCAA TCAGCGTGGT GAGATCCAGG CGGTCGGTGG GGTCAACCAG
AAAATCGAAG GGTTTTTCGA TATCTGCCGG TTACGCGGTC TAACGGGTGA ACAGGGGGTG
CTCATTCCTC GAGCGAATGT GCAGAATCTG ATGCTGCGCA GCGACGTGGT GGAAGCAGTG
CGTGAGGGAC GGTTCCACAT CTGGACAGCA GCCACCGTCG ATGAAGGCAT TGCCCTGCTG
ACCGGCGTGC CGGCCGGCGA ACGCGGCGCA GATGGCGAAT ACCCGCCGGA CAGCGTCAAT
GGCCGGGTGA TGACGCGGCT GCGCGCCTTT GCGGAACGTC TGCGCGAAGG AGGGAAGGGT
AATGAGAAGG AAGCGCAGTG A
 
Protein sequence
MAAELPPEQL RRTFDPGQMV FPTTEEPPGD GGIIGQQRAV AALRFGLNMV DGGFNIYAAG 
PPGIGKMTAV QAFIEELAQR RPTPSDWCYV NDFDDPYQPK ALRLPPGRGR RLQQDVHQMI
AHLRAELPRA FESDEYAMRR DEVLHELNSH REALLSQISE RAAQQGFVLQ AGPVGIMIIP
IRNGEPLSDA AFQAMTLDQR EELLRHRAML QEELKNVLKQ VRAAERIARQ RMEEIDRQVV
EYIVGGLIDD LQEQYADLPD VVAFLEAMQK DIQENPDPFR SGGQQQPSGE AQVDLASIPW
LRELPFRKYQ VNVLIDNSRQ QGAPVVVEYN PTYPNLFGRI EKETHFGALY TDFLMIKPGS
LHRANGGFLV IEAEDLLRDY FSWDGLKRAL RTRDIQIEEL ADRLGLTTVK SLRPQPIPLE
LKVVLVGPPP PYYLLAAYDD EFSTLFKVKA DFDISMPLND ENLRGSLHLF RRFCEREKLL
PITEEAAARL LEHSLRLADD QERLSTHFGA LTDVVREANY WAIQEQCNAI LGRHVLRALD
EKVYRSNMIQ ARIQELIDRG IILIDTEGAK IGQINGLSVL SLGDYMFGRP SRISVSVGPG
RGAILDIERE VKLGGPIHSK GVLILSGHLA ERYGQERPLT LSARLVFEQS YEGVEGDSAS
AAELFALLSA LAELPLRQSI AVTGSVNQRG EIQAVGGVNQ KIEGFFDICR LRGLTGEQGV
LIPRANVQNL MLRSDVVEAV REGRFHIWTA ATVDEGIALL TGVPAGERGA DGEYPPDSVN
GRVMTRLRAF AERLREGGKG NEKEAQ