Gene RoseRS_2778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2778 
Symbol 
ID5209747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3463927 
End bp3466335 
Gene Length2409 bp 
Protein Length802 aa 
Translation table11 
GC content63% 
IMG OID640596377 
ProductATP-dependent protease La 
Protein accessionYP_001277099 
Protein GI148656894 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGC CAACGCCGCC CCAGAATGCG CCGGAGATAC CGGAGATCCT GCCGGTTCTG 
CCGCTGAACA ACGTCGTGCT CTTTCCAGGC ATGTTCCTGC CGCTGGTCGT AAGCGGCGAC
ATGTGGGTCA AACTGGTGGA TGAAGCCGCT CTCTCGACCA AGATGGTCGG CGTCTTTATG
CGCACCCAAC CTGGCGAGGG GTTCGATCCG CTGGCGCTGG CGCGTACCGG AACGGCAGCG
CTCATTGTGC GCATGCTGCG GTTGCCGCAC GGGGCGGTGC AGATCCTGGT GCAGGGGCAG
GCGCGCATCC AGATTATGCA ACTGATCGTC AGCGAGCCAT ACCCGCAGGC GCGCATGTCC
ATCCATCGCG ATCCCGCCGT GCTGTCGGTC GAAGTGAGCG GTCTGGCGCG CGCGGCGCTC
GCCGCCTTCC AGCAGATCAT CCAGCTCAGC CCAACCCTTC CCGATGAACT GGCAATCGTC
GCCGCCAATA CCGCACAACC GGGCATGCTG GCAGACCTGA TCGCTGCCAA CCTGAATCTC
AAACCGGAAG ATCAGCAACT CGTGCTCGAT ACGCTCGATG TGCAGGATCG TCTGCGTCAG
GTGCTCAGTT TCCTCGAACG TGAACGCGAA ATCCTGACGA TTGGACGCAA GGCGCAGGAA
GAAATGTCGA AGAGCCAGCG CGAGTATGTG CTGCGCCAGC AACTAGAGGC AATCAGGCGC
GAACTGGGCG AGACCGATGA ACATGCTGCC GAAATTGCCG AGTTGCGGCG GCGCCTGGAA
GCGGCGAACC TGCCCGAAGA AGCCCGCAAA GAAGCGGAAC GCGAAATCTC CCGTCTGGAA
CGCATGCCCC CCGGTGCTGC CGAGTATGTC GTTTCGCGCA CCTACCTGGA CTGGCTGCTC
GACCTGCCGT GGAACGTAAG TACGGAAGAC AACCTCGATC TGGCGCAGGC GCGTCAGGTG
CTCGATGAGG ATCACTACGA CCTGGAACGC ATCAAGGAGC GGATTATCGA ATACCTGGCG
GTGCGCAAAC TGCGACTGGA GCAGAATGCC ACCGGCAGCG CTCGCGGTCC GATCCTCTGC
TTCGTTGGTC CGCCGGGGGT GGGGAAAACC AGCCTGGGCG CCTCAATTGC ACGCGCGCTG
GGACGGAAAT TCGTGCGCGT GGCGCTTGGC GGCGTGCGCG ACGAGGCGGA GATCCGCGGT
CACCGCCGCA CCTATATCGG CGCACTTCCG GGGCGCATCA TCCAGGGGAT CAACCGCGCT
GGCAGCAACA ATCCGGTCTT CATGCTCGAT GAGGTGGATA AACTCAGCGT CGGCTTCCAG
GGCGATCCGG CGGCTGCGCT GCTCGAAGTG CTTGACCCGG AACAGAATGC CGCATTCGTT
GACCGCTACC TGGATGTGCC GTTCGATCTG AGCCGTGTGC TCTTCATCTG TACCGCCAAC
CGTTCCGACA CCATTCCGCC AGCATTGCTC GACCGGATGG AGTTGCTGGA ACTGGCAGGC
TACACCGAAA TGGAAAAACT CGAGATCTGT CGTCGCTACC TGATCCAGCG CCAGCGGAGC
GAGCAGGGTC TGGCGGAACG TGGTCCGACG ATCACCGAAG CGGCGCTCCG CCGGCTCATC
CGCGAATACA CCCACGAGGC GGGCGTGCGC GACCTGGAGC GACGGATCGG CGCCATTTAC
CGCAAAATGG CGACGCGCGC TGCGGAAGGT CAATCCCTGC CAGATCAGGT GGATGCTCCC
GATCTCGACG ATCTGCTGGG ACCGCCGCGC TTCCGCAGTG AGACGCTGCT CGGTGAGGAT
GAAGTCGGCG TGGTGACCGG GCTTGCCTGG ACGCCGACTG GCGGTGATGT GCTCTTTGTC
GAAGCGAGTG TGGTGCCCGG CAACGGTCAG TTGACCCTGA CCGGGCAACT CGGCGATGTG
ATGAAGGAGT CGGCGCGCGC CGCACTGACC TATGCGCGTT CGCGGGCGCG TTCGCTGAAC
ATCCCGACCG ACTTTGCCCA GATCTGCGAT ATTCACATCC ACGTGCCGGC AGGCGCCGTA
CCAAAGGATG GTCCTTCGGC TGGCATTACT ATGGCAAGTG CCCTGATTTC GGCGCTTACC
GAGCGTCCTG CCCGTAAACA CGTGGCGATG ACGGGCGAAA TCACGTTGCG CGGCAAAGTG
CTTCCAATCG GCGGCGTCAA GGAGAAACTG CTGGCGGCGC AACGCGCTGG CGTGCATACT
GTGCTGCTGC CAAAGGCAAA TGCGCCCGAT CTGCGGGAAA TTCCGGAAGA AACCCGTCAG
CATCTCGAGA TCATCCTCGT CGAGCATATG GACGAAGTGT TGCCGCACGT GCTCCATCCC
CGCAGCGAGC CAGCGACCCA ACCCGAGTTG ACGCCAGCCG ATGGGATGGG AACAGCGCAG
ACGACGTAG
 
Protein sequence
MSTPTPPQNA PEIPEILPVL PLNNVVLFPG MFLPLVVSGD MWVKLVDEAA LSTKMVGVFM 
RTQPGEGFDP LALARTGTAA LIVRMLRLPH GAVQILVQGQ ARIQIMQLIV SEPYPQARMS
IHRDPAVLSV EVSGLARAAL AAFQQIIQLS PTLPDELAIV AANTAQPGML ADLIAANLNL
KPEDQQLVLD TLDVQDRLRQ VLSFLERERE ILTIGRKAQE EMSKSQREYV LRQQLEAIRR
ELGETDEHAA EIAELRRRLE AANLPEEARK EAEREISRLE RMPPGAAEYV VSRTYLDWLL
DLPWNVSTED NLDLAQARQV LDEDHYDLER IKERIIEYLA VRKLRLEQNA TGSARGPILC
FVGPPGVGKT SLGASIARAL GRKFVRVALG GVRDEAEIRG HRRTYIGALP GRIIQGINRA
GSNNPVFMLD EVDKLSVGFQ GDPAAALLEV LDPEQNAAFV DRYLDVPFDL SRVLFICTAN
RSDTIPPALL DRMELLELAG YTEMEKLEIC RRYLIQRQRS EQGLAERGPT ITEAALRRLI
REYTHEAGVR DLERRIGAIY RKMATRAAEG QSLPDQVDAP DLDDLLGPPR FRSETLLGED
EVGVVTGLAW TPTGGDVLFV EASVVPGNGQ LTLTGQLGDV MKESARAALT YARSRARSLN
IPTDFAQICD IHIHVPAGAV PKDGPSAGIT MASALISALT ERPARKHVAM TGEITLRGKV
LPIGGVKEKL LAAQRAGVHT VLLPKANAPD LREIPEETRQ HLEIILVEHM DEVLPHVLHP
RSEPATQPEL TPADGMGTAQ TT