Gene Rcas_2557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2557 
Symbol 
ID5540039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3298669 
End bp3301077 
Gene Length2409 bp 
Protein Length802 aa 
Translation table11 
GC content62% 
IMG OID640894686 
ProductATP-dependent protease La 
Protein accessionYP_001432653 
Protein GI156742524 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0122618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACTC CACTCCCAAC CCAGGACCCG CCCGACATCC CGGAAGTGCT ACCGATCCTG 
CCGCTCAACA ACGTCGTGCT CTTCCCCGGC ATGTTCCTGC CGCTCGTCGT CAGCGGCGAC
ACATGGGTGA AACTGGTCGA TGAAGCAGCG CTTGCAACGA AGATGGTCGG CGTGTTTATG
CGCACTCAAC CCGGCGAGGG GTTCGATCCG CTGGCGCTGG CGCGCACCGG CGCCGTCGCG
TTAATCGTCC GCATGCTGCG GTTGCCGCAC GGCGCAGTGC AAATCCTGGT GCAGGGGCAG
GCGCGCATTC AGATCAGGCA ACTGATCGTC ACCGAACCAT ATCCGCAGGC GCGCGTCGCC
ATTCATCGCG ATCCCGCTGT GCTTTCGGTC GAGGTCAGCG GTCTGGCGCG CGCGGCGCTC
GCCGCCTTCC AGCAGATCAT CCAACTCAGC CCGACTCTGC CTGATGAACT GGCAATCGTT
GCGGCCAATA CCGCGCAACC CGGTATGCTG GCAGACCTGA TCGCAGCGAA TCTGAACCTT
AAACCAGAGG ATCAGCAACT TGTGCTCGAT ACGCTCGATG TGCAAGAACG CCTGCGTCAG
GTGCTCAGTT TCCTCGAGCG CGAACGTGAA ATTCTGACAA TTGGACGCAA GGCGCAGGAA
GAGATGTCGA AAAGCCAGCG CGAGTATGTG CTGCGCCAGC AACTCGAGGC GATCAAACGC
GAGTTGGGCG AAACCGACGA TCATGCCGCC GAAATTGCAG AGTTGCGTCG TCGGCTCGAG
GCAGCGAACC TTCCCGAAGA GGCGCGCAAA GAAGCCGAAC GCGAGATTTC GCGCCTGGAG
CGCATGCCCC CCGGCGCCGC CGAGTATGTC GTTGCCCGCA CCTACCTGGA CTGGCTCCTC
GATCTGCCGT GGAATGTCAG TACGGAAGAC AACCTCGATC TGACGCAGGC GCGTCAGGTG
CTCGATGAGG ATCACTACGA TCTGGAACGG ATCAAAGAGC GTATTATCGA GTACCTGGCG
GTGCGCAAAC TGCGGTTGGA GCAAGACGCC AGCGGCAGTG CGCGCGGTCC GATCCTCTGT
TTTGTCGGTC CTCCCGGCGT TGGGAAGACC AGTCTGGGAA CCTCGATTGC GCGTGCGCTG
GGGCGCAAGT TCGTGCGGGT GGCGCTCGGC GGCGTGCGCG ATGAGGCGGA GATTCGTGGG
CATCGCCGCA CCTATATCGG CGCGCTGCCA GGGCGCATCA TCCAGGGAAT TAATCGCGCC
GGCAGCAACA ATCCGGTCTT CATGCTCGAT GAGGTCGATA AATTGAGCGT CGGCTTCCAG
GGCGATCCGG CGGCTGCGTT GCTGGAAGTG CTCGACCCGG AACAGAATGT GGCGTTTGTC
GACCGCTACC TCGATGTGCC GTTCGATCTG AGTCGTGCGC TCTTCATCTG CACCGCCAAC
CGCTCCGATA CCATTCCGCC CGCGCTGCTC GACCGCATGG AACTACTGGA ACTCGCTGGC
TACACTGAGA TGGAAAAACT CGAGATTTGT CGCCGCTACC TGATCCAGCG CCAGCGGAAC
GAGCAAGGTC TGGCGGAACG CGCACCGACG ATCACCGAAG CGGCGCTTCG CCGCCTGATC
CGCGAATACA CCCACGAGGC GGGTGTGCGC GATTTGGAGC GACGGATCGG CGCGATTTAC
CGCAAGATGG CAACGCGCGC GGCGGAGGGA CAACCACTTC CGGATCAGGT GGATGCGCCC
GATCTCGATG ACTTGCTCGG ACCGCCGCGC TTCCGCAGTG AAACGCTCCT CGGTGAAGAT
GAGGTGGGCG TGGTCACCGG GCTGGCATGG ACGCCGACCG GCGGCGACGT GCTCTTTGTC
GAAGCGAGTG TTGTGCCGGG CAACGGTCAG TTGACGCTGA CCGGTCAACT TGGTGATGTG
ATGAAGGAAT CGGCGCGCGC AGCGCTGACG TATGCGCGGT CGCGGGCGCG GGCGTTGAAC
ATTCCGACCG ATTTTGCGCA GATTTGCGAT ATTCATATCC ATGTTCCGGC CGGCGCTGTT
CCCAAAGATG GACCATCCGC CGGCATCACG ATGGCAAGCG CGCTTATCTC AGCACTGACC
GACCGCCGTG CCTACAAACA CGTCGCCATG ACCGGCGAAA TCACGCTGCG CGGCAAGGTG
CTGCCCATTG GCGGGGTGAA GGAAAAGGTG CTGGCGGCGC AGCGCGCCGG TGTGCGCACG
GTTCTGCTGC CGAAGGCGAA TGCGCCCGAT CTGCGCGAAC TGCCGGAAGA AACGCGCCAG
CAAATCGACA TTGTTTTGGT CGAACACATG GATGAGGTAC TGCCGCGCGT CCTGCATCCC
AAGAGCGAGT CGGTCACCTT GGCTGAACCG GCGCCACCCG ATGGGGCGGG AACGGTGCAG
GCGACGTAA
 
Protein sequence
MSTPLPTQDP PDIPEVLPIL PLNNVVLFPG MFLPLVVSGD TWVKLVDEAA LATKMVGVFM 
RTQPGEGFDP LALARTGAVA LIVRMLRLPH GAVQILVQGQ ARIQIRQLIV TEPYPQARVA
IHRDPAVLSV EVSGLARAAL AAFQQIIQLS PTLPDELAIV AANTAQPGML ADLIAANLNL
KPEDQQLVLD TLDVQERLRQ VLSFLERERE ILTIGRKAQE EMSKSQREYV LRQQLEAIKR
ELGETDDHAA EIAELRRRLE AANLPEEARK EAEREISRLE RMPPGAAEYV VARTYLDWLL
DLPWNVSTED NLDLTQARQV LDEDHYDLER IKERIIEYLA VRKLRLEQDA SGSARGPILC
FVGPPGVGKT SLGTSIARAL GRKFVRVALG GVRDEAEIRG HRRTYIGALP GRIIQGINRA
GSNNPVFMLD EVDKLSVGFQ GDPAAALLEV LDPEQNVAFV DRYLDVPFDL SRALFICTAN
RSDTIPPALL DRMELLELAG YTEMEKLEIC RRYLIQRQRN EQGLAERAPT ITEAALRRLI
REYTHEAGVR DLERRIGAIY RKMATRAAEG QPLPDQVDAP DLDDLLGPPR FRSETLLGED
EVGVVTGLAW TPTGGDVLFV EASVVPGNGQ LTLTGQLGDV MKESARAALT YARSRARALN
IPTDFAQICD IHIHVPAGAV PKDGPSAGIT MASALISALT DRRAYKHVAM TGEITLRGKV
LPIGGVKEKV LAAQRAGVRT VLLPKANAPD LRELPEETRQ QIDIVLVEHM DEVLPRVLHP
KSESVTLAEP APPDGAGTVQ AT