Gene Rcas_1623 details


Gene Information

Locus tag: Rcas_1623
Symbol:
ID: 5539099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2096543
End bp: 2097937
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 62%
IMG OID: 640893760
Product: hypothetical protein
Protein accession: YP_001431733
Protein GI: 156741604
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1030] Membrane-bound serine protease (ClpP class)
TIGRFAM ID:
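
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 2096543
end_bp = 2097937
reported_gene_length = 1395      # bp
reported_protein_length = 464    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


print(gene_length(start_bp, end_bp) == reported_gene_length)              # True
print(protein_length(reported_gene_length) == reported_protein_length)    # True
```

Under these assumptions, 2097937 - 2096543 + 1 gives 1395 bp, and 1395 / 3 - 1 gives 464 aa, which agrees with the reported values.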


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.475533
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGCAA ACTGCTCCTC TCATCAGCGC AGCGGCATGC TGATTGCAGC GCTGGCGATT 
GCGCTGCTGC TCCTCGTCTG GCCCGCTGGA AGCGCGAGAG CGCAACAGAG CGGGGGCCCG
CTGTACCTGA TTGAAGTCAG TGGTATCGTT AGCGCACCGA CGATTGATTA TCTGCGACGA
GCCGTGCAGA TTGCGGAAGC GTCGAGCGCC GACGCGCTGA TTATTTCGTA CAGCAGCACC
GGCGGCGTCC TTCGCGATGT GCGCGTCTTC GCGGCAGAAG TTGCTCAGGC GCGCGTGCCG
ATCGTGGTCT TCATTACTCC GCGGGGAACG CAGGCGGGTC CGGCCGGCGC TTTGTTCGTC
ACTGCTGCAC ACATCAGCGC CCTTTCGCCC GATACCAGTT TCGGCAGTCC TTTCCCGCTG
GCGCAGGTGG ACGATACGCT GAGCCAGCAA ACGCGCGACC TTCTTCTCGA TAATGTCAGT
TCGCAAATGC GGAAATGGAA TGAAACCCGT GGGCGGAACG CGGAATGGAT CGACCGTGCG
GTGCGTGAGG GGGTCATCCT CAACAATCAG CAGGCGATCA GCCTTGATCC TCCCGCAATT
GACCTGGTGG CCGCCGATCT GCGTGAATTG CTGACGTTGA TCGATGGGCG GATTGTCACC
CTCGATGATG GTCGTGTGGT GCGTATTTCG TCCATCGGTC GCATGCCAAC CCCAATCGAG
CCGACCCTTT TCGAGAGCCT GCGCCTGGCG CTGACCGATC CGACTGTCGC GTTTGCGCTG
CTGGTGCTCG GGTCGATGGC GATCTATCTT GAGTTGAACA CGCCGGGGAT CGGATTGTTC
GCCGGCGCCG GGTTTCTGCT GCTGCTAGGC GCGGCTGCCG GGTTTCTTGC GCTTCCGGTG
TTGTGGTGGG CAATGACATT GGTGATCCTT GGTCTGGTGC TGATCGGTGC TGAGTTTGTA
GCGCCGACTC ATGGCGGTTT GATGATCACA GGTCTGGTGA TGCTCGGTAT TGGCGGAGGA
AATCTGATCG ATGCGACACA GGCGCCCGGC GCCGGAGTCG CCTGGTGGGC GCTGTTGATC
GTGATCGGCG GGATCGCGGC CACTGCTGCG CTGGGGTTGG CGCTTGCCCT GCGCAGTCGC
AAACGTCCGG CAGCCATTGG AACCGAGGCG CTGGTGGGGC GCCTGGCGCA GGTGCGGCGG
CGACTGGATC CGCGCGGGAT GGTGTTTGTC GAAGGTGCGT TGTGGCAGGC GATCAGTGAG
ACCGATCCGG TGGAAGTTGG CGATTGGGTG CGCGTGGTCG CGGTGCATAA CTTGCAGTTG
ATTGTGCGAC CGCTCGAAAG CGAGGAGTCA ACGACCCGCG AAGCGCAACC TCCTTCTACA
GGACGTGTCG TCTAG
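
A GC percentage such as the one reported for this gene can be recomputed from a sequence in the space-grouped layout used here. This is a minimal sketch, assuming the input contains only the letters A, C, G, and T plus whitespace; `gc_content` is a hypothetical helper, and only the first sequence line is used as sample input, so the printed figure describes that line rather than the whole gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First line of the gene sequence, pasted as it is laid out above.
first_line = "ATGCCTGCAA ACTGCTCCTC TCATCAGCGC AGCGGCATGC TGATTGCAGC GCTGGCGATT"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence, pasted as one string with its spaces and line breaks intact, would give the value to compare against the reported GC content.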
 
Protein sequence
MPANCSSHQR SGMLIAALAI ALLLLVWPAG SARAQQSGGP LYLIEVSGIV SAPTIDYLRR 
AVQIAEASSA DALIISYSST GGVLRDVRVF AAEVAQARVP IVVFITPRGT QAGPAGALFV
TAAHISALSP DTSFGSPFPL AQVDDTLSQQ TRDLLLDNVS SQMRKWNETR GRNAEWIDRA
VREGVILNNQ QAISLDPPAI DLVAADLREL LTLIDGRIVT LDDGRVVRIS SIGRMPTPIE
PTLFESLRLA LTDPTVAFAL LVLGSMAIYL ELNTPGIGLF AGAGFLLLLG AAAGFLALPV
LWWAMTLVIL GLVLIGAEFV APTHGGLMIT GLVMLGIGGG NLIDATQAPG AGVAWWALLI
VIGGIAATAA LGLALALRSR KRPAAIGTEA LVGRLAQVRR RLDPRGMVFV EGALWQAISE
TDPVEVGDWV RVVAVHNLQL IVRPLESEES TTREAQPPST GRVV
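
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acid to each codon as the standard code, and it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that case does not arise here). `CODON_TABLE` and `translate` are hypothetical names.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and dropping one trailing stop."""
    cleaned = "".join(seq.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cleaned[i:i + 3]] for i in range(0, len(cleaned), 3)
    )
    # Remove the terminal stop codon symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGCCTGCAA ACTGCTCCTC TCATCAGCGC"))  # MPANCSSHQR
print(translate("GGACGTGTCG TCTAG"))                   # GRVV
```

Both fragments match the corresponding ends of the protein sequence listed above.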