Gene Rcas_3184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3184 
Symbol 
ID5540682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4138418 
End bp4140223 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content59% 
IMG OID640895305 
Productpeptidase M61 domain-containing protein 
Protein accessionYP_001433256 
Protein GI156743127 
COG category[R] General function prediction only 
COG ID[COG3975] Predicted protease with the C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.152326 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTTAG TCTATTCCAT CTCGATGTTG CGTCCCCATA CTCATCTGTA CGATGTAACG 
CTCGATATTC CCTCTGTTGA CGGTCCCACG CTTGATCTGG CGCTGCCGGC CTGGACGCCG
GGATCGTATC TGATCCGCGA TTATGCCCGC CATGTGCAGC AGTTCGCTGC TGCCAACGAT
CGCGGCGAAC CGTTGCCCTG GCAGAAGATC GATAAGACCA CCTGGCGCAT TATGGCGAGC
AATGCCCGTT CCGTGCGTGT CACCTATCAG GTGTATGCGT TCGATCTGAG CGTCCGCACT
AGCCACCTCG ATGGCACGCA CGGTTATTTC AACCCGTCCA ATCTCTGCAT GTACCGTTGC
GGCCATCTGC ACGAGCCATG CGTCGTCCAT GTCCAGACGC CGCTTGAATG GCGTGTAACG
ACCGGGCTGG AGCGGATCGA TGGCGCCGGG GAGCGCACTG GTTGGGCGAC ATTTCGCGCC
AACGATTATG ATGAACTGGT TGACTCGCCG TTCGAATGCG GAACGCACCG TCTGCTGACC
TTCGAGGTCG ATGGCATTCC TCACGAAATC GCGCTCTGGG GACGCGGCAA CGAAGATGAG
CGTCAGATTC TCGCCGATAC TCGCACGATT GTCGAAACCA CACGTGCCAT GTTTGGCGGA
TTACCCTATC AGCGTTATGT CTTCATCGTT CATCTGGTCG ATGGCGAATA TGGCGGTCTT
GAACATCGCA ACAGCGTCTC GAACATTGTG GATCGCTGGG GCTTTCGTCC TGCACGTTCG
TATGAGCGAT TTCTGGCGCT CACAGCTCAT GAGTTTTTTC ACGTCTGGAA TGTTAAGCGC
ATTCGTCCTG CGCCGCTTGG TCCGTTCGAC TACGCGCGCG AAAACTACAC CCGCCAGTTG
TGGGTGATGG AAGGAATCAC CAGTTATTAC GACCATCTGA TTCTGCTGCG TGCCGGCTTA
ATCAGTCGCG AACGGTATCT CGAAACGCTC GCCGACGACA TTAAGTTATT GCAGAGCCAG
CCGGGACGGG CGCTTCAGTC GCTGGAACAG AGCAGTTTCG ACGCCTGGAT TAAGTTCTAT
CGCCCTGATG AGAACGGACC GAACAGCAGT GTGTCGTACT ATCTGAAAGG CAGCCTGGTG
GCGCTGCTGC TCGATCTGGA GATCCGGCGG CGCACCGGTG GGGCGCGTTC GCTCGATGAT
GTCATGCGCT ATCTGTATGC TGAATATGCC GGCGATCAGG TGCACGACCT CTACAGCGGC
GCATTTGCCA AGCGCCCTGG CTTCGATGAT GATGACGGGT TCTGCCGCGC GGTGGAAGCA
GTCGCCGGCG AGGAGGGCGG GGCATACCGC GCACTGCTGG CGCGCGCCGT CGCCAGTACG
GATGAATTGG AGTATGATCG CGCATTCGAT GCCGTGGGGT TGCGCCTGGT CTGGGGGCAT
TCGCTCGAAA AAGAGAACGA TCATCTCCCC GCCTGGCATG GGTTACGCCT CAAAACGGAT
CATGGTCGAC TCAAGGTCTC GGTGGTGCTG GCAGGTGGAC CCGGCGAGTC TGCCGGGATT
TATGCCGGTG ACGAACTGAT CGCGCTGGAT GGCGTTCGTA TCGACGAAGA GCGCCTCAAA
GCGCGGCTGG CGGAACGTAA ACCTGGCGAT ACCGTGCTGT TCAGCCTCTT TCGCCGCGAC
GACCTGCTCC ACATCCCGCT GCAACTTACC GAATCTCCGC CTGATACCCT GACGATCACG
CCGGTCGAAT CGCCGACAGA AGAGCAGCAA CGGCTCCTGG ACGGGTGGCT GAACGTTGCC
CGTTGA
 
Protein sequence
MPLVYSISML RPHTHLYDVT LDIPSVDGPT LDLALPAWTP GSYLIRDYAR HVQQFAAAND 
RGEPLPWQKI DKTTWRIMAS NARSVRVTYQ VYAFDLSVRT SHLDGTHGYF NPSNLCMYRC
GHLHEPCVVH VQTPLEWRVT TGLERIDGAG ERTGWATFRA NDYDELVDSP FECGTHRLLT
FEVDGIPHEI ALWGRGNEDE RQILADTRTI VETTRAMFGG LPYQRYVFIV HLVDGEYGGL
EHRNSVSNIV DRWGFRPARS YERFLALTAH EFFHVWNVKR IRPAPLGPFD YARENYTRQL
WVMEGITSYY DHLILLRAGL ISRERYLETL ADDIKLLQSQ PGRALQSLEQ SSFDAWIKFY
RPDENGPNSS VSYYLKGSLV ALLLDLEIRR RTGGARSLDD VMRYLYAEYA GDQVHDLYSG
AFAKRPGFDD DDGFCRAVEA VAGEEGGAYR ALLARAVAST DELEYDRAFD AVGLRLVWGH
SLEKENDHLP AWHGLRLKTD HGRLKVSVVL AGGPGESAGI YAGDELIALD GVRIDEERLK
ARLAERKPGD TVLFSLFRRD DLLHIPLQLT ESPPDTLTIT PVESPTEEQQ RLLDGWLNVA
R