Gene Rmet_3233 details


Gene Information

Locus tag: Rmet_3233
Symbol:
ID: 4040068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007973
Strand:
Start bp: 3508636
End bp: 3509832
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 61%
IMG OID: 637978639
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_585374
Protein GI: 94312164
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DegQ family
            [TIGR02038] periplasmic serine peptidase DegS
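
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3508636
end_bp = 3509832

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1197 0 398
```

Under these assumptions the result agrees with the listed gene length (1197 bp) and protein length (398 aa).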


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCGCC GCTTCTGGCT GTTCTTCGCT CAGGCTGTCA CGGTTGTTCT GGCCGTGTGG 
TTCGTCGTGG CCACACTCAA ACCCGAGTGG CTGCAGCGCG GGCGGGTGGC CGTGCAATCG
GGTTCGCCCA TTGTGGCGCT AAAAGAGGTC GTCCCCAGTG TGGAAGGTTC GGCCGCTCCG
GGGTCCTATA GCGAGGCGGC CCGCCTTGCC ATGCCCGCAG TCGTCAATAT TTTCACCAGC
AAGAACGGAT CGAAGCGATC GCCCAATAAT CCGCAGGCCG AAGATCCGTG GTTCCGGTTC
TTCTTTGGCG ACCGCTTGCC GGAGCGCCAA GAGCCGGTGT CGAGCTTGGG CTCGGGCGTG
ATCGTCAGTG CCGAAGGTTA CATTCTAACC AACCACCACG TTGTGGATGG CGCCGACGAA
ATCGAGGTGG CGCTGACCGA CGGACGCAAG GCAAATGCCA AGGTGGTGGG CTCCGATCCC
GAAACCGACC TTGCCGTGCT GAAGGTCACG CTCAAGGACT TGCCTGCGAT CACGCTGGGG
CGGATCGAGA ACGTGAAGGT GGGCGATGTG GTGCTGGCTA TCGGCAACCC GTTTGGTGTC
GGCCAGACCG TGACAATGGG TATTGTCTCG GCGCTCGGCC GCAGCCATCT CGGCATCAAC
ACATTCGAGA ACTTCATTCA GACCGATGCA GCGATCAACC CCGGTAACTC TGGTGGTGCA
CTGGTCGACG CACAGGGCAA TCTGCTTGGC ATCAACACGG CGATCTATTC GCGCTCCGGC
GGCTCGCTCG GTATTGGCTT TGCGATTCCT GTGTCGACCG CCAAGCAAGT CATGGAATCG
ATCATCTCCA CGGGTAGCGT GACACGTGGC TGGATCGGCG TGGAGCCGCA GGATCTGACC
CCAGAGATTG CCGAGTCTTT CGGGCTCGAA GCCAAGGAAG GCGCGCTGAT TGCAGCGGTG
GTCCAGGGTG GGCCAGCTGA CAAGGCCGGC GTCAAACCTG GGGATGTGCT GGTCTCGGTC
GACAATCAAT CGATCTCGGA CACCACCGCC CTGCTCAACG CGATTGCACA GTTGAAACCG
GGCGCCGAGG TGAAGATGAA GGTGATTCGA CGCGGCAAAC CGGCGGAACT CACTGTCACG
ATCGGCAAGC GCCCGCCTCC TCCGCGCAGG CCGATGCCGC TGGATGAGGA AGAGTAG
 
Protein sequence
MLRRFWLFFA QAVTVVLAVW FVVATLKPEW LQRGRVAVQS GSPIVALKEV VPSVEGSAAP 
GSYSEAARLA MPAVVNIFTS KNGSKRSPNN PQAEDPWFRF FFGDRLPERQ EPVSSLGSGV
IVSAEGYILT NHHVVDGADE IEVALTDGRK ANAKVVGSDP ETDLAVLKVT LKDLPAITLG
RIENVKVGDV VLAIGNPFGV GQTVTMGIVS ALGRSHLGIN TFENFIQTDA AINPGNSGGA
LVDAQGNLLG INTAIYSRSG GSLGIGFAIP VSTAKQVMES IISTGSVTRG WIGVEPQDLT
PEIAESFGLE AKEGALIAAV VQGGPADKAG VKPGDVLVSV DNQSISDTTA LLNAIAQLKP
GAEVKMKVIR RGKPAELTVT IGKRPPPPRR PMPLDEEE
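
The gene and protein listings above can be related to each other by translating the nucleotide sequence in frame. The sketch below is one way to do that, not the database's own method. It assumes the listing is read in reading frame 1, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. The example input is the final line of the gene sequence.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base column layout."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in reading frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Final line of the gene sequence listing, including the TAG stop codon.
last_line = "ATCGGCAAGC GCCCGCCTCC TCCGCGCAGG CCGATGCCGC TGGATGAGGA AGAGTAG"
print(translate(last_line))  # prints: IGKRPPPPRRPMPLDEEE
```

The printed residues match the end of the protein sequence listing. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.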