Gene Rpal_2140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2140 
Symbol 
ID6409800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2309908 
End bp2311479 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content67% 
IMG OID642712024 
Productprotease Do 
Protein accessionYP_001991136 
Protein GI192290531 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.750781 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGATC GTCGCCCCGT CCTGTCCACG CAGTCTTCGC ACCGGCCCCA GTCGAGGTCG 
TTGCTGTCGG CGCGCAAGTT CGCGCTGATG GCCTCGGTCG TCGCCGGTCT CGGCGCGGGG
GCTTTCGGGC TCGGCAACGG TTCGTTCGAT CTGATCGCCA CTCCGGCGCA TGCGCAGCAG
GTCGGCGCCA ACGTTCAGCC GGCGCAGCAG CCGGTCGGTT TCGCCGACAT CGTCGACAAG
GTGAAGCCGT CGGTGATCTC GGTGAAGGTC AACATCGCCG ACAAGATGGC CAAGAACGAA
GACCGCGAGG ACTTCTCGTT CCCGCCCGGC TCGCCGATGG AGCGATTCTT CCGCCGGTTC
GGCGGCGAGA TGCCTCCCGG CCTGCGCGGC CATCGCGGCG GCGGCATGAT CACCGGCCAG
GGCTCGGGCT TCTTCATCTC GGCCGACGGC TATGCGGTGA CCAACAATCA CGTGGTCGAA
GGCGCCGACA AGGTCGAAGT CACCACCGAC GACGGCAAGA CCTACAAGGC CAAGGTGATC
GGCAATGATC CGCGCACCGA CTTGGCGCTG ATCAAAGTCG AAGGCGGTTC GAACTTCCCC
TACGCCAAGC TGTCGGAAGG CAAGCCGCGG ATCGGTGACT GGGTGCTGGC GGTCGGCAAT
CCGTTCGGCC TCGGCGGCAC CGTGACGGCC GGCATCGTCT CGGCGATGGG CCGCGACATC
GGCAACGGTC CGTACGACGA CTTCATCCAG ATCGACGCGC CGGTGAACAA GGGCAACTCC
GGTGGTCCGG CGTTTAACAC CGCGGGCGAA GTGGTCGGCG TCAACACCGC GATCTATTCG
CCGTCGGGCG GCAGCATCGG CATCGCATTC TCGATCCCGG CCAACACCGT CAAGGCGGTG
GTCGAGCAGC TCAAGGATCG CGGCTCGGTG AGCCGTGGCT GGATCGGCGT GCAGGTGCAG
CCGGTGACGC CGGAGATCGC CGACAGCCTC GGCTTGAAGA AGGCGGAAGG CGCGCTGGTC
GCAGAGCCGC AGTCGAACGG TCCGGCCGCC AAGGCCGGCA TCGAATCCGG CGACGTGATC
GTCGCGGTCG ATGGCACGTC GGTGAAGGAC GCTCGCGAAC TCGCCCGCAC CATCGGTGCG
TTCGCGCCGG GTCATGCGGT CAAGCTCACC GTGTTCCACA AGGGCAAGGA GCGTGAGCTG
ACGCTGACGC TCGGCGAGCT GCCGAACAAG ATCGAAGCCA GCAACAACAC CGACCGCGGT
GATCGCGGCG GAGCCAACCA GGGCCTCGAC CTGCCCAAGC TCGGCCTGAC GCTGGCTCCC
GCCAGCTCGG TCGCCGGTGC CGGCAAGGAT GGCGTGGTGG TCACCGACGT CGATCCGAAG
GGCGCCGCTG CAGACCGCGG CTTCAAGGAA GGCGATGTGA TCCTCGAGGT CGCCGGCAAG
AACGTGTCGA GCCCGGCGGA CGTCCGCGAC GTGCTCGCTA CGGCGAAGAC CGAAAACAAG
AACAGCGTGC TGGTCCGGGT ACGCAGCGGC GGCGCCTCGC GCTTCGTCGC CCTCCCGATC
GCCAAGGGCT GA
 
Protein sequence
MHDRRPVLST QSSHRPQSRS LLSARKFALM ASVVAGLGAG AFGLGNGSFD LIATPAHAQQ 
VGANVQPAQQ PVGFADIVDK VKPSVISVKV NIADKMAKNE DREDFSFPPG SPMERFFRRF
GGEMPPGLRG HRGGGMITGQ GSGFFISADG YAVTNNHVVE GADKVEVTTD DGKTYKAKVI
GNDPRTDLAL IKVEGGSNFP YAKLSEGKPR IGDWVLAVGN PFGLGGTVTA GIVSAMGRDI
GNGPYDDFIQ IDAPVNKGNS GGPAFNTAGE VVGVNTAIYS PSGGSIGIAF SIPANTVKAV
VEQLKDRGSV SRGWIGVQVQ PVTPEIADSL GLKKAEGALV AEPQSNGPAA KAGIESGDVI
VAVDGTSVKD ARELARTIGA FAPGHAVKLT VFHKGKEREL TLTLGELPNK IEASNNTDRG
DRGGANQGLD LPKLGLTLAP ASSVAGAGKD GVVVTDVDPK GAAADRGFKE GDVILEVAGK
NVSSPADVRD VLATAKTENK NSVLVRVRSG GASRFVALPI AKG