Gene Rpal_3637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3637 
Symbol 
ID6411313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3897476 
End bp3898867 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content67% 
IMG OID642713517 
Productprotease Do 
Protein accessionYP_001992612 
Protein GI192292007 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.25969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCCGA TCCGCACGTT CGCCGTCCTC TGTGTGTCAC TCGCCCTCAC CACGCCGCTC 
GCGGCGCAGG ACCGGCGGGT GCCGTCGTCG CCGGCGGAGC TGAAGCTGTC CTACGCGCCG
ATTGTGCAGC ATGTGCAGCC GGCGGTCGTG AACGTCTATG CCGCCAAGGT GGTGCAGAAC
CGAAATCCGC TGCTGGAAGA TCCGATCTTC CGCCGGTTCT TCGGCGTGCC CGGCCAGCCG
GAGCAGATCC AGCGCTCGCT CGGCTCGGGT GTGATGGTCG ATGCCTCGGG CCTCGTCGTC
ACCAACAATC ACGTCATCGA GGGCGCCGAT CAGGTCAAGG TCGCGCTCGC CGACAAGCGC
GAGTTCGAAG CCGAGATCGT GCTGAAGGAC AGCCGCACCG ATCTGGCGGT GCTGCGGCTC
AAGGACACCA GCGAGAAATT CCCCACGCTC GACTTCGCCA ACTCCGACGA CCTGCTGGTC
GGCGACGTGG TGCTGGCGAT CGGCAATCCG TTCGGCGTCG GTCAGACGGT GACGCATGGC
ATCGTCTCGG CGCTGGCGCG CACTCAGGTC GGCATTACCG ACTATCAGTT CTTCATTCAG
ACCGACGCCG CGATCAACCC GGGCAATTCC GGCGGCGCGC TGGTCGATGT CTCGGGCAAG
CTGGTTGGTA TCAACACCGC GATCTTCTCG CGCTCGGGCG GCTCGCAGGG GATCGGCTTC
GCGATCCCCG CCAACATGGT GCGCGTCGTG GTCGCCTCGG CCAAGAGCGG CGGCAAGGCC
GTGAAGCGGC CGTGGCTCGG CGCGCGGCTG CAGGCGGTGA GCCCGGAGAT CGCCGAGACC
CTGGGGCTGA AGCGGCCGGG CGGTGCGCTG GTCGCCAGCG TTACCAAGGG CAGCCCGGCG
GAGCGGGCAG GGCTGAAATT GTCCGACCTG ATCGTGTCGA TCGACGGCTT TGCGATCGAT
GATCCCAACG CGTTCGATTA TCGGTTTGCG ACGCGTCCGC TTGGCGGTGC CGCGCAGCTC
GAAGTGCAGC GCAGCGGCAA GGCGGTGAAG CTGTCGATCC CGCTCGAAAC CGCACCGGAC
TCCGGCCGCG ACGAGCTGGT GATCACCTCG CGCTCGCCGT TCCAGGGTGC GAAGATCGCC
AATATTTCCC CGGCGATCGC CGACGAAATG CGGCTCGATC CGAGCGTCGA AGGCGTCGTG
GTCACCGATC TTCCCGACGA CAGCACTGCG GCGAATGTCG GCTTCCAGAA GGGCGACATC
ATCGTCGCCG TCAACAACAC CCGGATCGGC AAGACCAGCG ACCTCGAACG CGTAGCCGGC
CAAACGGCGC GGCTGTGGCG CATCATGCTG GTCCGCGGCG GCCAGCAGAT CCAAGTCACC
TTGGGCGGGT AG
 
Protein sequence
MNPIRTFAVL CVSLALTTPL AAQDRRVPSS PAELKLSYAP IVQHVQPAVV NVYAAKVVQN 
RNPLLEDPIF RRFFGVPGQP EQIQRSLGSG VMVDASGLVV TNNHVIEGAD QVKVALADKR
EFEAEIVLKD SRTDLAVLRL KDTSEKFPTL DFANSDDLLV GDVVLAIGNP FGVGQTVTHG
IVSALARTQV GITDYQFFIQ TDAAINPGNS GGALVDVSGK LVGINTAIFS RSGGSQGIGF
AIPANMVRVV VASAKSGGKA VKRPWLGARL QAVSPEIAET LGLKRPGGAL VASVTKGSPA
ERAGLKLSDL IVSIDGFAID DPNAFDYRFA TRPLGGAAQL EVQRSGKAVK LSIPLETAPD
SGRDELVITS RSPFQGAKIA NISPAIADEM RLDPSVEGVV VTDLPDDSTA ANVGFQKGDI
IVAVNNTRIG KTSDLERVAG QTARLWRIML VRGGQQIQVT LGG