Gene Rpal_4002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4002 
Symbol 
ID6411684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4288937 
End bp4290442 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content67% 
IMG OID642713884 
Productprotease Do 
Protein accessionYP_001992973 
Protein GI192292368 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGCTG CGATCGCCTT TAGTTCCCGG ATCAGGCCCT TGCGGATCAG GCCGCTGCTC 
GCCGCGCTGT GCCTCGGCGC GGCTGTGATC GCCGGCCCGG CGCCGGCCTC CGCCCGCGGC
CCCGAAGGGA TCGCCGACGT CGCCGAGAAG GTGATCGATG CGGTGGTCAA CATCTCCACC
ACCCAGACGG TCGACGCCAA GGGCTCGGGC GAGAGCAAGG GCGGCGCCGC GCCGCAACTG
CCGCCGGGCT CGCCGTTCGA GGAGTTCTTC GAGGACTTCT TCAAGAACCG CCGCGGCGAA
AAGGGCGGCG GTCCGCGGAA GACCAATTCG CTCGGCTCCG GCTTCATCAT TGATCCGGCC
GGCGTGGTCG TGACCAACAA TCACGTCATC GCCGATTCCG ACGAGATCAA CGTGATCCTC
AACGACGGCG CCAAGATCAA GGCGGAGCTG GTCGGCGTCG ACAAGAAGAC CGATCTTGCG
GTGCTGAAGT TCAAGCCGCC GGCGGGCAAG ACGCTGACCG CGGTGAAGTT CGGCGACAGC
GACAAGCTGC GGCTGGGCGA ATGGGTGATC GCGATCGGCA ATCCGTTCTC GCTCGGCGGC
ACCGTGACCG CCGGCATCGT CTCGGCGCGC AACCGCGACA TCAACTCCGG CCCTTATGAC
AGCTACATCC AGACCGACGC CGCGATCAAT CGCGGCAACT CCGGCGGTCC GCTGTTCAAC
CTCGCCGGCG AAGTGATCGG CGTGAACACT CTGATCATCT CGCCGTCGGG CGGCTCGATC
GGCATTGGCT TCGCGGTGCC GTCCAAGACC GTGGTGCCGG TGGTCGATCA GCTGCGTCAG
TTCGGCGAAC TGCGCCGCGG CTGGCTCGGC GTCCGCATCC AGCAGGTCAC CGACGAAATC
GCCGAGAGCC TGAGCATCAA GCCGGCGCGC GGCGCACTGG TCGCCGGCGT CGACGACAAG
GGCCCGGCCA AGCCGGCCGG CATCGAGCCC GGCGACGTGG TGGTGAAGTT CGACGGCAAG
GACATCAAGG AGCCGAAGGA CCTGTCGCGC ATCGTTGCCG ACACCGCGGT CGGCAAGACC
GTCGATGTCG TGGTGATCCG CAAGGGCAAG GAAGAAACCA AGCAGGTCAC GCTCGGCCGG
CTCGACGACG ATGCCAAGCC GCAACCGGCT TCGGCGAAGT CGCAGCCCGA GGCGGACAAG
CCGGTGACCC AGAAGGTGCT CGGTCTCGAT CTCGCCGCGC TGTCGAAGGA TTTGCGCGGC
CGCTACAAGA TCAAGGACAG CGTCAAGGGC GTGCTGGTGA CCGGTGTCGA CGACGGCTCC
GACGCGGCCG AGAAGCGGCT GTCGGCCGGC GACGTCATCG TCGAGGTGGC GCAGGAGTCG
GTCGGCAGCG CCGCCGACAT CAAGAAGCGT GTCGATCAGC TCAAGAAGGA CGGCAAGAAG
TCTGTGCTGC TGCTGGTCGC CAACGCTTCC GGCGAGCTGC GCTTCGTCGC GCTCAGCCTA
CAATAG
 
Protein sequence
MPAAIAFSSR IRPLRIRPLL AALCLGAAVI AGPAPASARG PEGIADVAEK VIDAVVNIST 
TQTVDAKGSG ESKGGAAPQL PPGSPFEEFF EDFFKNRRGE KGGGPRKTNS LGSGFIIDPA
GVVVTNNHVI ADSDEINVIL NDGAKIKAEL VGVDKKTDLA VLKFKPPAGK TLTAVKFGDS
DKLRLGEWVI AIGNPFSLGG TVTAGIVSAR NRDINSGPYD SYIQTDAAIN RGNSGGPLFN
LAGEVIGVNT LIISPSGGSI GIGFAVPSKT VVPVVDQLRQ FGELRRGWLG VRIQQVTDEI
AESLSIKPAR GALVAGVDDK GPAKPAGIEP GDVVVKFDGK DIKEPKDLSR IVADTAVGKT
VDVVVIRKGK EETKQVTLGR LDDDAKPQPA SAKSQPEADK PVTQKVLGLD LAALSKDLRG
RYKIKDSVKG VLVTGVDDGS DAAEKRLSAG DVIVEVAQES VGSAADIKKR VDQLKKDGKK
SVLLLVANAS GELRFVALSL Q