Gene Information |
Locus tag | Rpal_4002 |
Symbol | |
ID | 6411684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4288937 |
End bp | 4290442 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642713884 |
Product | protease Do |
Protein accession | YP_001992973 |
Protein GI | 192292368 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
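The coordinate and length fields above are mutually consistent, which a short check can make explicit. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions match the values in this record but are not stated by it. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not counted.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the Gene Information table for Rpal_4002.
length = gene_length_bp(4288937, 4290442)   # 1506
protein = protein_length_aa(length)         # 501
print(length, protein)
```

Because the gene lies on the minus strand, the start and end values give its extent on the replicon, not the direction of transcription.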
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCTG CGATCGCCTT TAGTTCCCGG ATCAGGCCCT TGCGGATCAG GCCGCTGCTC GCCGCGCTGT GCCTCGGCGC GGCTGTGATC GCCGGCCCGG CGCCGGCCTC CGCCCGCGGC CCCGAAGGGA TCGCCGACGT CGCCGAGAAG GTGATCGATG CGGTGGTCAA CATCTCCACC ACCCAGACGG TCGACGCCAA GGGCTCGGGC GAGAGCAAGG GCGGCGCCGC GCCGCAACTG CCGCCGGGCT CGCCGTTCGA GGAGTTCTTC GAGGACTTCT TCAAGAACCG CCGCGGCGAA AAGGGCGGCG GTCCGCGGAA GACCAATTCG CTCGGCTCCG GCTTCATCAT TGATCCGGCC GGCGTGGTCG TGACCAACAA TCACGTCATC GCCGATTCCG ACGAGATCAA CGTGATCCTC AACGACGGCG CCAAGATCAA GGCGGAGCTG GTCGGCGTCG ACAAGAAGAC CGATCTTGCG GTGCTGAAGT TCAAGCCGCC GGCGGGCAAG ACGCTGACCG CGGTGAAGTT CGGCGACAGC GACAAGCTGC GGCTGGGCGA ATGGGTGATC GCGATCGGCA ATCCGTTCTC GCTCGGCGGC ACCGTGACCG CCGGCATCGT CTCGGCGCGC AACCGCGACA TCAACTCCGG CCCTTATGAC AGCTACATCC AGACCGACGC CGCGATCAAT CGCGGCAACT CCGGCGGTCC GCTGTTCAAC CTCGCCGGCG AAGTGATCGG CGTGAACACT CTGATCATCT CGCCGTCGGG CGGCTCGATC GGCATTGGCT TCGCGGTGCC GTCCAAGACC GTGGTGCCGG TGGTCGATCA GCTGCGTCAG TTCGGCGAAC TGCGCCGCGG CTGGCTCGGC GTCCGCATCC AGCAGGTCAC CGACGAAATC GCCGAGAGCC TGAGCATCAA GCCGGCGCGC GGCGCACTGG TCGCCGGCGT CGACGACAAG GGCCCGGCCA AGCCGGCCGG CATCGAGCCC GGCGACGTGG TGGTGAAGTT CGACGGCAAG GACATCAAGG AGCCGAAGGA CCTGTCGCGC ATCGTTGCCG ACACCGCGGT CGGCAAGACC GTCGATGTCG TGGTGATCCG CAAGGGCAAG GAAGAAACCA AGCAGGTCAC GCTCGGCCGG CTCGACGACG ATGCCAAGCC GCAACCGGCT TCGGCGAAGT CGCAGCCCGA GGCGGACAAG CCGGTGACCC AGAAGGTGCT CGGTCTCGAT CTCGCCGCGC TGTCGAAGGA TTTGCGCGGC CGCTACAAGA TCAAGGACAG CGTCAAGGGC GTGCTGGTGA CCGGTGTCGA CGACGGCTCC GACGCGGCCG AGAAGCGGCT GTCGGCCGGC GACGTCATCG TCGAGGTGGC GCAGGAGTCG GTCGGCAGCG CCGCCGACAT CAAGAAGCGT GTCGATCAGC TCAAGAAGGA CGGCAAGAAG TCTGTGCTGC TGCTGGTCGC CAACGCTTCC GGCGAGCTGC GCTTCGTCGC GCTCAGCCTA CAATAG
|
Protein sequence | MPAAIAFSSR IRPLRIRPLL AALCLGAAVI AGPAPASARG PEGIADVAEK VIDAVVNIST TQTVDAKGSG ESKGGAAPQL PPGSPFEEFF EDFFKNRRGE KGGGPRKTNS LGSGFIIDPA GVVVTNNHVI ADSDEINVIL NDGAKIKAEL VGVDKKTDLA VLKFKPPAGK TLTAVKFGDS DKLRLGEWVI AIGNPFSLGG TVTAGIVSAR NRDINSGPYD SYIQTDAAIN RGNSGGPLFN LAGEVIGVNT LIISPSGGSI GIGFAVPSKT VVPVVDQLRQ FGELRRGWLG VRIQQVTDEI AESLSIKPAR GALVAGVDDK GPAKPAGIEP GDVVVKFDGK DIKEPKDLSR IVADTAVGKT VDVVVIRKGK EETKQVTLGR LDDDAKPQPA SAKSQPEADK PVTQKVLGLD LAALSKDLRG RYKIKDSVKG VLVTGVDDGS DAAEKRLSAG DVIVEVAQES VGSAADIKKR VDQLKKDGKK SVLLLVANAS GELRFVALSL Q
|
| |
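The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any computation. The following is a minimal sketch of how the GC content and the translation could be re-derived from the gene sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG), and it uses the amino acid assignments of translation table 11, which are the same as those of the standard code. All function names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments with the standard code.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing unexpected characters are reported as 'X'.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGCCCGCTG CGATCGCCTT TAGTTCCCGG"
print(translate(prefix))  # MPAAIAFSSR, the first ten residues of the protein sequence
```

Applying `gc_fraction` and `translate` to the complete gene sequence would allow the reported GC content and protein sequence to be compared against the record.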