Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2723 |
Symbol | |
ID | 3910516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3107292 |
End bp | 3108320 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637884623 |
Product | TPR repeat-containing protein |
Protein accession | YP_486336 |
Protein GI | 86749840 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.26129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.431849 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCTT GGCGCGGTTG CGACACCGTG AACCGCAACC GCGCAGCCTG CGACAAAGCC GTCAACGCAG GCGCGAGGGA AACCGACATG CGATCTCACA TGGTCTCGTC GAATGTCGTG GCGGTCGCGC TGTTCGCGTT GCTGTCGCAA AGTGCCGCGG CCGAGGACGC CGCGTGGAAA GGCTGTGTCG GACTGCAGGG CTCTCCGGCG GAGCGCGTCG CCGCCTGCAG CACGGTGATC GAGACCAACG CCGAAACTGG CCGGCGGCTG GCGGCGGCGT ATTGCAACCG GGGCCACGGC CTGACCGAAC AGCGCAAGCT CGACGAGGCG ATGGCCGATC TCGAAGCGGC GGTGCGGCTC GATCCGGGAT TCGCCTGCGC CTACAACAAT CGGGGCCGGG TCTATGCGTT CAAGGGAGAG GGCGATCGCG CGTTGGCCGA CTACGACGAA GCAATCAGGC TCGATGCGAA ATTCGCGCTG GCCTACAACA ACCGCGCCAT GATCTGGCTC GCCCGGCGCG ACCCCGACCG CGCACTGGAC GACCTCTCCG CGGCGATCAC AGCGGACCCG GGGCTCGCCG TCGCTTACGG CAATCGCGGC CACATCTACT ATCAGCAGCG CGACATGGCT CGTGCGCTGG CGGATTTCGA CGCCGAGATC GCCTTGCGGC CCAACGTGCT CGCCTACATC AATCGCGGCA ATGTCCATCG CGACACCGAG CAACTCGACC GCGCCGCCGC GGATTACGGC GAGGCGATCC GGCTGGCGCC GGAGGACGCC CGCGGCTGGC GCAATCGGGC GCTGATCAAG CTGTACCAGG GCGACAACAA GGGCGGCCTC GCCGACTACG ACAAGGCGCT ACGCTACGAT CCGGCCGACG TGTTCTCCTG GAACAACCGC GCCCAGGCCA GGATGCGGCT CGGCGACCGC AGCGGCGCGA TCGCGGATTT CCGCAAGGCG CTGGAATTGC GGCCGGGCCT GCAGACCGCG CGGGATTCGC TGAAGCGGCT CGGCGCTGCG GTGAACTGA
|
Protein sequence | MPPWRGCDTV NRNRAACDKA VNAGARETDM RSHMVSSNVV AVALFALLSQ SAAAEDAAWK GCVGLQGSPA ERVAACSTVI ETNAETGRRL AAAYCNRGHG LTEQRKLDEA MADLEAAVRL DPGFACAYNN RGRVYAFKGE GDRALADYDE AIRLDAKFAL AYNNRAMIWL ARRDPDRALD DLSAAITADP GLAVAYGNRG HIYYQQRDMA RALADFDAEI ALRPNVLAYI NRGNVHRDTE QLDRAAADYG EAIRLAPEDA RGWRNRALIK LYQGDNKGGL ADYDKALRYD PADVFSWNNR AQARMRLGDR SGAIADFRKA LELRPGLQTA RDSLKRLGAA VN
|
| |