Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1085 |
Symbol | |
ID | 3910171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1245973 |
End bp | 1247751 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637882978 |
Product | TPR repeat-containing protein |
Protein accession | YP_484706 |
Protein GI | 86748210 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCTTC ACCGCTTCCG TCGTTCGATG TTTGTCGCCG TCACCGTCGC GGCGCTGCCG ATCGCGGGGC AGGCGCTGGC GCAGACTCCG GACCATCCGG GCGACAATTC CGCGCAGTTT CCGACCAGCC AGGATCTGCG GTCGATGACC ACGGCGGGCA GCTATCTCGC CGCCCGCCAC GCCAGCGTCG AGCGCGACGC CGCCTCGGCC GCGGCGTTCT ATCGGTCGGC GCTGCGCACC GACCCGAAGA ACAACGAATT GCTCGACCGC GCCTTCATCT CTTCGCTGGC CGAAGGCAAT ATCGAGGAGT CGGTCAAGCT CGCCGACCGG ATTCTCAAGA TCGACAAGAC CAACCGCGTG GCGCGGCTGG TGATCGGCGT GCGCGATCTG AAGACCAAGA AATACGCCGC AGCGGTCCAG AACGTGAATC TGTCGGTCCG CGGCCCGATC ACCGATCTGG TCGCGACGCT GCTGTCGAGC TGGGCGATGG AGGGCGCCGG CGACGTCAAG GGCGCCGTCG CCAATATCGA CAAGCTCGCC GGTCCGGAAT GGTATCCGAT CTTCAAGGAT CTGCATTCCG GCATGATGCT CGAGCTCGCC AACAAGCAGA AGGACGCCGG CGTCCGCTTC GAGCGGGCCT ACAAGCTCGA CGATTCCGCG CTTCGGGTGA CGGATGCCTA TGCGCGCTGG CTGTCGCGCA ACAAGGACGA CGGCTCCGCG GTCGCGATCT ACGAGGGCTT CGACAAGAAG CTGTCGCGCC ATCCGTTGGT GTTGGAGGGA TTGCGTGACG CCAAGGCCGG CAAGAAGCTG CCGCCGCTGG TCGACAGCCC GCAGGCCGGC GCTGCCGAAG CGCTGTACGG TATCGGAGCG TCGCTGACCC GCCGTGGCGG CGAGGACCTC GCGCTGGTCT ATCTGCAGCT CGCGCTGTAT CTGAAGCCCG ATCACGCGCT GGCGCTGCTG GCGCTCGGCG ATCTGTACGA ATCGGTGAAG AAGCCGCAGA TGGCGGTGAA GGTCTACGAG CGCGTGCCGG CGGATTCGCC GCTCAAGCGC AACGCCCAGA TCCAGCTCGC CACCGATCTC GACGCGATCG ACCGCAGCGA GGAAGCGATC AAGATCCTGA AGACGGTTAT CGCCGAGGAC GGCAAGGACC TCGAGGCGAT CATGGCGCTC GGCAACATCG AGCGCGGCCG CAAGAAGTTC GCCGATTGCG CGGTCACCTA CAGCCAGGGC ATCGATGCGC TCACCGGCAC CGAGAAGAAC AGCTGGGTCT ATTATTATTT CCGCGGCATC TGCGAGGAGC GTTCCAAGCA GTGGGCCAAG GCCGAGGTCG ACATGAAGAA GGCGCTGCAG CTGCAGCCCG AGCAGCCGCA TGTTCTGAAC TATCTCGGCT ATTCCTGGAT CGACCAGGGC ATCAATCTCG ACGAAGCGAT GAAGATGATC AAGCGCGCCG TCGATCAGCG CCCCGACGAC GGCTACATCG TCGACTCGCT CGGCTGGGCT TATTTCCGCA TCGGCAATTA CGAAGAGGCG GTGAAGACGC TGGAGCGCGC CATCGATCTG AAGCCGGAAG ATCCGACCAT CAACGATCAC CTCGGCGACG CCTATTGGCG CGTCGGGCGA ACGCTGGAGG CGCGCTTCCA GTGGGCGCAC GCCCGCGATC TCAAGCCGGA TCCGGAAGAG TTGCCGAAGA TCGAGGCCAA GCTCGCCAAC GGTCTCCCGG AGGACACCTC GTCGGCGGCG TCGGCGGACA AGAAAAAAGA CGACGACAAG GGCGGCTGA
|
Protein sequence | MLLHRFRRSM FVAVTVAALP IAGQALAQTP DHPGDNSAQF PTSQDLRSMT TAGSYLAARH ASVERDAASA AAFYRSALRT DPKNNELLDR AFISSLAEGN IEESVKLADR ILKIDKTNRV ARLVIGVRDL KTKKYAAAVQ NVNLSVRGPI TDLVATLLSS WAMEGAGDVK GAVANIDKLA GPEWYPIFKD LHSGMMLELA NKQKDAGVRF ERAYKLDDSA LRVTDAYARW LSRNKDDGSA VAIYEGFDKK LSRHPLVLEG LRDAKAGKKL PPLVDSPQAG AAEALYGIGA SLTRRGGEDL ALVYLQLALY LKPDHALALL ALGDLYESVK KPQMAVKVYE RVPADSPLKR NAQIQLATDL DAIDRSEEAI KILKTVIAED GKDLEAIMAL GNIERGRKKF ADCAVTYSQG IDALTGTEKN SWVYYYFRGI CEERSKQWAK AEVDMKKALQ LQPEQPHVLN YLGYSWIDQG INLDEAMKMI KRAVDQRPDD GYIVDSLGWA YFRIGNYEEA VKTLERAIDL KPEDPTINDH LGDAYWRVGR TLEARFQWAH ARDLKPDPEE LPKIEAKLAN GLPEDTSSAA SADKKKDDDK GG
|
| |