Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3801 |
Symbol | |
ID | 3911604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4336545 |
End bp | 4338686 |
Gene Length | 2142 bp |
Protein Length | 713 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885702 |
Product | TPR repeat-containing protein |
Protein accession | YP_487406 |
Protein GI | 86750910 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.386604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCCCG GACGCAACAA GGCCCCCAGC GCCTCGCCGC CCTACGACAA ATCCTACGAG CCCGTGCTGC TGCTGATGCG GGCGCGGGTG ATGCACCAGG CCGGGCAATA CGACGAGGCG AAATCCGCCT ACAAGAAGGT GCTGAAGAAG AGCCCGAACA ACTTCCAGGC GCTGCACTTT CTCGGCCTCG CCGAATTTCA GACCGGACAT TTCGACGCCG GCATCCGCTC GCTGAAGCGC GCGCTGATCG AAGACCCGAA ATCGGCGCAG GCGCAGTCCG ACCTCGGCAG CGTGCTCAAC GCCGCGCAGC GCTACGACGA AGCGCTGGTC GCCTGCGACA AGGCGATCGC GCTGGATCCG GCGCTCGCCT TCGCCCATGC CAATCGCGGC AACGTGCTGA TCACGCTCGG CCGCTACGAC GAAGCGGTCG CCAGCCTCGA CCGGGCGCTC GAGCTCGTTC CGGACCACAC CGACACCTGG AACGACCGCG GCAACGCGCT GCACAAGCTC GGCCGCTACG ACGAGGCGCT GAACAGCTAC GCCCAGGCGA TCAGGATCGA TCCGCTGCAC GACGTCGCCT TCATGAACCA GGCGACCACG CTGAAGGAGA TGAAGCAGTT CGACCTGGCG CTGGCGAGCT ACGACCGCGC GCTGTCGATC GGCAAGCGAC CGATCGACGC CGGCATCGCG CGCGCCGATC TGCTGCTGCA GATGAAGAAC GTCGAGGGCG CGCTCGCGAC CTGCACGGCG CTGCTGAAGA TCGAGCCCGA CTTCGTCCCC GCCCTGACGC TGCTCGGCAA TTGCATGGCC TCGCTCGGCG ACGCCGACAC CGCGACCGCG CTGCACGGCC GCGCGCTGGC GCTGAAGCCG GACTACGAGC CGGCAATTTC CAGCCGGATC TTCTCGATGG ACTTCTGCTC CGATGCGGAC TTCCAGTCGC AGCAGGCCGC GCGCGCGGAC TGGTGGAAGC ACGTCGGCGC GCGGCTGTAC AAGAGCCATG CGGCGCCGCT CGCCAACGAT CGCGACCCAG AGCGCCGCCT GGTGGTCGGT TACGTCTCGG CCGATTTCCG CCAGCATTCC GCGGCGTTCT CGTTCCGCCC GGTGATCGAG AATCACGACC GCACGCAGGT CGAAGTGATC TGCTACTCCG GCGTCGTGCT GCCCGACGCC GCGACCAAAT CGTTCGAGGC GATCGCCGAC AGGTGGCGCG ACTCCTCGCA GTGGACCGAC GCCAGGCTCG CCGACACGAT CCGCGCCGAC AAGGTCGACA TCCTGATCGA CCTGTCGGGC CATTCGGCCG GCAACCGCCT GCGGGTGTTC GCGCGAAAGC CGGCGCCGGT GCAGGTCACC GCCTGGGGCC ACGCCACCGG CACCGGCCTG CCGGTGATCG ACTATCTGCT GGCCGATCCG GTCGCGGTAC CCAACGAGGT TCGACAGTTC TATGCGGAAG CGATCTACGA TCTGCCCTCG ATCGTGATCA TCGAACCGCC GCCTGCGGGG CTGCATGCCA CCGAGCTGCC GTTCGACCGC AACGGCTATC TGACCTACGG CTCGCTCAAC CGCATCAGCA AGATCTCGGA TGCGGCGATC GCGGCCTGGG CGCGGATCAT GACCGGCAAT CCGACCTCGC GGCTGATCCT GAAGGATCAC CAGATCGACG ATCCCGCCGT GCGACAGACG CTGCTCGACA AGTTCGCCGC GCAAGGCATC GCCGCCGAAC GCCTCACGCT GCTCGGTTCG ACGTCGCGGC AGGAGCATCT GGAGACGCTG CAACAGATCG ACCTCGGCCT CGATCCGTTC CCGCAAGCCG GCGGCGTTTC GACCTGGGAA GCGCTGCATA TGGGCGTGCC GGTGGTGAGC CGGCTCGGCA ACACCGTCGC CAGCCGGGTT GGCTCTGCGA TCCTGTCGGC CGCCGGCCTG CCGGACTTCA TCGCCACCAG CGAAGAGCGC TACATCGCGA TCGCGCTCGA TCCGGATCGC GAGCGGCTGC GCGCGATCCG CCGCGGCCTG CCCGCCTTCA TCGCCGAGCG CTGCGGCCCC GCCGCCTACA CCCGCGCCGT CGAGGACGCC TACCGCACGA TGTGGCGCCG CTGGTGCGCG ACGCCGGCGG ACGCGAAGCC GGCGGACGGC AAGCGGCGCT GA
|
Protein sequence | MQPGRNKAPS ASPPYDKSYE PVLLLMRARV MHQAGQYDEA KSAYKKVLKK SPNNFQALHF LGLAEFQTGH FDAGIRSLKR ALIEDPKSAQ AQSDLGSVLN AAQRYDEALV ACDKAIALDP ALAFAHANRG NVLITLGRYD EAVASLDRAL ELVPDHTDTW NDRGNALHKL GRYDEALNSY AQAIRIDPLH DVAFMNQATT LKEMKQFDLA LASYDRALSI GKRPIDAGIA RADLLLQMKN VEGALATCTA LLKIEPDFVP ALTLLGNCMA SLGDADTATA LHGRALALKP DYEPAISSRI FSMDFCSDAD FQSQQAARAD WWKHVGARLY KSHAAPLAND RDPERRLVVG YVSADFRQHS AAFSFRPVIE NHDRTQVEVI CYSGVVLPDA ATKSFEAIAD RWRDSSQWTD ARLADTIRAD KVDILIDLSG HSAGNRLRVF ARKPAPVQVT AWGHATGTGL PVIDYLLADP VAVPNEVRQF YAEAIYDLPS IVIIEPPPAG LHATELPFDR NGYLTYGSLN RISKISDAAI AAWARIMTGN PTSRLILKDH QIDDPAVRQT LLDKFAAQGI AAERLTLLGS TSRQEHLETL QQIDLGLDPF PQAGGVSTWE ALHMGVPVVS RLGNTVASRV GSAILSAAGL PDFIATSEER YIAIALDPDR ERLRAIRRGL PAFIAERCGP AAYTRAVEDA YRTMWRRWCA TPADAKPADG KRR
|
| |