Gene RPB_3801 details


Gene Information

Locus tag: RPB_3801
Symbol:
ID: 3911604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4336545
End bp: 4338686
Gene Length: 2142 bp
Protein Length: 713 aa
Translation table: 11
GC content: 68%
IMG OID: 637885702
Product: TPR repeat-containing protein
Protein accession: YP_487406
Protein GI: 86750910
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family
        [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
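
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4336545, 4338686)
print(length)                  # 2142
print(protein_length(length))  # 713
```

Under these assumptions the computed values agree with the Gene Length (2142 bp) and Protein Length (713 aa) fields.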


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.386604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCCCG GACGCAACAA GGCCCCCAGC GCCTCGCCGC CCTACGACAA ATCCTACGAG 
CCCGTGCTGC TGCTGATGCG GGCGCGGGTG ATGCACCAGG CCGGGCAATA CGACGAGGCG
AAATCCGCCT ACAAGAAGGT GCTGAAGAAG AGCCCGAACA ACTTCCAGGC GCTGCACTTT
CTCGGCCTCG CCGAATTTCA GACCGGACAT TTCGACGCCG GCATCCGCTC GCTGAAGCGC
GCGCTGATCG AAGACCCGAA ATCGGCGCAG GCGCAGTCCG ACCTCGGCAG CGTGCTCAAC
GCCGCGCAGC GCTACGACGA AGCGCTGGTC GCCTGCGACA AGGCGATCGC GCTGGATCCG
GCGCTCGCCT TCGCCCATGC CAATCGCGGC AACGTGCTGA TCACGCTCGG CCGCTACGAC
GAAGCGGTCG CCAGCCTCGA CCGGGCGCTC GAGCTCGTTC CGGACCACAC CGACACCTGG
AACGACCGCG GCAACGCGCT GCACAAGCTC GGCCGCTACG ACGAGGCGCT GAACAGCTAC
GCCCAGGCGA TCAGGATCGA TCCGCTGCAC GACGTCGCCT TCATGAACCA GGCGACCACG
CTGAAGGAGA TGAAGCAGTT CGACCTGGCG CTGGCGAGCT ACGACCGCGC GCTGTCGATC
GGCAAGCGAC CGATCGACGC CGGCATCGCG CGCGCCGATC TGCTGCTGCA GATGAAGAAC
GTCGAGGGCG CGCTCGCGAC CTGCACGGCG CTGCTGAAGA TCGAGCCCGA CTTCGTCCCC
GCCCTGACGC TGCTCGGCAA TTGCATGGCC TCGCTCGGCG ACGCCGACAC CGCGACCGCG
CTGCACGGCC GCGCGCTGGC GCTGAAGCCG GACTACGAGC CGGCAATTTC CAGCCGGATC
TTCTCGATGG ACTTCTGCTC CGATGCGGAC TTCCAGTCGC AGCAGGCCGC GCGCGCGGAC
TGGTGGAAGC ACGTCGGCGC GCGGCTGTAC AAGAGCCATG CGGCGCCGCT CGCCAACGAT
CGCGACCCAG AGCGCCGCCT GGTGGTCGGT TACGTCTCGG CCGATTTCCG CCAGCATTCC
GCGGCGTTCT CGTTCCGCCC GGTGATCGAG AATCACGACC GCACGCAGGT CGAAGTGATC
TGCTACTCCG GCGTCGTGCT GCCCGACGCC GCGACCAAAT CGTTCGAGGC GATCGCCGAC
AGGTGGCGCG ACTCCTCGCA GTGGACCGAC GCCAGGCTCG CCGACACGAT CCGCGCCGAC
AAGGTCGACA TCCTGATCGA CCTGTCGGGC CATTCGGCCG GCAACCGCCT GCGGGTGTTC
GCGCGAAAGC CGGCGCCGGT GCAGGTCACC GCCTGGGGCC ACGCCACCGG CACCGGCCTG
CCGGTGATCG ACTATCTGCT GGCCGATCCG GTCGCGGTAC CCAACGAGGT TCGACAGTTC
TATGCGGAAG CGATCTACGA TCTGCCCTCG ATCGTGATCA TCGAACCGCC GCCTGCGGGG
CTGCATGCCA CCGAGCTGCC GTTCGACCGC AACGGCTATC TGACCTACGG CTCGCTCAAC
CGCATCAGCA AGATCTCGGA TGCGGCGATC GCGGCCTGGG CGCGGATCAT GACCGGCAAT
CCGACCTCGC GGCTGATCCT GAAGGATCAC CAGATCGACG ATCCCGCCGT GCGACAGACG
CTGCTCGACA AGTTCGCCGC GCAAGGCATC GCCGCCGAAC GCCTCACGCT GCTCGGTTCG
ACGTCGCGGC AGGAGCATCT GGAGACGCTG CAACAGATCG ACCTCGGCCT CGATCCGTTC
CCGCAAGCCG GCGGCGTTTC GACCTGGGAA GCGCTGCATA TGGGCGTGCC GGTGGTGAGC
CGGCTCGGCA ACACCGTCGC CAGCCGGGTT GGCTCTGCGA TCCTGTCGGC CGCCGGCCTG
CCGGACTTCA TCGCCACCAG CGAAGAGCGC TACATCGCGA TCGCGCTCGA TCCGGATCGC
GAGCGGCTGC GCGCGATCCG CCGCGGCCTG CCCGCCTTCA TCGCCGAGCG CTGCGGCCCC
GCCGCCTACA CCCGCGCCGT CGAGGACGCC TACCGCACGA TGTGGCGCCG CTGGTGCGCG
ACGCCGGCGG ACGCGAAGCC GGCGGACGGC AAGCGGCGCT GA
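
The gene sequence is printed in blocks of 10 bases separated by spaces, so whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first row of the listing; the whole sequence would be needed to compare against the reported GC content field.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence listing, copied as printed.
first_row = "ATGCAGCCCG GACGCAACAA GGCCCCCAGC GCCTCGCCGC CCTACGACAA ATCCTACGAG"
cleaned = clean_sequence(first_row)
print(len(cleaned))  # 60
print(f"GC fraction of first row: {gc_fraction(cleaned):.2f}")
```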
 
Protein sequence
MQPGRNKAPS ASPPYDKSYE PVLLLMRARV MHQAGQYDEA KSAYKKVLKK SPNNFQALHF 
LGLAEFQTGH FDAGIRSLKR ALIEDPKSAQ AQSDLGSVLN AAQRYDEALV ACDKAIALDP
ALAFAHANRG NVLITLGRYD EAVASLDRAL ELVPDHTDTW NDRGNALHKL GRYDEALNSY
AQAIRIDPLH DVAFMNQATT LKEMKQFDLA LASYDRALSI GKRPIDAGIA RADLLLQMKN
VEGALATCTA LLKIEPDFVP ALTLLGNCMA SLGDADTATA LHGRALALKP DYEPAISSRI
FSMDFCSDAD FQSQQAARAD WWKHVGARLY KSHAAPLAND RDPERRLVVG YVSADFRQHS
AAFSFRPVIE NHDRTQVEVI CYSGVVLPDA ATKSFEAIAD RWRDSSQWTD ARLADTIRAD
KVDILIDLSG HSAGNRLRVF ARKPAPVQVT AWGHATGTGL PVIDYLLADP VAVPNEVRQF
YAEAIYDLPS IVIIEPPPAG LHATELPFDR NGYLTYGSLN RISKISDAAI AAWARIMTGN
PTSRLILKDH QIDDPAVRQT LLDKFAAQGI AAERLTLLGS TSRQEHLETL QQIDLGLDPF
PQAGGVSTWE ALHMGVPVVS RLGNTVASRV GSAILSAAGL PDFIATSEER YIAIALDPDR
ERLRAIRRGL PAFIAERCGP AAYTRAVEDA YRTMWRRWCA TPADAKPADG KRR
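
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to sense codons as the standard genetic code, handles only unambiguous A, C, G and T bases, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares for sense codons); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listing.
print(translate("ATGCAGCCCG GACGCAACAA GGCCCCCAGC"))  # MQPGRNKAPS
```

The output matches the first block of the protein sequence listing. Passing the full gene sequence to `translate` would allow the same comparison for all residues.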