Gene RPB_0831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0831 
Symbol 
ID3909089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp944818 
End bp947358 
Gene Length2541 bp 
Protein Length846 aa 
Translation table11 
GC content70% 
IMG OID637882724 
Producthypothetical protein 
Protein accessionYP_484453 
Protein GI86747957 
COG category[R] General function prediction only 
COG ID[COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain 
TIGRFAM ID[TIGR02302] conserved hypothetical protein TIGR02302 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.844423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCGGAC GCAGTCCCGA CCCGTCACAG GCCCCGCGCG ATCCAGACGC GACCGCGCGG 
CTGCGGCTGG CCACGGCTCT GCAGCGGGCG ACGGTCGCGA TCGCCTGGGA GCGGAGTTGG
CCGCTGCTGG TCCGGCTGCT GAGCGTCGTG GGCCTGTTTC TGGCGGCGTC CTGGGCCGGG
CTGTGGCTGG CGCTGCCGTT CACCGGGCGC ATCGTCGGCC TGGTGCTGTT CGCTGCGCTG
GCGCTGGTGG CGCTCTACCC CGTCCTCAAG TTCCGCTGGC CGAGCCGCGA CGAGGCGCTC
GTCCGGCTCG ACCGCAACAC CGGGCTGAAG CATCGGCCCG CAACAGCGCT GACCGATACG
CTGGCGTCGA GCGATCCGGT GGCGCAGGCG CTGTGGCGGG CGCAACGCGA GCGCACGCTG
GCGGCGCTCC AGGGCATTCG CGCCGGGCTG CCGGCGCCGC GGCTGCCGAA GCACGATCCA
TGGGCGCTGC GCGCCCTGGT CGCGGTGCTG CTGGTCGCCA CCTTCATCGC CGCCGGCGAG
GAACGGACCG CGCGCGTCGC CGCGGCGTTC GACTGGAACG GCGCGCTGGC CGCGCCCAAC
GTCCGGGTCG ATGCCTGGGT GACGCCGCCG GTCTACACCA ACAAGCCGCC GATCATCCTG
TCGGCCGCCA ACAAGGATCT CGCCGCGCAG AACCAGGCGG CGCTGCCGGT GCCGGCCGGC
TCGACGCTGC TGGTGCGATC CAGCGGCGGC GCGCTCGACG TCGCGGTGAC GGGCGGGATT
GTCGAAGCCC AGCCGGAGGG CGAGGCGCCG GCGGGCACCA GCGAGCGGCA TTTCAAGATC
ACCGGCGACG GCACCGCGCA TGTCCGCGCC CCGTCCGGCC AGCCGCAATG GGCGTTCAAG
GTGACGACCG ATCACGCGCC GTCGATCGCT CTGGCCAAGG AGCCGGAGCG GCAGGCGCGC
GGCTCGCTGC AATTGTCCTA CAAGCTCGAA GACGATTACG GCGTCACCGA AGCCCATGCG
CTGATCGCCC CGTCGCCCTC CGCCGCCGCG CAGACCACCG AGCCGCCACG GCCGCTATAC
GAACCGCCGC ATTTCGCGCT GACGCTGCCG AATGCGCGCA CCCGCGCCGG TGTCGGCCAG
ACCGTGAAGG ATCTCAGCGA AGATCCCTAT GCGGGCGCCG AAGTGACGCT GACGCTGACC
GCCAAGGACG AGGCCGGCAA TGAGGGCCGC AGCGAACCGC ATAAGATGCG GCTGCCGGAG
CGGCTGTTCA CCAAGCCCTT GGCGCGGGCG CTGATCGAGC AGCGCCGCAT CCTGGCGCTC
GACGCCACCA GGAACGCGCA GGTCTACACC GCGCTCGACG CGCTGATGAT CGCGCCGGAA
GCGTTCACGC CGGACGCCGG CCAGTATCTC GGTCTCTACA CCGTCGCCGA CCAGCTCGAG
CGCGCTCGCA CCGACGACGC GCTGCGCGAG GTGGTGGCCA GCCTGTGGTC GCTCGCGCTG
GCGATCGAGG ATGGCGATAC GTCCGACGTC GAGAAGGCGC TACGCGCCGC GCAGGATGCG
CTGAAGCAGG CGCTGGAGCG TGGCGCCTCC GACGAGGAAA TCAAGAAGCT CACCGAAAAT
CTGCGCGCCG CGCTCGACAA TTTCATGCGC CAGCTCGCCG AGCAGATGAA GAACAATCCG
CAGCAACTCG CCCGCCCGCT CGATCCGAAC ACCCGGGTGA TGCGGCAGCA GGACCTCAAC
AACATGATCG AGCGCATGGA GCGGCTGTCG CGCTCCGGCG ACAAGGATGC CGCCAGGCAA
TTGCTCGAAC AGCTCGCCCA GATGCTCGAA AACCTGCAGA TGGCGCAGCC CGGCCAGGGC
GGCGACGACG ACATGCAGCA GTCGATGAAC GAGCTCGGCG ACATGATCCG CAAGCAGCAG
CAACTGCGCG ACAAGACCTT CAAGCAAGGC CAGGATCAGC GCCGTGACCG GATGCGCGGC
CAGAACGGTG AGCAGAGCCT CGGCGATCTG CAGCAGGATC AGCAGAACCT GCAGGAGCGG
CTGCGCAAGC TGCAGCAGGA ACTCGCCAAG CGCGGGATGG GGCAGCAGGG CCAGCGCGGC
CAGAACGGCG AGCAGGGCCA GCAAGGCGAG CAGGGCGAGG GCGGTCTCGA CCAGGCCGAG
TCGGCGATGG GCGACGCCGA AGGCCGGCTC GGCGAAGGCA ATGCCGACGG CGCCGTCGAT
TCCCAGGGCC GTGCGCTCGA TGCGCTGCGC AAGGGCGCGC AGAAGCTGGC CGAAGCGATG
CAGCAGGGCG ACGGGCAGGG CCAGGGCGAT GGCCCGGGCA GCCGTCCCGG CCGGCAGCAG
AGCAGCGGCA ACAACACCGA TCCGCTCGGT CGGCCGTTGC GCGGCCGCGA ATTCGGCGAC
GATCTCACGG TGAAGATTCC CGGCGAAATC GACGTCCAGC GCGTCCGCCG CATCCTCGAA
GAACTCCGCC GCCGCCTCGG CGATTCGGCC CGGCCGCAGC TCGAGCTCGA CTACATCGAG
CGGCTGCTGA AGGATTATTA G
 
Protein sequence
MSGRSPDPSQ APRDPDATAR LRLATALQRA TVAIAWERSW PLLVRLLSVV GLFLAASWAG 
LWLALPFTGR IVGLVLFAAL ALVALYPVLK FRWPSRDEAL VRLDRNTGLK HRPATALTDT
LASSDPVAQA LWRAQRERTL AALQGIRAGL PAPRLPKHDP WALRALVAVL LVATFIAAGE
ERTARVAAAF DWNGALAAPN VRVDAWVTPP VYTNKPPIIL SAANKDLAAQ NQAALPVPAG
STLLVRSSGG ALDVAVTGGI VEAQPEGEAP AGTSERHFKI TGDGTAHVRA PSGQPQWAFK
VTTDHAPSIA LAKEPERQAR GSLQLSYKLE DDYGVTEAHA LIAPSPSAAA QTTEPPRPLY
EPPHFALTLP NARTRAGVGQ TVKDLSEDPY AGAEVTLTLT AKDEAGNEGR SEPHKMRLPE
RLFTKPLARA LIEQRRILAL DATRNAQVYT ALDALMIAPE AFTPDAGQYL GLYTVADQLE
RARTDDALRE VVASLWSLAL AIEDGDTSDV EKALRAAQDA LKQALERGAS DEEIKKLTEN
LRAALDNFMR QLAEQMKNNP QQLARPLDPN TRVMRQQDLN NMIERMERLS RSGDKDAARQ
LLEQLAQMLE NLQMAQPGQG GDDDMQQSMN ELGDMIRKQQ QLRDKTFKQG QDQRRDRMRG
QNGEQSLGDL QQDQQNLQER LRKLQQELAK RGMGQQGQRG QNGEQGQQGE QGEGGLDQAE
SAMGDAEGRL GEGNADGAVD SQGRALDALR KGAQKLAEAM QQGDGQGQGD GPGSRPGRQQ
SSGNNTDPLG RPLRGREFGD DLTVKIPGEI DVQRVRRILE ELRRRLGDSA RPQLELDYIE
RLLKDY