Gene RPD_0940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0940 
Symbol 
ID4021415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1055775 
End bp1058330 
Gene Length2556 bp 
Protein Length851 aa 
Translation table11 
GC content69% 
IMG OID637961131 
Producthypothetical protein 
Protein accessionYP_568079 
Protein GI91975420 
COG category 
COG ID 
TIGRFAM ID[TIGR02302] conserved hypothetical protein TIGR02302 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCGGAC GCAGTCCCGA CCCGTCACAG ACCCCGCGAG ATCCAGACGC GATGGCGCGG 
CTGCAACTTG CCACGGCTCT GCGACGTGCG ACCTTCGCGA TCGCCTGGGA GCGGAGCTGG
CCGCTTCTCG TCCGTCTGCT CAGCGTCGTC GGCCTGTTTC TGGCTGCGTC GTGGGCCGGT
CTGTGGCTCT CGCTGCCTTT CGTTGGCCGC GTTGCCGGCC TTGCGCTGTT TGCGGCGCTG
GCGGTGGTCG CGCTGCTTCC CGTGCTCAAG TTCCGCTGGC CGAGCCGCGA CGACGCGCTC
GCCCGGCTCG ATCGCGCCAC CGGGCTGAAG CATCGCCCCG CGACCGCGCT GACCGATACG
CTGGCCTCCA GCGATCCGGT GGCGCAGGCG CTGTGGCAGG CGCAGCGCGA GCGAACGCTG
GCGGCGATCA AAGGCGTCAG CGCCGGATTG CCGGCGCCGC GGCTGCCGAA GCACGACCCA
TGGGCGCTGC GCGCGCTGGT CGCGGTGCTG CTGGTCGCGA CCTTCATCGC CGCCGGTGAA
GAGCGCACCG CGCGCGTGGC GGCGGCGTTC GATTGGAACG GAGCGCTGGC TGCGCCGAAC
GTCCGGGTCG ACGCCTGGGT GACGCCGCCG GTCTACACCA ACAAGCCGCC GATCGTGCTG
TCGGCTGCGA ACAGGACTCT CGCCTCCTCG AACGAAGCGG CGCTGCCGGT TCCGGCCGGC
TCGACGCTAC TGGTGCGCTC CAGCGGCGGC GATCTCGACG TTGCGATCAG CGGCGGCGTC
GTCGAGGCGT TGCCGGAGGG CGAAGCGCCG AAGGGCGCCA GCGAGCGGCG CTACAAGATC
ACCGGCGACG GCACGGCGCA TGTTCGTGCG CCGTCCGGTC AGCCGCAATG GTCGTTCAAG
GCGACGCCGG ATCATCCGCC GTCGATCGCT CTGGCGAAGG AGCCGGAGCG GCAGGCGCGC
GGATCGTTGC AACTGTCCTA CAAGCTCGAA GACGATTACG GCGTGACCGA AGCCCATGCG
CTGTTCGCCG CATCGCCCTC CGCGACCGAT CAGAACTCCG ATACGCCGCG GCCGCTCTAT
GAGCCGCCGC AGTTTGCGCT GACGCTGCCG AATGCGCGGA CCCGCGCCGG CGTCGGCCAG
ACCGTCAAGG ATATCAGCGA CGATCCCTAT GCGGGCGCCG AGGTCACGGT GACGCTGACG
GCGAAGGACG AGGCCGGCAA TGAGGGCCGC AGCGAGCCCA CCACGATGCG GCTGCCGGAG
CGACTGTTCA CCAAGCCGTT GGCGCGGGCG CTGATCGAAC AGCGCCGCAT TCTCGCGCTC
GACGCCAACA AGAACGCGCA GGTCTACGCC GCGCTCGACG CGCTGATGAT CGCGCCGGAA
GCATTCACGC CGGAGGCCGG CCAGTATCTC GGCCTCTATA CCGTTGCCGA CCAGCTCGAG
CGCGCGCGCA CCGACGACGC GCTGCGCGAG GTCGTCGGCA ATCTCTGGTC ACTCGCGCTC
AGCATCGAGG ACGGCAATAC ATCGGACGTC GAAAAGGCGC TGCGCGCCGC GCAGGACGCG
CTGAAGCAGG CGCTGGAGCG TGGCGCCTCC GACGAGGAGA TCAAGAAGCT CACCGAGAAT
TTGCGCGCCG CGCTCGACAA CTACATGCGC CAGCTCGCCG AGCAGCTCAA AAACAATCCG
CAGCAGCTCG CCCGTCCGCT CGATCCCAAC GCGCGGGTGA TGCGGCAGCA GGACCTCAAC
AACATGATCG AGCGGATGGA GCGGCTGTCG CGCTCCGGCG ACAAGGAAGC CGCCAAGCAG
TTGCTCGAAC AGCTGGCGCA GATGCTGGAA AACCTGCAGA TGGCGCAGCC CGGCCAGGGC
GGCGACGCCG ACATGCAGCA GGCGATGAAC GAACTCGGCG ACATGATCCG CAAGCAGCAG
CAATTGCGGG ACAAGACCTT CAAGCAGGGT CAGGACCAGC GGCGCGACCG GATGCGCGGC
CAGAAGGGCG AGCAGAATCT CGGCGATCTG CAGCAGGATC AGCAGAACCT GCAGGACCGG
CTGCGCAAGC TGCAGCAGGA GCTCGCCAAG CGCGGTCTCG GCCAGGCCCA GCGCGGCCAG
AAAGGCGAGC CGGGGCAGCA AGGCGAACAG GGCGAGCAGG GCGAGCAGAG CGAGGGCGGC
CTCGATCAGG CCGAGTCCGC GATGGGCGAC GCCGAAGGCC GGCTCGGCGA AGGCAACGCC
GACAGCGCGG TCGACTCGCA GGGCCGCGCG CTCGAGGCGT TGCGCAAGGG TGCGCAGAAG
CTCGCCGAGG CGATGCAGCA GGGCGACCAG CCGGGCCAGG GCGAAGGGCC GGGCAATCGC
CCCGGTCGCC AGCAGAGCAG CGGCAACGAC ACCGACCCGC TCGGCCGGCC GTTGCATGGC
CGCGAATTCG GCGACGATCT GTCGGTAAAA ATTCCCGGCG AGATCGATGT TCAGCGCGTC
CGCCGCATCC TCGAAGAACT CCGCCGCCGC CTCGGCGATT CAGCGCGACC ACAGCTCGAA
CTCGACTACA TCGAGCGGCT GCTGAAAGAT TACTGA
 
Protein sequence
MSGRSPDPSQ TPRDPDAMAR LQLATALRRA TFAIAWERSW PLLVRLLSVV GLFLAASWAG 
LWLSLPFVGR VAGLALFAAL AVVALLPVLK FRWPSRDDAL ARLDRATGLK HRPATALTDT
LASSDPVAQA LWQAQRERTL AAIKGVSAGL PAPRLPKHDP WALRALVAVL LVATFIAAGE
ERTARVAAAF DWNGALAAPN VRVDAWVTPP VYTNKPPIVL SAANRTLASS NEAALPVPAG
STLLVRSSGG DLDVAISGGV VEALPEGEAP KGASERRYKI TGDGTAHVRA PSGQPQWSFK
ATPDHPPSIA LAKEPERQAR GSLQLSYKLE DDYGVTEAHA LFAASPSATD QNSDTPRPLY
EPPQFALTLP NARTRAGVGQ TVKDISDDPY AGAEVTVTLT AKDEAGNEGR SEPTTMRLPE
RLFTKPLARA LIEQRRILAL DANKNAQVYA ALDALMIAPE AFTPEAGQYL GLYTVADQLE
RARTDDALRE VVGNLWSLAL SIEDGNTSDV EKALRAAQDA LKQALERGAS DEEIKKLTEN
LRAALDNYMR QLAEQLKNNP QQLARPLDPN ARVMRQQDLN NMIERMERLS RSGDKEAAKQ
LLEQLAQMLE NLQMAQPGQG GDADMQQAMN ELGDMIRKQQ QLRDKTFKQG QDQRRDRMRG
QKGEQNLGDL QQDQQNLQDR LRKLQQELAK RGLGQAQRGQ KGEPGQQGEQ GEQGEQSEGG
LDQAESAMGD AEGRLGEGNA DSAVDSQGRA LEALRKGAQK LAEAMQQGDQ PGQGEGPGNR
PGRQQSSGND TDPLGRPLHG REFGDDLSVK IPGEIDVQRV RRILEELRRR LGDSARPQLE
LDYIERLLKD Y