Gene RPD_1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1920 
Symbol 
ID4022402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2158295 
End bp2159647 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content68% 
IMG OID637962113 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_569056 
Protein GI91976397 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.69001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCTCG ACAGGCGTCG ACTGATCGGA TTAGCAGCCG CAGCGCTGAC GCTGTCGGCG 
ACACCGATGC GCGCGGCCCC GACCTCGCAA CGTGGCCGCG ACGCTGCCCA GCTCGGTATT
CGTCCCGACA GCTCTGATGA CCAGACCGCG GCGCTGCAAC GCGCAATCGA CAATGCCGCA
CACGCCCGCG TCCCGCTGGC GCTGCCGCCC GGCAATTATC GCACCGGTAC CCTCCGATTG
CCGTCAGGCG CACAGCTCAG CGGCGTCCGT GGCGCAACGC GCTTGATCTT CACCGGTGGA
CCGTCGCTGT TCGACAGCGC CGGCGCCGAG ACGCCGACGC TGAACGGCCT CGTCCTCGAC
GGCGGCGCGA TCCCGCTGCC GGCGCGGCGC GGCCTCGTAC ATGTCGTCGG CGCACGCAAC
CTGCGCATCA CGGATTGCGA GATCACCGCC AGCGGCGGCA GCGGCGTCTG GCTCGAAACC
ACCTCCGGCG CGATCACCGA CAATTTGCTG ACCGCGATCG CGGTGACCGG CGTGGTGTCG
TTCGACGCCA AGGGCCTGAG TGTCTCACGC AACGCCCTCG TCGGGGCCAA CAACAACGGC
ATCGAGATCC TGCGCACCTC GATCGGCGAC GACGGCAGCA TCGTCACCGG CAACAGGATC
GAGAACATCA AGGCCGGCCC CGGCGGCTCG GGCCAGTACG GCAACGCCAT CAACGCGTTT
CGCGCCGGCA ACGTCATCGT CAGCGGCAAC CGGATCAAGA ACTGCGATTA CTCCGCCGTA
CGAGGCAATT CGGCGTCGAA CATCCACATC ACCGACAATA GCGTCAGCGA CGTGCGCGAG
GTCGCGCTGT ACTCCGAATT CGCGTTCGAG GGCGCGGTGA TCTCGGGTAA CACCGTCGAC
GGAGCCGCGC TCGGCGTCTC GGTCTGCAAT TTCAACGAGG GCGGACGGCT CAGCGTCGTG
CAGGGCAACA TCATCCGCAA TTTGTTGCCG AAGCGGCCGA TCGGCACCGC GCCGGACGAC
GACGCCGGGA TCGGCATCTA TGTCGAGGCC GACACCGCGG TGACTGGCAA TGTGATCGAG
AACGCGCCGT CGTTCGGCAT CGTCGCCGGC TGGGGGCGAT ATCTGCGCGA CGTCGCGATC
ACCGGCAACG TCGTGCGCAG GGCGTTCGTC GGCATCGGCG TGTCCGTTGC CGAAGGCGCC
GGCACCGCCA CGATCAATGG CAACGTCATC GCCGAGGCGT CGCGTGGCGC AGTCGTCGGG
CTCGATCACG CGCGGCCGGT GACGCCGGAC CTGACCGCGC CCGGGGCTGC GCGGTTCGCC
CAGATCGCGC TCGGCAGTAA TTCGGTGCGG TGA
 
Protein sequence
MALDRRRLIG LAAAALTLSA TPMRAAPTSQ RGRDAAQLGI RPDSSDDQTA ALQRAIDNAA 
HARVPLALPP GNYRTGTLRL PSGAQLSGVR GATRLIFTGG PSLFDSAGAE TPTLNGLVLD
GGAIPLPARR GLVHVVGARN LRITDCEITA SGGSGVWLET TSGAITDNLL TAIAVTGVVS
FDAKGLSVSR NALVGANNNG IEILRTSIGD DGSIVTGNRI ENIKAGPGGS GQYGNAINAF
RAGNVIVSGN RIKNCDYSAV RGNSASNIHI TDNSVSDVRE VALYSEFAFE GAVISGNTVD
GAALGVSVCN FNEGGRLSVV QGNIIRNLLP KRPIGTAPDD DAGIGIYVEA DTAVTGNVIE
NAPSFGIVAG WGRYLRDVAI TGNVVRRAFV GIGVSVAEGA GTATINGNVI AEASRGAVVG
LDHARPVTPD LTAPGAARFA QIALGSNSVR