Gene RPD_3408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3408 
Symbol 
ID4023921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3790299 
End bp3791546 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content68% 
IMG OID637963613 
Productextensin-like 
Protein accessionYP_570533 
Protein GI91977874 
COG category[S] Function unknown 
COG ID[COG3921] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0393857 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTGTG CGCGCATGAC GCGCGGAGTT CGTTTGTATC TCGTCGGCTC CTTCGTCCTT 
GTGTCGCTTG CTGGTTGCGG ACGCGGACTG TTTCAGTCCG CCGAGCGTGA ACCGTGGCGG
ACCGAGGCCG AGGTCGCTTG TCTGAAATCG GGTGTTATCA AAGAGACCCC GGAACTGGTC
CGGATCGATC CGATTTCGGG CCCGGGCGTT TGCGGCGCCG AATTCCCCTT GAAAGTGGCC
GCGCTCGGCG AAGGCGGTGC GATCGGCTTT GCCGACGATT TGCGCCCGCC GGCTGCGATC
GGCGGCCGCG CCGGCCAGCC GAACTGGCCG GGCGCGCAGC CATCCTATGC CGCGCCGGCG
CGCAGCACTC CCAATCAGCA GGCGCAACCG CCGGCCTATG GCGCATCCGC GCGCTCCGGC
TACGGCGCGC CGAACGATGG CTATCGCGGC GGCAACGGCC CGGTGTCGCT GACTGCGCCG
GGTGTGGCGC CGGCTGGCCA GGACATCGAT CTGCCGGACG AAGGCGGCAT GCCGGCCGAT
CGTCCGCCCG CGGAGAACGT CACCGGCTAT TCGCGCAATC CGAACGACGC GCGGGCGCCG
GCTGGCCGCT ATCCCGACGA TGCGCAGCGA CCGCTGCCGC GCCTCGGTCC GGCGCAGGGC
AATATCACCG GCTCGGTCGG TCCGGTCGCG ATCAAGCCGA CCGCGACACT GGCGTGTCCG
ATCGTCTCGG CGCTGGATCG CTGGCTCGCG GAATCCGTGC AGCCATCGGC GATGCGCTGG
TTCGGCGTGC GCGTCGCCGA GATCAAGCAG ATTTCCGCGT ATTCGTGCCG TGGCATGAAC
GGCAATCCGA ACGCGCATAT TTCCGAGCAC GCCTTCGGCA ACGCGCTCGA TATCTCGGCC
TTCGTGCTGG CCGACGGCCG CCGCGTCACG ATCAAGGGCG GCTGGAAGGG CTTGCCGGAG
GAGCAGGCGT TCTTGCGCGA CGTGCAGAAT TCGGCGTGCC AGGTGTTCAA CACGGTGTTG
GCCCCGGGCT CGAACATCTA TCACTACGAT CACATTCACG TCGATCTGAT GCGGCGGCGC
AGCGCGCGCA GCATCTGCAA GCCGGCGGCG GTGTCGGGCG AGGAGGTCGC GGCGCGGCTG
CAGCAGCGCA ACCCCTACGC CAGCAACTGG TCTGGCGTCA CCGGCTCAAT CGGCCGCAAC
GCGGCCCGGT CGAAAGCGGT GGACCGCGAG GAAGCTGAAG ACGATTAG
 
Protein sequence
MDCARMTRGV RLYLVGSFVL VSLAGCGRGL FQSAEREPWR TEAEVACLKS GVIKETPELV 
RIDPISGPGV CGAEFPLKVA ALGEGGAIGF ADDLRPPAAI GGRAGQPNWP GAQPSYAAPA
RSTPNQQAQP PAYGASARSG YGAPNDGYRG GNGPVSLTAP GVAPAGQDID LPDEGGMPAD
RPPAENVTGY SRNPNDARAP AGRYPDDAQR PLPRLGPAQG NITGSVGPVA IKPTATLACP
IVSALDRWLA ESVQPSAMRW FGVRVAEIKQ ISAYSCRGMN GNPNAHISEH AFGNALDISA
FVLADGRRVT IKGGWKGLPE EQAFLRDVQN SACQVFNTVL APGSNIYHYD HIHVDLMRRR
SARSICKPAA VSGEEVAARL QQRNPYASNW SGVTGSIGRN AARSKAVDRE EAEDD