Gene RPD_1956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1956 
Symbol 
ID4022438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2195992 
End bp2197059 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content63% 
IMG OID637962149 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_569092 
Protein GI91976433 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3049] Penicillin V acylase and related amidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.550247 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.436051 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCGT TCAGGCGTCG TTTCGTCACC GCATCCGCCG CAGCGCTTCT GATCGGAGGC 
GCCCTGCTTC CGCCGGTGGC GCAGGCGTGC ACCCGTCTGG TCTATCTCGG CGCCGACAAT
CAGGTGATCA CCGCCCGCTC GATGGACTGG TCGCGCGACA TCGGCACCAA TCTCTGGATC
TTGCCGCGCG GTATCAAGCG CTCCGGCGAG GCCGGGCCGA ACTCGCTGCA ATGGACCGCG
CGTTACGGCA GCGTGATCGC CTCGGCCTAT GACATCGCGA CCTCGGACGG CGTCAACGAG
GCCGGGCTGG TCGCGAACGT GCTGTGGCTC GCGGAATCGA CCTATCCGAA ACTCGACGGC
AGCAAGCCGG GTCTCGCGCT GTCGCTGTGG CCACAATATG TGCTCGATAG TTTCGGCACC
GTGCAGGAGG CGGTCGAGGC GCTGGCGAAA CAGCCGTTCA CCGTCGTCAC AGCCCAGTTG
CCCGACGAGA ACCGGCTGGC CACAGTACAT CTGTCGCTGT CGGATTCGAC CGGCGACAGC
GCCATCATCG AATATATCGA TGGCCAGCAG GTGATCCATC ACGGCCGGCA ATATCAGGTG
ATGACCAATT CGCCGACCTT CGATCAGCAG CTCGCGCTCA ACGCCTATTG GAAGCAGATC
GGCGGCACCG TGATGCTACC CGGCACCAAC CGCGCGGCCG ACCGCTTCGC CCGCGCGTCG
TTCTACGTCG ACGCGATCCC GAAGGCGGAA AACCCGGTCG AAGCGATCGC CAGCGTGTTC
GGCGTGATCC GCAACGCCTC GGTGCCTTAC GGCATCACCA CGCCCGATCA GCCGAACATC
TCCTCGACGC GCTGGCGCAC CGTCGTCGAC CACAAGCGCA AGCTCTACTT CTTCGAATCC
GCGCTGACCC CGAACGTGTT CTGGGTCGAC CTCACCCGGA TCGATTTTTC CGCCGACAAA
GGCGCGGTCA AGAAGCTCGA CCTCGGCGCC AACCAGACCA ACACCTTTTC GGGCGTGGTC
AATGATCAGT TCAAGGTCAG TCCGCCGTTC AAATTTCTCG GGCTGTGA
 
Protein sequence
MIPFRRRFVT ASAAALLIGG ALLPPVAQAC TRLVYLGADN QVITARSMDW SRDIGTNLWI 
LPRGIKRSGE AGPNSLQWTA RYGSVIASAY DIATSDGVNE AGLVANVLWL AESTYPKLDG
SKPGLALSLW PQYVLDSFGT VQEAVEALAK QPFTVVTAQL PDENRLATVH LSLSDSTGDS
AIIEYIDGQQ VIHHGRQYQV MTNSPTFDQQ LALNAYWKQI GGTVMLPGTN RAADRFARAS
FYVDAIPKAE NPVEAIASVF GVIRNASVPY GITTPDQPNI SSTRWRTVVD HKRKLYFFES
ALTPNVFWVD LTRIDFSADK GAVKKLDLGA NQTNTFSGVV NDQFKVSPPF KFLGL