Gene RPD_2006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2006 
Symbol 
ID4022488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2243897 
End bp2245933 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content64% 
IMG OID637962199 
Productendothelin-converting protein 1 
Protein accessionYP_569142 
Protein GI91976483 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3590] Predicted metalloendopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGCC TCGCTTTCAT GCTCCTCGTG ATCTCCCCGA TAGTGACCCT CACCGCGCCT 
GCGCGCGCGG GAGACAAGCC GGTGCTCGGG AACTGGGGGA TCGAGACCCA GACGATTTCC
GCGACGGTCC GGCCCGGTGA CGACTTCTAT CGCTACGTCA ACGAAGGCTG GCTCAAGACC
GCCGCTCCGC CGCCGGGAAT GGCGTATGCA AATTCCTTCG TGGATGCCTA TCAGCGCACG
CAGGGACAGC TTCAGAAGCT GATCGACGGC ATCCTCGTTT CGACGCCAGC CCCGGGCAGC
GACGAGGCCA AGATCGCAGC GCTGTACAGG AGCTATATCG ACGTGGCCGG GCGCAACCAG
CGCGGCCTGT CGCCGATCAA GAGCGATCTC GACTCGATCT GGGCGATCAG GACCCACGAG
GACGCCGCCC GGCTGCAGGG CCAGCCGTTC TTCAAGTCAC CGATCGATAT CGGTGTCGTC
ACCGACGACA AGAAGCCGGA ACGCTATGTG ATCGGCGCAA TGCAGGCCGG CCTCGGCCTG
CCCAGCCGTG AATACTACCT CACCGCCGGC GAGCCCTTCG ACGGCCATCG CGCCGCCTAT
CTGGCCTATG TCGCCGACAT CTTCAAACGC GTCGGCGTTG CCGATGGCGG CGATCGGGCG
AAGGCCATCC TCGCCTTCGA GACCCGTCTC GCCGAGGCGC AGTGGACCGC GGCCGAACAA
CGCGACCCGG TGAAGAGCTA TCGGCTGCTG TCGATCGCCG AGCTTCAGGC CTACGCGCCA
GCGTTCCCGT GGCAGATCTA TCTTGAGGCG GCGGGCTTCA GCCGGCCGAC CGAGCTGGTG
CTGACGACGG ACACGGCGAT CCAGAAGAGC GCCGAGATCT TCAAGGCGAC GGATATCGAA
ACCATCAAAT CCTATCTCGC CTTCCACCTC GTCGATGATT TTGCCCCCAA CCTGACCGAG
GAACTGGATC GGGCCAGCTT CGCCTTCAAC AGCACACGGC TTCACGGCGT CCCCGAGCAG
GAGGCGCTCG AGAAACGCGC CCAGACTTTC GTGACGTCGA CATTCGGCGA GATCTTCGGC
CGGGCCTATG CCAAGGCCTA TTTCCCGGAA AACTATCGCG CCAAGATGGA CCGGATGATC
ACCAACATCC GCGCAGCCTT CCACAAGCGG CTGGACGCCA ATCCCTGGAT GGACGAGGCG
ACGCGCAAGG CGGCGATCGT CAAGCTCGAG GCGATCGTCA AACATGTCGG CTATCCGGAC
CGCTGGCGCG ACTGGTCGTC GGTCGCGTTC GATCCGACCG ACCTGGTCGG CAACCGGCGC
AAGGCCGAGG CGTTCGCCCG GGCAGACGCC ATCGCCAAGC TCGGCGAGAA ACGCCGCGAA
TGGCAGTGGA GCTATCCGGC CACCGACATC AACGCCGGCT ACAGTCCGCA GATGAATTCG
ATTACCTTCC CCGCGGGCAT TCTGCAGCCG CCGTTCTTCG ATCCGAATGC CGACGACGCC
GTCAATTACG GTGCGATCGG CGCCGTGATC GGCCATGAGC TCGGCCACGC GTTCGACGAT
CAGGGCAGCC AGTCCGACGC GACCGGCGCG CTGCGCAACT GGTGGACGGA CGTATCACGC
GCGGAATTCA GCAAACGGAC CGCCGTGCTG GTGCAGCAGT TCAGCGGCTA CTCGCCGCTT
CCGGGCATGC GGGTGAACGG CGAACTGACG CTGGGGGAGA ACATCGGCGA TCTCGGCGGC
ATTACGATCG CCCATGAGGC GTATCGGATG CTCGTCGATC AGGAGCATGG CGGCAAGGCG
CCGGTGATCG ACGGCTTCAC CGGTGACCAG CGCTTCTTCC TGTCCTGGGC GCAGGTTTGG
CGGGATTTCA CGACTCCCGA CCAGGCCCGA CAGAACCTGC TCTCGGATGC ACACAGCCCG
AGCGAATTCC GCGTCAACGG CGCGCTACGC AACCTCGACG CGTGGTACGC AGCCTTCGGC
GTGAAGGATG GCGACAAACT CTATCTGCCG CAGGACAGTC GCGTCCGGAT CTGGTGA
 
Protein sequence
MRRLAFMLLV ISPIVTLTAP ARAGDKPVLG NWGIETQTIS ATVRPGDDFY RYVNEGWLKT 
AAPPPGMAYA NSFVDAYQRT QGQLQKLIDG ILVSTPAPGS DEAKIAALYR SYIDVAGRNQ
RGLSPIKSDL DSIWAIRTHE DAARLQGQPF FKSPIDIGVV TDDKKPERYV IGAMQAGLGL
PSREYYLTAG EPFDGHRAAY LAYVADIFKR VGVADGGDRA KAILAFETRL AEAQWTAAEQ
RDPVKSYRLL SIAELQAYAP AFPWQIYLEA AGFSRPTELV LTTDTAIQKS AEIFKATDIE
TIKSYLAFHL VDDFAPNLTE ELDRASFAFN STRLHGVPEQ EALEKRAQTF VTSTFGEIFG
RAYAKAYFPE NYRAKMDRMI TNIRAAFHKR LDANPWMDEA TRKAAIVKLE AIVKHVGYPD
RWRDWSSVAF DPTDLVGNRR KAEAFARADA IAKLGEKRRE WQWSYPATDI NAGYSPQMNS
ITFPAGILQP PFFDPNADDA VNYGAIGAVI GHELGHAFDD QGSQSDATGA LRNWWTDVSR
AEFSKRTAVL VQQFSGYSPL PGMRVNGELT LGENIGDLGG ITIAHEAYRM LVDQEHGGKA
PVIDGFTGDQ RFFLSWAQVW RDFTTPDQAR QNLLSDAHSP SEFRVNGALR NLDAWYAAFG
VKDGDKLYLP QDSRVRIW