Gene RPD_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2024 
Symbol 
ID4022506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2270234 
End bp2271766 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content65% 
IMG OID637962217 
Producthypothetical protein 
Protein accessionYP_569160 
Protein GI91976501 
COG category[S] Function unknown 
COG ID[COG3333] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.482328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.302719 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAAC TCGCCAACCT GTTTCACGGC TTCGCCGTCG CGCTGATGCC GTTCAACATC 
CTGGTGATGG TCATCGGCAT CGTGCTCGGC GTCATCATCG GCGTGCTGCC GGGCCTCGGC
GGCGCCAACG GCGTCGCAAT CCTGCTGCCG CTCACCTTCA GCATGCCGCC GACCTCGGCG
ATCATCATGC TGTCGTGCAT CTATTGGGGC GCGCTGTTCG GCGGCGCGAT CACCTCCATC
CTGTTCAATA TCCCCGGCGA GCCATGGTCG GTCGCGACCA CATTCGACGG CTATCCGATG
GCGCAGCAGG GCAAGGCCGG CGAGGCTCTT ACCGCCGCCT TCACCTCGTC CTTCGTCGGC
GCGCTGTTCG CGATCGTCAT GATCACGCTG GTGGCCCCGC TGGTCGCGCG CTTCGCGCTG
GAGTTCGGCC CGCCGGAAAA ATTCGCGGTG TATTTCCTCG CCTTCTGCAG TTTCGTCGGA
CTATCGAAAG AACCACCCTT CAAGACCGTC GCGGCGATGA TGCTCGGCTT CGCTCTCGCC
GCCGTCGGGC TCGATTCGAT GACCGGGCAG CTCCGGCTGA CCTTCGGCTT CACCGAGATG
CTGAACGGCT TCGACTTCCT GATCGCGGTG ATTGGCCTGT TCGGCATCGG CGAAATCCTG
CTGACGATGG AAGACGGGCT CAGCTTCCGC GGCAGCAAGG CCAAGATCAA TCTGCGCGTC
GTGCTGCAGA CCTGGAAGGA GCTGCCGCGC TACTGGATGA CATCGCTGCG CTCCAGCGTG
ATCGGCTGCT GGATGGGCAT CACGCCGGCC GGCGCCACGC CGGCGTCGTT CATGAGCTAC
GGCATCGCCA AGCGGATGTC GAAGAACGGC CAGAATTTCG GTCGTGGCGA GATCGAAGGG
GTGATTGCGC CGGAGACCGC GGCGCACGCC GCAGGCACAG CCGCTCTGCT GCCGATGCTG
TCGCTCGGCG TGCCCGGTTC GCCGACCGCC GCGGTGCTGC TCGGCGGCCT CTTGATCTGG
GGCCTGCAGC CGGGGCCGAT GCTGTTCGTC GAGCAGAAGG AGTTCGTCTG GGGCCTGATC
GCCTCGATGT ATCTCGGCAA CGTCGTCGGC CTGCTGATCG TGCTGACCTG CGTGCCGGTG
TTCGCAGCGA TCCTGCGGAT TCCGTTCAGC ATCGTTGCGC CGCTGATCCT GGTGCTCTGC
GCGATCGGCG CCTATTCGGT GCACAACTCC ACCTTCGACG TGATGCTGAT GCTGGTATTC
GGGGTGATCG GCTACCTGCT GAAGAAATGC AACTATCCAC TTGCGCCGCT GGTGCTCGCG
ATCGTACTCG GCGACAAGGC GGAGGAAGCG TTCCGGCAAT CACTGCTCGC CTCGCAGGGC
GCGCTCGGCG TGTTCTTCTC GAACGGGCTG GTCGGCACGA TCATGGCGCT CGGGCTGATC
GCGCTGTTCT GGTCGCTGAT CAACGAGGGC TACGTCCGGC TGCGTTCCGC GGCGACCGGA
CGGCCGCGAG CGGCTGGCCC GGATTACGAA TAA
 
Protein sequence
MEELANLFHG FAVALMPFNI LVMVIGIVLG VIIGVLPGLG GANGVAILLP LTFSMPPTSA 
IIMLSCIYWG ALFGGAITSI LFNIPGEPWS VATTFDGYPM AQQGKAGEAL TAAFTSSFVG
ALFAIVMITL VAPLVARFAL EFGPPEKFAV YFLAFCSFVG LSKEPPFKTV AAMMLGFALA
AVGLDSMTGQ LRLTFGFTEM LNGFDFLIAV IGLFGIGEIL LTMEDGLSFR GSKAKINLRV
VLQTWKELPR YWMTSLRSSV IGCWMGITPA GATPASFMSY GIAKRMSKNG QNFGRGEIEG
VIAPETAAHA AGTAALLPML SLGVPGSPTA AVLLGGLLIW GLQPGPMLFV EQKEFVWGLI
ASMYLGNVVG LLIVLTCVPV FAAILRIPFS IVAPLILVLC AIGAYSVHNS TFDVMLMLVF
GVIGYLLKKC NYPLAPLVLA IVLGDKAEEA FRQSLLASQG ALGVFFSNGL VGTIMALGLI
ALFWSLINEG YVRLRSAATG RPRAAGPDYE