Gene RPD_3945 details


Gene Information

Locus tag: RPD_3945
Symbol:
ID: 4024461
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4389999
End bp: 4391720
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 59%
IMG OID: 637964149
Product: hypothetical protein
Protein accession: YP_571067
Protein GI: 91978408
COG category: [S] Function unknown
COG ID: [COG1479] Uncharacterized conserved protein
TIGRFAM ID:
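
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and do not come from the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4389999
end_bp = 4391720

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1722
print(protein_length)  # 573
```

Both results match the Gene Length (1722 bp) and Protein Length (573 aa) fields.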


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAACG CGGACCAACA TCCCGTCGCC GAAGTGCTGG GCGACAAGTT CATCCATGAA 
ATTCCGCCAT ATCAGCGCCC GTACGCCTGG ACATCGGATC AGGCCCTCCA ACTGATCGAA
GATCTCAGGG AGGCGATGAG CTCAGGCGCC GACGAACCCT ATTTTCTCGG CAGCATAGTG
CTGATACGGC CGCGCGGAGA ACCCGTCGGT CAGGTGGTCG ACGGCCAGCA GCGTCTGACG
ACGCTGACGA TATTGGCCGC CGTGCTGCGC GACCTGGCAA CCGATCCCGA CGCGCGCGAG
GCAATCTCCG GCGCGGTCTA CATCAAACCT AATCCCTACA AAAAGCAGGT CGAGTCGGTC
AGGATTCTTC CTCATTCGGA GGACCGCATT TTCTTCCGCG AGGCCATCCA GTTTCCCGAC
GCGACGTCGA AATCGTCTCC GCCCCATCAG CCGAAGACCG AAGCCCAGAA GCTGATGTGG
GACAACGCCC TGGCGTTGCG TAAGCGCGTC GTTGAGATGA CGATCGAAGA CCGTCAGAGA
CTCGTCGATT ACCTCCTGAA CAATTGCGTT CTGGTGGTGG TTTCCACGGA GTCGCGTGGC
GCGGCGCTCA GGATCTTCAG GGTGCTGAAC GATCGCGGCC TGGATCTTTC GAATGCGGAT
GTCATCAAGG CCGATCTGCT CGGGAAATTT AAAGACCATA CGGAGATGGC GCATCAGGCC
GCCCGGTGGC GCGACTTCGA GACCGACCTC GGCCGCAATG ATTTCGAGGA TCTGCTGGAA
AATCTCCGGT TCATCCGTGA GAAAGGCAAG AACCGAAGTT CGCTAAGTGA AGCGTACGAA
TTACGCTTCA AGATGGCTAC ACCTCCGGAC GTCAGAAATT TTCTCGATCA CGAACTTGCG
CCGGCCAAGC GCTGGTTTGC CGAGATCGTG GATGGAGACG GAGCGGATTT TCCGACCATT
CTCCGAAGCG GGCTGTCGGA AGCATTGGCG GGTCTTCGCC TGGTTCCCAA CAAGGATTGG
ATGCCGGTCG CTCTCGCCGC CGCGATGCAG TTCGGCGCCA CGGAGAAGCT GCTCTCGACG
CTGGTCAAGC TTGAGGGGCT GGCTTGGATC ATGCAGTTGG GACGCCGCTA CGATACACAG
CGGATGAACC GCTACGGCGA AATCATCGGA GCTCTTGGCG GTCCGGATGC GGAGCTCGAA
AGCAAGCTTG TTCCCTCGGT CGAAGAGAAC GACGATGCTT GGTCAGCGCT GAGCGGAAAG
CTCTACAGCA AGTTTCCCGT GCGAGTCGTC CGTGCTGTTC TCGAACGTTT GGACAGATTG
CTATCCGAGC AAATCGTCGT CTGGGATGGG CAAAAGACCG TCGAACACAT CCTTCCGCAG
AATCCCGAGG CCGGGGAATG GGTTGGTTTC GATTCAGAAC GGCGGGAGGC GGTCACGGAT
ACACTCGGTA ACCTGGTTCT GCTGACTTCG CGTAAGAACT CGTCTGCCTC CAATCTGCCC
TTCGCAAAGA AACGCCTAGT CTATTTTGGA CTCGCGGAAA CGAGCGCTGG AAAGAAGAGA
GCGACGTATG CGAGCGCCCA AGAACTGGGC GAGCTCCGCG ACTGGGATGT CCCCGCATAT
CGAAGTCGAC AAGAACGTCA CCTTGCGTTG CTCGCCAAGC GATGGGGCAT CACGCTTCAG
CCCACCGTCC AGCCGCCCGC TGCGGACGCT CTTCGGTCTT GA
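
The GC content reported for this gene could be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted in the 10-base block layout used on this page; `gc_percent` is a hypothetical helper name, not part of any database tool.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, kept in its block layout.
first_line = "ATGATCAACG CGGACCAACA TCCCGTCGCC GAAGTGCTGG GCGACAAGTT CATCCATGAA"
print(f"{gc_percent(first_line):.1f}% GC over the first 60 bases")
```

Passing the full 1722-base sequence instead of `first_line` gives the value to compare with the GC content field.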
 
Protein sequence
MINADQHPVA EVLGDKFIHE IPPYQRPYAW TSDQALQLIE DLREAMSSGA DEPYFLGSIV 
LIRPRGEPVG QVVDGQQRLT TLTILAAVLR DLATDPDARE AISGAVYIKP NPYKKQVESV
RILPHSEDRI FFREAIQFPD ATSKSSPPHQ PKTEAQKLMW DNALALRKRV VEMTIEDRQR
LVDYLLNNCV LVVVSTESRG AALRIFRVLN DRGLDLSNAD VIKADLLGKF KDHTEMAHQA
ARWRDFETDL GRNDFEDLLE NLRFIREKGK NRSSLSEAYE LRFKMATPPD VRNFLDHELA
PAKRWFAEIV DGDGADFPTI LRSGLSEALA GLRLVPNKDW MPVALAAAMQ FGATEKLLST
LVKLEGLAWI MQLGRRYDTQ RMNRYGEIIG ALGGPDAELE SKLVPSVEEN DDAWSALSGK
LYSKFPVRVV RAVLERLDRL LSEQIVVWDG QKTVEHILPQ NPEAGEWVGF DSERREAVTD
TLGNLVLLTS RKNSSASNLP FAKKRLVYFG LAETSAGKKR ATYASAQELG ELRDWDVPAY
RSRQERHLAL LAKRWGITLQ PTVQPPAADA LRS
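
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard codon-to-amino-acid assignments, which NCBI translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not handle; the gene here starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at a stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above (both start in frame).
first_line = "ATGATCAACG CGGACCAACA TCCCGTCGCC GAAGTGCTGG GCGACAAGTT CATCCATGAA"
last_line = "CCCACCGTCC AGCCGCCCGC TGCGGACGCT CTTCGGTCTT GA"

print(translate(first_line))  # MINADQHPVAEVLGDKFIHE
print(translate(last_line))   # PTVQPPAADALRS
```

These outputs agree with the first 20 and last 13 residues of the protein sequence shown above.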