Gene RPD_3631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3631 
Symbol 
ID4024145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4052958 
End bp4054481 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content62% 
IMG OID637963835 
Producthypothetical protein 
Protein accessionYP_570755 
Protein GI91978096 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.662915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGG GCTGGAGTGC GATCCACATA GACGAGCCTC TGCTGTCGTT CGGTCACGAT 
CAAAGTGCCG AGCATCCGAA GGACGGTCTC TTCCTGTTCG GGCCCGTCGC GTCCGGACAG
AACCCCGCCC GAATGGACGT GGGTGTTATC GCCACTCCCG CCGGTCTCGA AAAATATTCG
AAATGGGTGG CCTCGATCGA GAAGTTCATC GACGTCCCGC CACAGGATCC CGACCGCAAG
CGCAACGGTG CGAACATGTT CGTCTGGCCG GGCTTCGAGG CCGTGTATGG CGCCGCGTGG
CCGAGCAGGC CGTTCGCGAC CTGCATCATT GACGCCGCCG AACTTAGTCG GAGGATCCTG
GGCGCCGATC GACACCAGGC CATCTATTCC GCCGTGGCGC TCTACGAAGA GGCGCTGCGA
AAGTACCTGC GCGAAGAGGA AGCCAGACCG CAGCTCTGGT TCGCGGTCGT CTCGGACGAG
ATCTATAAAT ACGGTCGGCC GAAGTCGTCC GTGCCCACGA AGCTCCGCAC GCCCGGGACA
CGCAAGCTCG GCATGAAGAC TGCCCGTTCG ATCCTGAAGC AGGGTTCAAT GTTCGCGGAG
GAGATGCAGG CGGCGGCCGT CTACGAATAC GAACTCAACT TCCACAACCA GCTCAAAGCC
AGATTGATGG ACACCGGCCA GGTGATCCAG GTCGCGAAGG AGGCGACGCT CGATCCGAGC
GATGCCGACC AGGCGCGCAT GCAGGACCCA GCGACTGTCG CATGGAATCT GTCGACCACC
AGCTTCTACA AGATCGGCGG CAGGCCCTGG CGGCTCGCCG ATCTGCGGGA TGGGGTCTGC
TACGTCGGCC TGGTCTTCAA GCGCATCGAC AGACCGAGCG GAACCGACAA CGCATGCTGC
GGGGCGCAGA TGTTCCTCGG TTCCGGCGAC GGACTGGTTT TCCGAGGAGC AGTGGGGCCT
TGGTATTCAG AGACAAAGGA CACGTTCCAT CTCGATCGGG ATCAGGCGGC GAAGCTGATG
AAGATGATCG TCCAGGCCTA CCAGGACAAT CATGGCGTGC CCCCGACCGA GCTCTTCATC
CACGGCAAGA CAAACTTCGA CGCCAACGAG TGGGCGGGCT TTTCCAGTGC CGTCCCGACG
TCCACGAAGC TCGTCGGAGT GCAGATTCGT GACAACGCCG ATATCAAAGC ATTCCGTTAC
GGCGCCAATG CCGTGCTGCG GGGGACCGCC GTCGTGACAT CGGAAGTTTC TGGCTACGTC
TGGACGCGAG GATACATCCC GCGCCTGCGG ACTTATCCGG GACGCGAAGT CCCAAATCCC
TTGACCGTCG AGATCAGGCG CGGGTCGGCC GACATCGAAC AAGTCATGCG GGACGTGATG
TCGCTCACGA AGCTGAATTT CAACGGAGCC GAGTTCTGCG ACGGTTTGCC GGTGACGTTG
AGGTTCGCCG ACCTCGTCGG TGAAATTTTG ACGGCCGGGC CGATCGCGGA GCACCTGCCG
TTGCCATTCA AGTTCTACAT CTGA
 
Protein sequence
MTAGWSAIHI DEPLLSFGHD QSAEHPKDGL FLFGPVASGQ NPARMDVGVI ATPAGLEKYS 
KWVASIEKFI DVPPQDPDRK RNGANMFVWP GFEAVYGAAW PSRPFATCII DAAELSRRIL
GADRHQAIYS AVALYEEALR KYLREEEARP QLWFAVVSDE IYKYGRPKSS VPTKLRTPGT
RKLGMKTARS ILKQGSMFAE EMQAAAVYEY ELNFHNQLKA RLMDTGQVIQ VAKEATLDPS
DADQARMQDP ATVAWNLSTT SFYKIGGRPW RLADLRDGVC YVGLVFKRID RPSGTDNACC
GAQMFLGSGD GLVFRGAVGP WYSETKDTFH LDRDQAAKLM KMIVQAYQDN HGVPPTELFI
HGKTNFDANE WAGFSSAVPT STKLVGVQIR DNADIKAFRY GANAVLRGTA VVTSEVSGYV
WTRGYIPRLR TYPGREVPNP LTVEIRRGSA DIEQVMRDVM SLTKLNFNGA EFCDGLPVTL
RFADLVGEIL TAGPIAEHLP LPFKFYI