Gene RPC_4601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4601 
Symbol 
ID3972092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5134291 
End bp5135508 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content71% 
IMG OID637927712 
Productlate embryogenesis abundant protein 
Protein accessionYP_534442 
Protein GI90426072 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAAG CCGATTCTCG ATCCTTGGAA GAGATACGTC GCGACGCCGA ACTCGCGCGG 
GCGGGCCTGA CTGCGACCGT CGATCAGCTG AAATCCACGG TGACGGACAC CGCGCAGGAT
TTCCGCGAAC GCTACTCGCC GGACGCCATC AAGGCCGAGG TCTCCGGCTA CATCAAGAGC
CGCGGCGAAG CCATGATCGA CAGCGTGACC GACGCGATCC GCAATAACCC GCTGCAGGCG
CTGGCGATCG GCGCCAGCAT CGCGGTGCCG CTGCTGCGCG TGGTGAGGAC GATTCCCGCG
CCGGTGCTGA TGGTCGGCGC CGGGCTTTAT CTGGCCGGGA CCAAGCGCGG CCAGGATGTG
GCGCGTCAGG CCAATGACGC CGCCATGGAG CTGGCCGGCG AAGTCGGTCG CCGCGCCCGC
GACATCGGCG CCGAGGTCGG CGAGGCTGCG GCCGCCACCC GCGATTACGC CGCCGATCGC
TACGCCGCCG CCAGCGAGGC GGTTGCGGTC GGCACCGAAC AGCTCAAGGG CAAGGCCGCG
GAACTCGGTG CGACGATCTC GTCCACCGTC GACGGGCTGC GTCACCAGGC CAACGACGCC
GGCGACCGGA TTTCCGACGA GGTGTCGGAG TTGTCGGAGC GCGGCTCGCG CAGCGCCGCC
GAGGCGGTCG ACTCGGTCCG CGACTCCGCC TCGACGGTGC GGCAGGCCGC CGCGTCGATG
CGCGAAACCG CCGCCGAAGC CGCGGCGCGT TTGCGGCAGA CGGCCTCGGC TTCGGTCGAC
GCCGGTCGCG ACGCCGCAGC CATTGCACGG GATCGCGCCG CGGATCTGGC GCATCGCGCG
GCCCGAGCCG GCGATCGGGC CGGGCGCACG CTGATGGACA CGGCGACGCA GAACCCGCTT
CTGGTCGCCG GCATCGGTCT GGTGCTGGGT GGACTGATTG CCAGTGCGTT ACCGCGCTCG
CGGATCGAAG ATCGGCTGGT CGGCGGCACC GCCCGCGGCC TCAAGGAGCG GGCGCGCGAT
GTCGCGGCGC AGAGCGTCGA AGGCGTCAAG GAGGCGGTGA GCGGCGCTTA TCAGGAAGTC
AGCCGCGCTG CCGAGCAGGA AGGGCTGACT CCGGACGGTG TTGCCGGGGC CGCCGGCGAT
CTCGGGCAGC GGGCCCGCAA GGTGGCGGAG GCCGCGACCG GTTCGTTCGA CCGACCATCG
TCGCACAACA AGCATTGA
 
Protein sequence
MAQADSRSLE EIRRDAELAR AGLTATVDQL KSTVTDTAQD FRERYSPDAI KAEVSGYIKS 
RGEAMIDSVT DAIRNNPLQA LAIGASIAVP LLRVVRTIPA PVLMVGAGLY LAGTKRGQDV
ARQANDAAME LAGEVGRRAR DIGAEVGEAA AATRDYAADR YAAASEAVAV GTEQLKGKAA
ELGATISSTV DGLRHQANDA GDRISDEVSE LSERGSRSAA EAVDSVRDSA STVRQAAASM
RETAAEAAAR LRQTASASVD AGRDAAAIAR DRAADLAHRA ARAGDRAGRT LMDTATQNPL
LVAGIGLVLG GLIASALPRS RIEDRLVGGT ARGLKERARD VAAQSVEGVK EAVSGAYQEV
SRAAEQEGLT PDGVAGAAGD LGQRARKVAE AATGSFDRPS SHNKH