Gene RPC_0443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0443 
Symbol 
ID3970205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp477481 
End bp478911 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content72% 
IMG OID637923559 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_530337 
Protein GI90421967 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1231] Monoamine oxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGCC GCGAGTTCCT GGCGGCCTCG GCGGCCTTGG CGCTGACGCC GGCGTTGCGC 
TGGCCGGCTT TCGCCGCACC GCTGCCGCGC GAGGCCGACG TGGTGGTGAT CGGCGCCGGC
GCCGCCGGCA TCGCGGCGGC CCGGCGAATC ATCGCCGCGG GCCGCAAGGT GATCGTGATC
GAGGCCGGCC GGCAGGTCGG CGGCCGCTGC CTGACCGATA CGACCAGCTT CGAGGTTCCG
TTCGATCGCG GCGCGCGCTG GCTGCACAAT CCGGAGACCA ACCCGCTGGT CAAGCTGGCG
CGCGGCGCAG GGCTCGACGT CGCGCCGGCG CCGCCCGGGC AGAAGCTGCG GATCGGCCGC
CGCAACGCTC GCCCCGGCGA GACCGAGGAT CTGCTGGCGA GTCTGGTGCG CGCCAACCGC
GCCATCGACG ACGCCGCGCG CAAGGCCGAT CTGTCCTGCG CGGCGGCGCT GCCGAAGGAT
CTCGGCGACT GGGCCGGGAC GGTGGAATTC TTGCTCGGGC CCTACGCCAC CGGCAAGGAT
CTCAAGGACC TGTCGGCGCT CGATCAGTTC CGCGCCCAGG ATCGCAACGC CATGGTGGCC
TGCCGCCAGG GACTCGGCAC CATGATCGGC AAGCTCGGCG AGGGTCTGCC CGTCGCGCTG
GCGACGCCGG CGACGCGAAT CACCTGGGGC GGCCGCGACG TCGGCGTCGA AACCCCAGCC
GGGCGGATCA CGGCGCGCGC CGTGATCGTC ACGGTGTCGA GCAACGTGCT GGCGTCCGGC
GCCATCAAGT TCGCCCCGGA GCTGCCGAAG CGGCAGCTCG ACGCCGCCGC CAAGCTGACG
CTCGGCAGCT ACGATCACGT CGCATTGCTG TTGTCCGGCA ATCCGCTGGG ATTGCCCAAG
GACGAGATCA TGATCGAGCA GTCGAGCGAC GCCCGCACTG CCTTCCTGGT CGCCAATATG
GCCGGCAGCT CGCTGTGCTC GGTCGACGTC GCGGGCGCGT TCGGCCGTGA CCTCGCGGCG
CAGGGCGAGG CGGCGATGAT CGATTTCGCC AAGGAATGGC TCGGCAAGCT GTTCGGCAGC
GACGCCGTCG CGGAGGTCCA AGGCGCCAGC GCGACGCGCT GGAATGCGCA GCCCTTGGTG
CAAGGCGCGA TGTCGGCGGC GCTGCCCGGC GGGCAGTTCG CAAGGCGGGT GCTGGCCGAG
CCGATCGGCA ACCTGTTTCT GGCCGGCGAA GCCAGCCATG AGACGCTGTG GGGCACCGTG
GACGGCGCCT GGGACTCAGG CGAACGCGCC GCCGACGCCG CGTTGAAGCG GATCGGCAGC
CTGAAGAGCG ACACGCCGGC GCAGTCTTCG CCGCGCAAGC CGAAGCAACG CCGCGAGCGG
GGCGTCTCCG GCGCGAGTGC CGGAGACCTC GGCTGGCCGG GGAGGCGGTA G
 
Protein sequence
MSRREFLAAS AALALTPALR WPAFAAPLPR EADVVVIGAG AAGIAAARRI IAAGRKVIVI 
EAGRQVGGRC LTDTTSFEVP FDRGARWLHN PETNPLVKLA RGAGLDVAPA PPGQKLRIGR
RNARPGETED LLASLVRANR AIDDAARKAD LSCAAALPKD LGDWAGTVEF LLGPYATGKD
LKDLSALDQF RAQDRNAMVA CRQGLGTMIG KLGEGLPVAL ATPATRITWG GRDVGVETPA
GRITARAVIV TVSSNVLASG AIKFAPELPK RQLDAAAKLT LGSYDHVALL LSGNPLGLPK
DEIMIEQSSD ARTAFLVANM AGSSLCSVDV AGAFGRDLAA QGEAAMIDFA KEWLGKLFGS
DAVAEVQGAS ATRWNAQPLV QGAMSAALPG GQFARRVLAE PIGNLFLAGE ASHETLWGTV
DGAWDSGERA ADAALKRIGS LKSDTPAQSS PRKPKQRRER GVSGASAGDL GWPGRR