Gene RPD_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2052 
SymbolmoaA 
ID4022534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2301181 
End bp2302215 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content64% 
IMG OID637962245 
Productmolybdenum cofactor biosynthesis protein A 
Protein accessionYP_569188 
Protein GI91976529 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2896] Molybdenum cofactor biosynthesis enzyme 
TIGRFAM ID[TIGR02666] molybdenum cofactor biosynthesis protein A, bacterial 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.128137 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGTG CAGTGATGAC TCCACCGACC GTCGGCGCTT CGGCGATGAC CGATCCGTTC 
GGCCGGACGA TCAGCTATCT GCGGGTGTCC ATCACCGACC GCTGCGACTT TCGCTGTGTC
TACTGCATGT CGGAAGACAT GACCTTCCTG CCCCGCGCCG ATCTTCTGAC GCTGGAGGAA
CTCGACCGGC TCTGCTCGGC CTTCATCGCC CGCGGCGTCC GCAAGCTTCG GCTGACGGGG
GGGGAGCCAC TGGTCCGGCG CAACATGATG TCACTGGTGC GCTCACTGTC GCGCCATCTC
GGCACCGGCG CGCTCGACGA ACTCACCCTC ACCACCAACG GCTCGCAGCT CGCCCGATTC
GCCGAAGAAC TAAGCGACTG CGGCGTCCGC CGCATCAACG TCTCGCTCGA TACGCTCGAT
CCCGGAAAAT TCCGCGCGAT CACCCGCTGG GGCGACCTCG ACCGGGTGTT GGCCGGAATC
GAGGCGGCGC GCGCCGCCGG CCTCGCCGTC AAGATCAACG CCGTGGCGCT GAAGAACATC
AATGAGGACG AGATTCCGTC ACTCATGCAA TGGGCCCACG GCCTCGGTAT GGGACTGACG
CTGATCGAGG TGATGCCGCT CGGCGAGATC GGCGAAGGCC GGATCGATCA ATATGTTCCG
CTGTCGCTGG TTCGCGCGAG GCTTTCGAAC AACTACACCT TGACTGATTT GCCAGATAGC
ACCGGCGGCC CAGCCCGCTA CGTCCGGGTC GATGAAACCG GCGGCAAGCT CGGCTTTATC
ACGCCCCTCA CCCATAATTT CTGCGAATCA TGCAACCGGG TGCGGATCAC CTGCACCGGG
ACCCTACACA CCTGCCTCGG ACAGGAGGAT GCGTCCGACC TGCGCCGGCC GCTCCGCGCA
TCGCCGGACG ACGATCTGCT CAACGCCGCG ATTGATCGTG CGATCGGCCA CAAGCCGAAG
GGGCACGACT TCATCATCGA CCGCAAGCAC AACCGGCCCA GCATTGGCCG TCATATGAGC
GTCACCGGCG GCTGA
 
Protein sequence
MSSAVMTPPT VGASAMTDPF GRTISYLRVS ITDRCDFRCV YCMSEDMTFL PRADLLTLEE 
LDRLCSAFIA RGVRKLRLTG GEPLVRRNMM SLVRSLSRHL GTGALDELTL TTNGSQLARF
AEELSDCGVR RINVSLDTLD PGKFRAITRW GDLDRVLAGI EAARAAGLAV KINAVALKNI
NEDEIPSLMQ WAHGLGMGLT LIEVMPLGEI GEGRIDQYVP LSLVRARLSN NYTLTDLPDS
TGGPARYVRV DETGGKLGFI TPLTHNFCES CNRVRITCTG TLHTCLGQED ASDLRRPLRA
SPDDDLLNAA IDRAIGHKPK GHDFIIDRKH NRPSIGRHMS VTGG