Gene RPB_3389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3389 
SymbolmoaA 
ID3911191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3873127 
End bp3874164 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content68% 
IMG OID637885292 
Productmolybdenum cofactor biosynthesis protein A 
Protein accessionYP_486996 
Protein GI86750500 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2896] Molybdenum cofactor biosynthesis enzyme 
TIGRFAM ID[TIGR02666] molybdenum cofactor biosynthesis protein A, bacterial 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.142655 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGATG CTGACGTGAC GCCCCCGCCG ACCGCCGGCG CTTCGACGAT GGTCGATCCG 
TTCGGCCGGA CCATCGACTA TCTGCGGGTC TCGATCACCG ACCGCTGCGA TTTTCGCTGC
GCGTACTGCA TGTCCGAGGA CATGACCTTC CTGCCCCGCG CCGATCTGCT GACGCTGGAG
GAACTCGACC GGCTGTGTTC GGCCTTCATC GTCCGGGGCG TGCGCAAGCT CAGGCTCACC
GGCGGCGAGC CGCTGGTCCG GCGCAACATG ATGTCGCTGG TGCGGTCGCT CTCCCGCCAT
CTCGACACCG GCGCGCTGCG CGAACTCACC CTCACCACCA ACGGATCACA GCTCGCCCGC
TTCGCCGCCG AACTACGCGA CTGCGGGGTT CGGCGCATCA ACGTCTCGCT CGACACGCTT
GATCCGGCGA AGTTCCGCGC GATCACCCGC TGGGGCGAAT TCGACCGGGT GATCGCCGGC
ATCGAGGCCG CGCGCGCGGC CGGCCTCGCC GTCAAGATCA ACGCGGTGGT GCTGAAGGGC
GTCAACGAGG ACGAGATCCC CGCCTTGATG CAATGGGCGC ACGGGCTCGG CATGGGGCTG
ACGCTGATCG AGGTGATGCC GCTGGGCGAG ATCGGCGAAG GCCGGATCGA CCAGTACGTG
CCGCTGTCGC TGGTTCGCGC CCGGCTGTCG AACAACTACA CGCTGACGGA TCTGCCCGAC
AGCACCGGCG GCCCGGCCCG CTACGTCCGG GTCGAGGAGA CCGGCGGCAA GCTCGGATTC
ATCACCCCCC TCACCCACAA TTTCTGCGAA TCCTGCAACC GGGTTCGGAT CACCTGCACC
GGCACGCTGC ACACCTGCCT CGGCCAGGAA GACGCCGCCG ATCTGCGCCG GCCGCTGCGC
GCCTCGCCCG ACGATGCGCT GCTCAACGCC GCGATCGATC GCGCCATCGG CCACAAGCCC
AAGGGCCACG ACTTCATCAT CGACCGCAAA CACGACCGGC CGAGCGTCAG CCGGCATATG
AGCGTGACCG GGGGCTGA
 
Protein sequence
MIDADVTPPP TAGASTMVDP FGRTIDYLRV SITDRCDFRC AYCMSEDMTF LPRADLLTLE 
ELDRLCSAFI VRGVRKLRLT GGEPLVRRNM MSLVRSLSRH LDTGALRELT LTTNGSQLAR
FAAELRDCGV RRINVSLDTL DPAKFRAITR WGEFDRVIAG IEAARAAGLA VKINAVVLKG
VNEDEIPALM QWAHGLGMGL TLIEVMPLGE IGEGRIDQYV PLSLVRARLS NNYTLTDLPD
STGGPARYVR VEETGGKLGF ITPLTHNFCE SCNRVRITCT GTLHTCLGQE DAADLRRPLR
ASPDDALLNA AIDRAIGHKP KGHDFIIDRK HDRPSVSRHM SVTGG