Gene RPB_3419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3419 
Symbol 
ID3911221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3908164 
End bp3909408 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content67% 
IMG OID637885322 
Productpyrroloquinoline quinone biosynthesis protein PqqE 
Protein accessionYP_487026 
Protein GI86750530 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR02109] coenzyme PQQ biosynthesis protein E 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.058581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.446493 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG CGAGGAACGA GACGATTCAC GCTGCCGACG GCGTCGCCGT GCTGGAGTCG 
CGCGGCTCGG TCGCGGAGAA TTTCGGCATT CCGCTCGCGG TGCTGCTGGA GCTGACGCAT
CGCTGTCCGC TGCAATGTCC GTACTGCTCG AATCCCGTGG AGCTCGAGCG CGGCGGCGCC
GAACTCACGA CCGACGAGTG GAAGCGCGTG CTCGGCGAAC TCGCCGCCAT CGGCGTGCTG
CAGGTGCATT TTTCCGGCGG CGAGCCGACC GCGCGCAAGG ACCTCGTCGA GCTGGTGCGA
CACGCCAGCG ACGTCGGGCT GTACACCAAT CTGATCACCT CGGCGGTGCT GCTGACGCGC
GAGCGCCTCG CGGCGCTGGC CGATGCCGGG CTGTGCCATG TCCAGATCAG CTTTCAGGGC
TACGAACCTG TCGTTGCCGA TCGCGTCGCC GGATTCGCGA ACGGCCATGC GAAGAAGATC
GAAGCCGCCC GCTGGACCCG CGAACTCGAT CTGCCGCTCA CCGTCAATGC GGTGATGCAC
CGCCAGAACC TGCATCAATT GCCGGACATC ATCGACATGG CGGTGGCACT CGACGCCGAC
CGGCTCGAAG TCGCCAATGT GCAGTATTAC GGCTGGGCGC TGAAGAACCG CGCCGCGCTG
ATGCCGACGC TGCAGCAGAT CGATGACTGC ACCGCGATCG TGGAGGCCGC GCAGTCGCGG
CTGAAGGGGC AGCTCGCGAT CGACTACGTC GTGCCGGATT ACTACGCGCT GCGGCCGAAG
ACGTGCATGG GCGGCTGGGG CCGGCAGTTC TTCAACATCT CGCCGAGCGG CAAGGTGCTG
CCGTGCCACG CCGCCGAGAC CATCACCGGG CTCGCCTTCG ACTCGGTGCG TGGAGGCGCG
TCGATCGCCG AGATCTGGCG CAATTCCGAG GCGCTGAACC GCTATCGCGG CACCTCGTGG
ATGCAGCAGC CCTGTGCGAG CTGCGCCTTC AAGGAGATCG ATTTCGGCGG CTGCCGCTGC
CAGGCCTTCG CGCTCGCCGG CGACGCCGCC GCGACCGATC CGGCCTGTGC ACTGTCGCCG
CTGCACAAGC GGATCTTCAA GACTGCGGAA GCAGAGGCGG AAGCCGGGGG CGACAAATTC
GTGTATCGCA ATTTCGCCGG CGGCACGGCG GAGGGTCGTA GCACCACCTC ACCACGTCAT
TCCGGGGCGC GCACAGCGCG AACCCGGAAT CTCGCGGTGC CATGA
 
Protein sequence
MSAARNETIH AADGVAVLES RGSVAENFGI PLAVLLELTH RCPLQCPYCS NPVELERGGA 
ELTTDEWKRV LGELAAIGVL QVHFSGGEPT ARKDLVELVR HASDVGLYTN LITSAVLLTR
ERLAALADAG LCHVQISFQG YEPVVADRVA GFANGHAKKI EAARWTRELD LPLTVNAVMH
RQNLHQLPDI IDMAVALDAD RLEVANVQYY GWALKNRAAL MPTLQQIDDC TAIVEAAQSR
LKGQLAIDYV VPDYYALRPK TCMGGWGRQF FNISPSGKVL PCHAAETITG LAFDSVRGGA
SIAEIWRNSE ALNRYRGTSW MQQPCASCAF KEIDFGGCRC QAFALAGDAA ATDPACALSP
LHKRIFKTAE AEAEAGGDKF VYRNFAGGTA EGRSTTSPRH SGARTARTRN LAVP