Gene RPD_3851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3851 
Symbol 
ID4024367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4289118 
End bp4290629 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content68% 
IMG OID637964055 
ProductNADH-ubiquinone oxidoreductase, chain 49kDa 
Protein accessionYP_570973 
Protein GI91978314 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.425582 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTCGC TGATCGATCT CAATCTCGAA GGCCTGCAGG TGGCGCGGCA TCGGCCGTGG 
TCCCGCACGG TTGTCGACAG CGCGGTGTGG AGCTTCGCGG CCAAGATGCT GAGCGAGCAG
CAGTGGCAGT TGCTCTCGCT GTGGGGGGAG CCGGCGATCG TGCACATGGC GCTGTTCGAT
CCTCGCACCG CCGAGATCGG CGTCGTCAGC CTCGACTGCC CGCATCGCAG TTTCCCCTCG
GTCGCAGCCA AGCATCTGCC GGCGTTGCGG CTCGAGCGCA CCATTCGCGA TCTCTACGGG
CTCGTCCCTG AGGGCGCGAT TGACGACCGG CCCTGGCTGG ATCACGGCCA ATGGGGCGTG
CACGCGCCGC TCGGTGAGCC TGTCGAATTC CTGGCGCCGT CGCCGTCCTA TCGCTTCCTG
CCAGTCGAAG GCGAGGGCGC GCATCAGGTC GCGGTCGGCC CGGTCCATGC CGGCATCATC
GAGCCCGGCC ATTTCCGGTT CACCGTCTCC GGCGAGACCG TGGTGCGGCT GGAGCAGCGG
CTCGGCTATG TGCACAAGGG CATCGACGGG CTGCTCGCCG GCGCGGAATT GTCGCGCGCC
GCGCGGCTCG CCGGCCGCGT CTCCGGCGAC AGCACGGTGG CCTATGCGCT GGCGTTTGCG
CGCGCCGTCG AGGCCGCGGC CGAGATCGCG GCGCCGGCGC GCGCGGTGTG GCTACGCGCG
CTGGTCGCCG AGCTCGAACG CCTCGCCAAT CATCTCGGCG ACATCGGCGC GATCTGCAAC
GACGCGGCAT TCGCGCTCAT GCTGGCGCAG TTCGGCGTGC TGCGCGAAAA CGTGCTGCGC
GCCGCGGACG CCGCCTTCGG CCATCGTCTG ATGCGCGACG TGGTGACGCC GGGCGGCGTG
ACGCGCGATC TCGACGCGAC CGGAACGGCA GCGATCCGCA AGGCGCTGGC CGCTGTGCAC
AACAAGCTGC CGGAACTGAT CGAACTTTAC GACCGCACGG CGTCGCTGCA GGACCGCACC
GTCGGGACCG GGATTCTCAA GGCGTCGCTG GCCGATCAAT ATGCCGCCGG CGGCTTTGTC
GGCCGCGCCT CCGGTCGCGC CTTCGATGCA CGGCGCACGC TGGGCTACGC CCCTTATGAC
GAGTTGCGCT TCGAAATCCC GGTGCGGACC GAGGGCGACG TCGACGCGCG CATCTGGATT
CGCATCAGCG AGGTCGAGCA GAGCCTCGCG CTGGTCGACC AGATTCTCGA TCGCCTGCCG
GGCGGCGATA TTTGCGTCGC CCCGGCGCTG CCGGAGCAGG TCTGCGAGGG CCTGGCGCTG
GTCGAGGGTT TTCGCGGCGA CATTCTGGTG TGGCTGCGAT TGCGTGACGG AGTCGTCGAG
CGCTGCCACC TGCGCGATCC GTCCTGGTTT CAATGGCCGC TGCTGGAAGC CGTGATCGAG
GGCAATATCA TCGCCGACTT CCCGCTGTGC AACAAATCGT TCAACTGCTC TTATTCCGGC
GTGGATCTGT AG
 
Protein sequence
MPSLIDLNLE GLQVARHRPW SRTVVDSAVW SFAAKMLSEQ QWQLLSLWGE PAIVHMALFD 
PRTAEIGVVS LDCPHRSFPS VAAKHLPALR LERTIRDLYG LVPEGAIDDR PWLDHGQWGV
HAPLGEPVEF LAPSPSYRFL PVEGEGAHQV AVGPVHAGII EPGHFRFTVS GETVVRLEQR
LGYVHKGIDG LLAGAELSRA ARLAGRVSGD STVAYALAFA RAVEAAAEIA APARAVWLRA
LVAELERLAN HLGDIGAICN DAAFALMLAQ FGVLRENVLR AADAAFGHRL MRDVVTPGGV
TRDLDATGTA AIRKALAAVH NKLPELIELY DRTASLQDRT VGTGILKASL ADQYAAGGFV
GRASGRAFDA RRTLGYAPYD ELRFEIPVRT EGDVDARIWI RISEVEQSLA LVDQILDRLP
GGDICVAPAL PEQVCEGLAL VEGFRGDILV WLRLRDGVVE RCHLRDPSWF QWPLLEAVIE
GNIIADFPLC NKSFNCSYSG VDL