Gene RPC_3968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3968 
Symbol 
ID3969391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4420356 
End bp4421483 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content65% 
IMG OID637927072 
ProductNAD(P)(+) transhydrogenase (AB-specific) 
Protein accessionYP_533813 
Protein GI90425443 
COG category[C] Energy production and conversion 
COG ID[COG3288] NAD/NADP transhydrogenase alpha subunit 
TIGRFAM ID[TIGR00561] NAD(P) transhydrogenase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.52416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.395533 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTG CTGTTGCTAA AGAACTAGAT CCTGCTGAAC CGAGGGTCGC GGCGACGCCC 
GACACGGTGA AGAAGTTCAA AGCGTTGGGG ATCGATATCG CGATCGAGCC CGGCGCCGGG
ATCAAGTCCG GACTGCCGGA TCAGGAATTC ACCGCGGTCG GCGCCACCGT CAGCGCCGAT
GCGCTGAAGG ACGCCGATAT CATCATCAAG GTGAAGCGCC CCGAGGCCTC TGAACTTGCG
AGCTACAAGC GCGGCGCGCT GGTGATCGCC ATCATGGACC CCTACGGCAA CGAAGCCGCG
CTGAAGACCA TCGCCGACGC CGGCGTCTCG GCCTTCGCGA TGGAGCTGAT GCCGCGCATC
ACCCGCGCGC AGGTGATGGA CGTGCTGTCG AGCCAGGCCA ATCTGGCCGG CTACCGCGCC
GTGATCGAGG CCGCCGAATC GTTCGGCCGC GCCTTTCCGA TGATGATGAC CGCGGCCGGC
ACGATTCCCG CCGCCAAGGT GTTCGTGATG GGTGTCGGCG TCGCCGGCCT GCAGGCGATC
GCCACCGCGC GCCGGCTCGG CGCCGTGGTC ACCGCCACCG ACGTGCGCCC CGCCACCAAG
GAGCAGGTCG AAAGTCTCGG CGCCAAATTC CTCGCCGTCG AAGACGAGGA ATTCAAGAAC
GCCCAGACCG CCGGCGGCTA CGCCAAGGAA ATGTCCAAAG AGTATCAGGC CAAGCAGGCC
GCGCTCACCG CCGAGCACAT CAAGAAGCAG GACATCATCA TCACCACCGC GTTGATCCCC
GGCCGGCCCG CGCCGCGCCT GGTCACCGCC GAGATGGTGG CGTCGATGAA GCCCGGTTCA
GTGCTGGTTG ACCTCGCGAT CGAGCGCGGC GGCAACGTCG AAGGCGCGGT GGCCGGTCAG
GTCACCGACG TCGGCGGCAT CAAGATCGTC GGCTACACCA ACGTCGCCGG CCGGGTCGCC
GCTTCGGCCT CGAGCCTGTA TTCCCGCAAC CTGTTCAACT TCATCGAGAC GCTGTTCGAC
AAGGCGTCGA AGTCGCTCGC GGTGAAGTGG GACGACGAGT TGGTGAAGGC CACCGCGCTG
ACCAAAGACG GCGCGGTGAT TCACCCGAAC TTCCAGCCGA AAGCTTAA
 
Protein sequence
MKIAVAKELD PAEPRVAATP DTVKKFKALG IDIAIEPGAG IKSGLPDQEF TAVGATVSAD 
ALKDADIIIK VKRPEASELA SYKRGALVIA IMDPYGNEAA LKTIADAGVS AFAMELMPRI
TRAQVMDVLS SQANLAGYRA VIEAAESFGR AFPMMMTAAG TIPAAKVFVM GVGVAGLQAI
ATARRLGAVV TATDVRPATK EQVESLGAKF LAVEDEEFKN AQTAGGYAKE MSKEYQAKQA
ALTAEHIKKQ DIIITTALIP GRPAPRLVTA EMVASMKPGS VLVDLAIERG GNVEGAVAGQ
VTDVGGIKIV GYTNVAGRVA ASASSLYSRN LFNFIETLFD KASKSLAVKW DDELVKATAL
TKDGAVIHPN FQPKA