Gene Rpal_3297 details


Gene Information

Locus tag: Rpal_3297
Symbol:
ID: 6410967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3547235
End bp: 3548443
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 65%
IMG OID: 642713173
Product: NADH dehydrogenase subunit D
Protein accession: YP_001992274
Protein GI: 192291669
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
TIGRFAM ID: [TIGR01962] NADH dehydrogenase I, D subunit
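
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above; helper names are illustrative only.
START_BP = 3547235
END_BP = 3548443


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the final stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len, protein_length_aa(gene_len))  # prints: 1209 402
```

Both results agree with the Gene Length (1209 bp) and Protein Length (402 aa) fields in the record.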


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGACG CTGCTGCTCC CGACGCTGCC AGCGTCCGCA ACTTCACCAT CAATTTCGGT 
CCGCAGCATC CGGCGGCGCA CGGCGTGCTC CGGCTGGTGC TGGAGCTCGA CGGCGAGGTG
GTCGAGCGGG TCGATCCGCA TATCGGCCTC CTGCATCGCG GCACCGAGAA GCTGATCGAG
CAGAAGACCT ATCTGCAGGC GATTCCGTAT TTCGATCGGC TCGATTACGT CGCGCCGATG
AACCAGGAAC ACGCCTTCTG CCTGGCTGTG GAAAAGCTGC TCGGGATCGC GGTGCCGCGG
CGCGCCCAAC TGATCCGCGT TCTGTACGCC GAGATCGGCC GCATCCTGTC GCATCTGCTG
AACGTCACCA CGCAGGCGAT GGACGTCGGC GCGCTGACTC CGCCGCTGTG GGGCTTCGAA
GAGCGCGAAA AGCTGATGAT GTTCTACGAG CGCGCCTCCG GCAGCCGGAT GCACGCCGCG
TATTTCCGCG TCGGCGGCGT GCATCAGGAT CTGCCGCCGA AGCTGGTCGA CGACATCGAC
GCCTGGTGTG ACGCATTCCC GGCGGTGGTG AACGATCTCG ACCGTCTGCT CAGCGACAAC
CGCATCTTCA AGCAGCGCAA CGTCGATATC GGCGTGGTGA CGCTCGATCA GGCCTGGTCC
TGGGGCTTCT CCGGCGTGAT GGTGCGCGGC TCCGGCGCGG CCTGGGACCT GCGCAAGTCG
CAGCCCTACG AATGCTACGC CGAGCTCGAT TTCGAAGTGC CGATCGGCAA GAACGGTGAC
TGCTACGACC GCTACCACAT CCGCATGGAA GAGATGCGGC AGTCGGTTCG GATCATGAAG
CAGTGCATTG CCAAGCTGCG GGCGCCGGAC GGGCAGGGGC CGGTTGTGGT CGACGACCAC
AAGATCTTCC CGCCGCGCCG CGGCGAGATG AAGCGCTCGA TGGAAGCGCT GATCCATCAC
TTCAAGCTGT ACACCGAGGG CTTCCACGTC CCGGCCGGCG AAGTCTATGT CGCGGTCGAG
GCGCCGAAGG GCGAGTTCGG CGTGTACCTG GTGTCCGACG GCAGCAACAA GCCTTACAAG
TGCAAGATCC GTGCGCCGGG CTTCGCCCAT CTGCAGGCGA TGGACTTCCT CAGCCGCGGC
CATCTGCTCG CCGACGTCTC GGCGATTCTC GGTTCGCTCG ACATCGTGTT CGGAGAGGTC
GATCGGTGA
 
Protein sequence
MADAAAPDAA SVRNFTINFG PQHPAAHGVL RLVLELDGEV VERVDPHIGL LHRGTEKLIE 
QKTYLQAIPY FDRLDYVAPM NQEHAFCLAV EKLLGIAVPR RAQLIRVLYA EIGRILSHLL
NVTTQAMDVG ALTPPLWGFE EREKLMMFYE RASGSRMHAA YFRVGGVHQD LPPKLVDDID
AWCDAFPAVV NDLDRLLSDN RIFKQRNVDI GVVTLDQAWS WGFSGVMVRG SGAAWDLRKS
QPYECYAELD FEVPIGKNGD CYDRYHIRME EMRQSVRIMK QCIAKLRAPD GQGPVVVDDH
KIFPPRRGEM KRSMEALIHH FKLYTEGFHV PAGEVYVAVE APKGEFGVYL VSDGSNKPYK
CKIRAPGFAH LQAMDFLSRG HLLADVSAIL GSLDIVFGEV DR
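
As a rough consistency check between the two sequences above, the sketch below translates the first 60 bases and the last 9 bases of the gene sequence and compares them with the start and end of the protein sequence. It is a minimal illustration, not the database's own method: the `translate` helper is hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments are shared by translation table 11 (the two tables differ only in permitted start codons).

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping and differs only in allowed start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# First and last lines of the gene sequence, copied from the record.
FIRST_60 = "ATGGCTGACG CTGCTGCTCC CGACGCTGCC AGCGTCCGCA ACTTCACCAT CAATTTCGGT"
LAST_9 = "GATCGGTGA"


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = dna.replace(" ", "").upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


print(translate(FIRST_60))  # prints: MADAAAPDAASVRNFTINFG
print(translate(LAST_9))    # prints: DR*
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final two residues followed by the TGA stop codon.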