Gene RPC_1964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1964 
Symbol 
ID3973637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2136672 
End bp2138192 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content64% 
IMG OID637925075 
Productaldehyde dehydrogenase 
Protein accessionYP_531840 
Protein GI90423470 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTACG CCGCTCCCGG CACCGCCGGA GCCCCAGTCG ATTTCAAGTC GCGCTACGAC 
AATTTCATCG GCGGCCGCTG GTCGGCGCCG GTGAACGGCC GCTATTTCGA CAGCGTCACC
CCGATCACCG GGCAAGCCTT TACCCAGGCT GCACGTTCGG ATGAAGTTGA TATCACGCTC
GCGCTCGACG CCGCTCACGC CGCCGCCGAT GCCTGGGGCC GCACCAGCGT CGCCGAACGC
GCGCTGGTGC TGAACCGCAT CGCCGACCGC ATGGAAGAGA ATCTCGAACG GCTCGCTTAT
GCGGAGTCCG TCGACAACGG CAAGCCGATC CGCGAGACGC TGGCCGCCGA CATTCCGTTG
GCGATCGATC ATTTCCGTTA CTTCGCCTCG TGCGTCCGCT CGCAAGAAGG CACGCTGGCG
CAGCTCGACG AACACACCGT CGCCTATCAC TTCCACGAGC CGCTCGGCGT GGTCGGCCAG
ATCATTCCGT GGAATTTCTC GATCCTGATG GCGGCGTGGA AATTGGCGCC GGCGTTGGCC
TCCGGCAACT GCATCGTGCT GAAGCCCGCC GAGCAGACTC CGATCAGCAT CCTGGTGCTG
GTGGAGCTGA TCGCCGATCT GCTGCCGCCG GGCGTGCTCA ACGTGGTCAA CGGCTTCGGC
CTGGAGGCGG GCAAGCCGCT GGCGTCTTCG AACCGCATCT CCAAGATCGC TTTCACCGGC
GAGACCAGCA CCGGCCGGCT GATCATGCAA TACGCCAGCG CCAATCTGAT CCCGGTGTCG
CTCGAGCTGG GCGGCAAGTC GCCGAACATC TTCTTCGACG ACGTCGCCGC TTCGGACGAC
GCTTACTTCG ACAAGGCGAT CGAGGGCTTC GTGATGTTCG CGCTCAACCA GGGCGAGGTC
TGCACCTGTC CGTCGCGCGC GCTGATTCAG GAGTCGCTGT ACGACCGCTT CATCGATCGG
GCGCTGGCGC GGGTGACGGC GATCCGTCAG GGCAATCCGC TCGACACCGA GACCATGATC
GGAGCTCAAG CCTCCTCCGA GCAGATGGAG AAGATCCTGT CTTACTTCAC CATCGGCCGC
GACGAGGGCG CCAAGGTGCT GACCGGCGGC GCGCGCGCCG AGCTCGGCGG CGATCTCGCC
GAGGGCTACT ACGTCCAGCC GACCGTGCTG AAGGGGCACA ACCGGATGCG GGTGTTCCAG
GAAGAGATCT TCGGGCCGGT CGTCGCGGTC ACCACCTTCA AGGATGAGGA CGAGGCGCTG
CATCTGGCCA ACGACACCCA TTATGGCCTC GGTGCCGGCG TCTGGACCCG CGATGGCAAC
CGGGCCTACC GCTTCGGCCG CGGCATCAAA GCGGGCCGGG TGTGGACCAA CTGCTACCAC
CTCTATCCGG CGCATGCGGC GTTCGGCGGC TACAAGCAAT CCGGGATCGG CCGTGAAAAC
CATCACATGA TGCTAGACCA TTATCAGCAG ACCAAGAACC TGCTGGTCAG CTACAGCCCC
GACGCGCTGG GCTTCTTCTA A
 
Protein sequence
MKYAAPGTAG APVDFKSRYD NFIGGRWSAP VNGRYFDSVT PITGQAFTQA ARSDEVDITL 
ALDAAHAAAD AWGRTSVAER ALVLNRIADR MEENLERLAY AESVDNGKPI RETLAADIPL
AIDHFRYFAS CVRSQEGTLA QLDEHTVAYH FHEPLGVVGQ IIPWNFSILM AAWKLAPALA
SGNCIVLKPA EQTPISILVL VELIADLLPP GVLNVVNGFG LEAGKPLASS NRISKIAFTG
ETSTGRLIMQ YASANLIPVS LELGGKSPNI FFDDVAASDD AYFDKAIEGF VMFALNQGEV
CTCPSRALIQ ESLYDRFIDR ALARVTAIRQ GNPLDTETMI GAQASSEQME KILSYFTIGR
DEGAKVLTGG ARAELGGDLA EGYYVQPTVL KGHNRMRVFQ EEIFGPVVAV TTFKDEDEAL
HLANDTHYGL GAGVWTRDGN RAYRFGRGIK AGRVWTNCYH LYPAHAAFGG YKQSGIGREN
HHMMLDHYQQ TKNLLVSYSP DALGFF