Gene RPC_1965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1965 
Symbol 
ID3973638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2138308 
End bp2139459 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content66% 
IMG OID637925076 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_531841 
Protein GI90423471 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGA CCACCTTTTT CATTCCCTCG CTGAATATGA TGGGCGCCGG CTGCCTCACC 
GGCGCCGTCT CCAGCGTGAG CCGCTACGGC TTCAAGCGGG CGCTGATCGT CACCGACCAG
ATGCTGTTCA AGCTCGGCAT GGCCGACAAG CTCGCCTCAC AGCTCGAGCA GCAGGGCATC
GCCGCCTCGA TCTTTCCGGG CGCGCAACCC AACCCCACCG TCGGCAATGT CGAGAGCGGT
CTGGCTCAAC TCCAGGCCGA CGGCTGCGAC TGCGTGATCT CGCTCGGTGG CGGGTCGTCG
CACGACTGCG CCAAGGGCAT CGCGCTCACC GCCACCAACG GCGGCAACAT CCGCGATTAC
GAAGGCGTCG ACCGCTCGGC CAAACCGCAG CTGCCGCTGA TCGCCATCAA CACCACCGCC
GGCACCGCCA GCGAGATGAC GCGGTTCTGT ATCATCACGG ACGAGAAGCG GCTGGTGAAG
ATGGCGATCG TCGATCGCAA CGTCACGCCA CTTCTCTCGG TCAACGACCC CGAACTGATG
TTGGGCAAGC CGCAGATGCT GACCGCGGCC ACCGGCATGG ACGCACTGAC TCATGCGATC
GAGGCTTATG TCTCGGTCGC GGCGACGCCG ATCACCGACG CCTGCGCGCT GAAGGCGATC
GCGATCATCG CCAACAACCT GCGTACCGCG GTCGCCGAAG GCAGCAATCT GGCGGCACGC
GAAGCCATGG CCTATGCCGG CTTCATGGCC GGCATGGCCT TCAACAATGC CTCGCTCGGC
TACGTTCACG CCATGGCGCA TCAATTGGGC GGCTTCTACG ACCTGCCGCA CGGGGTCTGC
AATGCGGTGC TGCTGCCGCA TGTCGAAGCC TTCAATGCGG AGGTCTCGGC CGCCCGCTTG
CGCGACGTCG CCCACGCGCT CGGCGTGGAC ACCAACGGCA TGAGCGACCA GCAGGGCGCG
GAGGCTTGCA TCCATGCGAT CCAGCGGCTG TCGACCGACA TCGGCATTCC GACCGGGCTC
GCGCAGCTCG GGGTCAAGGA AGAAGACATT CCGACCCTGG CGGCCAACGC GTTGAAAGAC
GCCTGCGGCC TGACCAATCC GCGCCGCGCC AGCCAGGCCG ACATCGAAGC CATCTTCCGC
GTCGCCGCGT GA
 
Protein sequence
MTETTFFIPS LNMMGAGCLT GAVSSVSRYG FKRALIVTDQ MLFKLGMADK LASQLEQQGI 
AASIFPGAQP NPTVGNVESG LAQLQADGCD CVISLGGGSS HDCAKGIALT ATNGGNIRDY
EGVDRSAKPQ LPLIAINTTA GTASEMTRFC IITDEKRLVK MAIVDRNVTP LLSVNDPELM
LGKPQMLTAA TGMDALTHAI EAYVSVAATP ITDACALKAI AIIANNLRTA VAEGSNLAAR
EAMAYAGFMA GMAFNNASLG YVHAMAHQLG GFYDLPHGVC NAVLLPHVEA FNAEVSAARL
RDVAHALGVD TNGMSDQQGA EACIHAIQRL STDIGIPTGL AQLGVKEEDI PTLAANALKD
ACGLTNPRRA SQADIEAIFR VAA