Gene Rpal_1398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1398 
Symbol 
ID6409055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1470990 
End bp1472141 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content66% 
IMG OID642711297 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_001990413 
Protein GI192289808 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.145523 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCAA CGACCTTCTT CATTCCCTCT CTCAATCTGT TCGGCGCCGG CTGCGTATCG 
AGCGCAGCGG ACCACGCCAA GGCACGCGGC TTCAAGCGCG CGCTGATCGT CACCGACAGC
GGGCTGCACA AGCTCGGCGT CGCCGATCAG ATCGCCTCGA TGCTGATCGA GCGCAACGTG
ACCAGCGTCG TCTTCCCGGG CGCGAAGCCA AACCCGACGA TCAAGAACGT CGAGGACGGG
CTTGCACTGC TGAAGCAGGA ACACTGCGAC TGTGTGATCT CGCTCGGCGG CGGTTCAGCG
CACGACTGCG CGAAGGGCAT CGCGCTGACC GCCACCAATG GCGGCAGCAT CAAGGACTAT
GAAGGCGTCG ATCGGTCGGC GCACGCTCAG CTTCCGCTGA TCGCCATCAA CACCACGGCC
GGTACGGCGA GTGAGATGAC GCGGTTCTGC ATCATCACCG ACGAGGAACG CCAGGTGAAG
ATGGCGATCG TCGACCGCCA CACCACGCCG CTGCTGTCGG TCAACGATCC GGTACTGATG
CTCGGCAAGC CGCCGGCCCT CACCGCCGCG ACCGGCATGG ACGCGCTGAC GCACGCGATC
GAAGCCTATG TGTCGATTGC CGCAACGCCG ATCACTGACG CCTGCGCGCT GAAGGCGATG
TCGATCATCT CCAACAGTCT GCGCACCGTG GTCGCCGAGG GCCAGAACCT CGTCGCCCGC
GAGGCGATGT CGTATGCGGG CTTCCTCGCC GGCATGGCGT TCAACAATGC CTCGCTCGGC
TATGTACATG CGATGGCGCA CCAGCTCGGC GGCTTCTACG ACCTGCCGCA CGGCGTCTGC
AACGCGGTGC TGCTGCCGCA CGTGCAGGCC TACAACGCGC AAGTCGCGGC GGGACGGCTG
AAGGACGTCG CACACGCGCT CGGCGTCGAC ACCACCGGCA TGACCGATGC CCAGGGCGCC
GATGCCGCCA TTCATGCCAT CCAGCGGCTA TCGGCCGATG TCGGCATTCC GCCCGGTCTC
GGCGGTCTCG GCATGAAGGA AACCGACGTG CCGATCCTCG CCGCCAACGC GCTGAAGGAT
GCGTGCGGCT TCACCAATCC GAAGCAGGCG ACGCAGACCG AGATCGAAAC CATCTTCCGG
GCGGCCGCCT GA
 
Protein sequence
MTATTFFIPS LNLFGAGCVS SAADHAKARG FKRALIVTDS GLHKLGVADQ IASMLIERNV 
TSVVFPGAKP NPTIKNVEDG LALLKQEHCD CVISLGGGSA HDCAKGIALT ATNGGSIKDY
EGVDRSAHAQ LPLIAINTTA GTASEMTRFC IITDEERQVK MAIVDRHTTP LLSVNDPVLM
LGKPPALTAA TGMDALTHAI EAYVSIAATP ITDACALKAM SIISNSLRTV VAEGQNLVAR
EAMSYAGFLA GMAFNNASLG YVHAMAHQLG GFYDLPHGVC NAVLLPHVQA YNAQVAAGRL
KDVAHALGVD TTGMTDAQGA DAAIHAIQRL SADVGIPPGL GGLGMKETDV PILAANALKD
ACGFTNPKQA TQTEIETIFR AAA