Gene Rpal_4446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4446 
Symbol 
ID6412130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4775551 
End bp4776528 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content66% 
IMG OID642714328 
Productdehydrogenase E1 component 
Protein accessionYP_001993417 
Protein GI192292812 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCGG AGACCTCGCG CCGCCTGTTG TTCGACATGC TGCGGATCCG CAGCGTGGAG 
GAGACCATCG CGGCGCGCTA CGGCGAGCAG AAGATGCGCT GCCCGACGCA TCTGTCGGTC
GGGCAGGAGG CCGTCTCTGC GGCGGCCGGG GCGGTGCTGA GGCCGACCGA TCTTGCAGTC
AGCGGTCATC GCGCCCACGC GCACTATCTT GCCAAGGGCG GATCACTGAA GGCGATGATC
GCCGAGATCT ACGGCAAGGT CACCGGCTGC GCCCGCGGCA AAGGCGGCTC GATGCATCTG
GTCGACGAGA GCGTCGGCTT CATGGGCTCG ACCGCGATCG TCGGCGGAAC GGTGCCCGTC
GGCGTCGGGC TGTCGTATCC GATGAAGCTG AATCAGACGG GTCAGATTTC CTGCGTGTTT
CTTGGCGACG CGGTTCCGGA AACCGGCGTG TTCTTCGAGT CGGTGAACTT CGCGGTCGTG
AAGCAGCTCC CGGTGTTGTT CCTGTGCGAG AACAATGGCT ACTCGGTGTA TTCGCCGCTG
TCGGTGCGGC AGCCGCCCTG CCGCAAGCTG TACGAGCTTG TCGCCGGCTT CGGCCTCAAG
ACGCATCACG GCGACGGCAA TGATGCGCGC GCCGTGTATG CCGCGCTGAG CGAAGGCGTT
GCGGCGATCC GGGCCGGCGA GGGGCCGCGG TTCTACGAAT TCGAGACCTA TCGCTGGCGC
GAGCATTGCG GCCCGATGTA CGACAACGAT CTCGGCTATC GCACGGCGTC CGAATTCGAG
GCGTGGAAGC TACGCGATCC GGTGCCGACG CTGCAGCGCG CGCTGATCAC CGAAGCTATC
GTGACCGCCG CCGACGTCGC CGACATGCAG GCGGAGATCG ATGCCGAGAT CGAGGAGGCG
TTCGCCTTCG CAGAGAGCTC GCCGTTTCCG CCGCCCGAAG ACGCCTTCAC CGACGTCTAT
GCGTCAGCAG CAGGCTAA
 
Protein sequence
MNPETSRRLL FDMLRIRSVE ETIAARYGEQ KMRCPTHLSV GQEAVSAAAG AVLRPTDLAV 
SGHRAHAHYL AKGGSLKAMI AEIYGKVTGC ARGKGGSMHL VDESVGFMGS TAIVGGTVPV
GVGLSYPMKL NQTGQISCVF LGDAVPETGV FFESVNFAVV KQLPVLFLCE NNGYSVYSPL
SVRQPPCRKL YELVAGFGLK THHGDGNDAR AVYAALSEGV AAIRAGEGPR FYEFETYRWR
EHCGPMYDND LGYRTASEFE AWKLRDPVPT LQRALITEAI VTAADVADMQ AEIDAEIEEA
FAFAESSPFP PPEDAFTDVY ASAAG