Gene Rpal_4256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4256 
Symbol 
ID6411940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4575860 
End bp4577020 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content64% 
IMG OID642714138 
Producthopanoid biosynthesis associated radical SAM protein HpnH 
Protein accessionYP_001993227 
Protein GI192292622 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR03470] hopanoid biosynthesis associated radical SAM protein HpnH 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.59824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATTC CGTTTCACAA GGAACTGGTG ATCGGCGGTT ATCTGCTGAA GCAGAAGCTG 
CTCGGGCGGA AGCGTTATCC GCTGGTACTG ATGCTGGAGC CGCTGTTCCG CTGTAACCTC
GCCTGCGCCG GCTGCGGCAA GATCGACTAT CCCGACGCGA TCCTGAACCG CCGGATGACC
GCACAAGAGT GCTGGGACGC CGCCGAGGAA TGCGGCGCGC CGATGGTTGC GATCCCGGGC
GGCGAACCGC TGATCCACAA GGAGATCGGC GAGATCGTGC GCGGCCTGGT GGCGCGCAAG
AAGTTCGTGT CGCTGTGCAC CAACGCGCTG CTGCTCGAGA AGAAGCTGCA TCTGTTCGAG
CCGTCGCCCT ACCTGTTCTT CTCGGTGCAT CTCGACGGCC TGAAGGAGCA CCACGACAAG
GCGGTGTCGC AGCAGGGCGT GTTCGACCGC GCAGTCGCGG CGATCAAGGC CGCCAAGGCC
AAGGGCTTCA CCGTCAACGT CAACTGCACG GTGTTCGACG GCTACGCCGC CGAAGACATC
GCCAAGTTCA TGGACTTCAC CGAGGAACTC GGCGTCGGCG TCTCGATCTC GCCGGGCTAC
GCCTATGAGC GCGCTCCGGA CCAGGAGCAC TTCCTCAACC GCACCAAGAC CAAGAACCTG
TTCCGCGAGG TGTTCGCGCG CGGCAAGGGC AAGAAGTGGA GCTTCATGCA CTCCAGCATG
TTCCTCGACT TCCTGGCCGG CAATCAGGAG TTCGAGTGCA CGCCGTGGGG TATGCCGGCG
CGCAACATTT TCGGCTGGCA GAAGCCCTGC TACCTGCTCG GCGAAGGCTA CGCCAAGACT
TTCCAGGAGC TGATGGAAAC CACCGATTGG GATTCCTACG GCACCGGCAA GTACGAGAAG
TGCGCCGACT GCATGGCGCA TTGCGGCTAC GAACCGACCG CGGCGATGGC CTCTCTCAAC
AATCCGCTGA AGGCCGCCTG GGTGGCGCTC CGCGGCATCA AGACCTCGGG CCCGATGGCG
CCGGAGATCG ACATGTCGAA GCAGCGCCCG GCGCAGTACG TGTTCTCCGA GCAGGTCCAG
AAGACGCTGA CGCAGATCCG CCAGGACGAG GCCGCCGAGG CCAAGGACAA GCGGCAGGCG
GAAAGGTCGA CGGCGGCCTG A
 
Protein sequence
MAIPFHKELV IGGYLLKQKL LGRKRYPLVL MLEPLFRCNL ACAGCGKIDY PDAILNRRMT 
AQECWDAAEE CGAPMVAIPG GEPLIHKEIG EIVRGLVARK KFVSLCTNAL LLEKKLHLFE
PSPYLFFSVH LDGLKEHHDK AVSQQGVFDR AVAAIKAAKA KGFTVNVNCT VFDGYAAEDI
AKFMDFTEEL GVGVSISPGY AYERAPDQEH FLNRTKTKNL FREVFARGKG KKWSFMHSSM
FLDFLAGNQE FECTPWGMPA RNIFGWQKPC YLLGEGYAKT FQELMETTDW DSYGTGKYEK
CADCMAHCGY EPTAAMASLN NPLKAAWVAL RGIKTSGPMA PEIDMSKQRP AQYVFSEQVQ
KTLTQIRQDE AAEAKDKRQA ERSTAA