Gene Rpal_4004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4004 
Symbol 
ID6411686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4291100 
End bp4292026 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content67% 
IMG OID642713886 
ProductHflC protein 
Protein accessionYP_001992975 
Protein GI192292370 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0330] Membrane protease subunits, stomatin/prohibitin homologs 
TIGRFAM ID[TIGR01932] HflC protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.781168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCCG GAGTCGCAGG AGCCGTCGCG CTGATCGTTG CGTTGGTCGC GATCATCGTC 
GGCTGGTCGT CGCTGTTCAC CGTGAGCCAG ACCGAGCAGG TGCTGCTGGT CCGCCTCGGT
GAGCCGGTGC GGGTCGTCAC CGAGCCGGGT CTCCACTTCA AGGCCCCGTT CATCGATACG
GTGATCAGCA TCGACAAGCG GATTCTCGAC CTCGAGAACC CGTCCCAGGA AGTCATCGCC
GCCGACCAGA AGCGGCTGGT GGTCGACGCG TTCGCGCGTT ACCGGATCAA GAACGCGCTG
CGCTTCTATC AGAGCGTCGG CTCGATCCCG GCCGCCAACG TCCAGCTCAC CACGCTGCTC
AACGCGTCGC TGCGCCGTGT GCTCGGCGAG GTCACGTTCA TCCAGGTGGT GCGCGACGAG
CGCGAAGGCT TGATGGCGCG AATCCGCACC CAGCTCGACA AGGAAGCCGA AGGCTACGGC
ATCTCGGTGG TCGACGTCCG TATCCGCCGC GCCGATCTGC CCGAGCAGAA CAGCCAGGCG
GTGTATCAGC GGATGCAGAC CGAGCGTCAG CGCGAAGCCG CCGAGTTCCG CGCGCAGGGC
GGCCAGAAGG CGCAGGAGAT CCGCTCCAAG GCCGACCGCG AAGCCACCGT GATCATCGCC
GAGGCCAATT CCGAGGCCGA GCAGATCCGC GGTTCGGGCG ACGCCGAGCG CAACCGGCTG
TTCGCGACCG CCTATTCGAA GGACCCCGAG TTCTTCGCGT TCTATCGGTC GATGACCGCC
TATGAGCAGT CGCTGAAGAG CAACGACACC CGGTTCCTGC TGCGGCCGGA TTCGGACTTC
TTCCGGTTCT TCGGCAGTGC CGAAGGGCGG GCCCCGGCAG GCGCTGCCGC GGCGGCAGCT
CCTGCGGCCC CAACCGCACC GCGCTGA
 
Protein sequence
MKAGVAGAVA LIVALVAIIV GWSSLFTVSQ TEQVLLVRLG EPVRVVTEPG LHFKAPFIDT 
VISIDKRILD LENPSQEVIA ADQKRLVVDA FARYRIKNAL RFYQSVGSIP AANVQLTTLL
NASLRRVLGE VTFIQVVRDE REGLMARIRT QLDKEAEGYG ISVVDVRIRR ADLPEQNSQA
VYQRMQTERQ REAAEFRAQG GQKAQEIRSK ADREATVIIA EANSEAEQIR GSGDAERNRL
FATAYSKDPE FFAFYRSMTA YEQSLKSNDT RFLLRPDSDF FRFFGSAEGR APAGAAAAAA
PAAPTAPR