Gene Rpal_5044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5044 
Symbol 
ID6412738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5425625 
End bp5426539 
Gene Length915 bp 
Protein Length304 aa 
Translation table11 
GC content67% 
IMG OID642714929 
Productthioesterase superfamily protein 
Protein accessionYP_001994008 
Protein GI192293403 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2050] Uncharacterized protein, possibly involved in aromatic compounds catabolism 
TIGRFAM ID[TIGR00369] uncharacterized domain 1 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGACC AATCGATCCC TGATCCGGTA ATGTCGTTCG ACGACAAAGC GCGGATGATT 
CAGCACCGCC GTTCGATCCA CGGCGCCATC ATCGGCCTGC AGCTCGACCG CTACGCACCC
GCGGAAGCGT GGAGCAGCCT GCCCTATCAC CCGGTATTCG TCGGCGACGT TTCGACCGGC
GTGATCCATG GCGGCGTCGT CACCGCGATG CTCGACGAGA GCTGCGGCAT GGCGGTGCAG
CTCGCGCTGC CCGGCACCAC CGCGATCGCC ACGCTCGACC TGCGGATCGA TTACTTGCGA
CCGGCGACAC CCGGACAGGT GATGCGCGCG CACGCGCATT GCTATCACCT CACCCGTTCG
ATCGCGTTCG TCCGCGCCAC CGCGTATCAG GACGCCGAGG ACGTTCCGAT CGCCACCGCC
ACCGCGATGT TCATGGTCGG CGCCAACCGC ACCGATATGC TGCGGCAGAC GCCGAAGGTG
ACGATGGACT CCGCGCCCGA GCTGGTCGCC CCCGAAGATC CGGACGGCGG GCCGCTTGCG
ATCAGCCCGT ATCCACGCTT CCTCGGCATC CGCGTCGATG GCGATGCCCA GGCGATGATG
CCATATCATC CGAAGCTGGT CGGCAATCCG ATCCTGCCGG CGCTGCATGG CGGCGTGATC
GGCGCCTTCC TGGAGACCGC CGCGATCGTC AGCGTGCGCC GAGAGATCGG CCTCGCCACC
GCGCCGAAAC CGATCGGGCT CACCGTGAAC TATCTGCGTT CGGGCCGGCC GCTCGACACC
TTCGCCAAGG TCTCGATCGT CAAGCAGGGC CGCCGGGTGG TTGCGTTCGA AGCGCAGGCC
TATCAGCGCG ATCCGGCCGA GCCGATCGCG TCGTGCTACG GCCACTTCAA GCTGCGCTCC
GGCCCGGCGG AGTAA
 
Protein sequence
MSDQSIPDPV MSFDDKARMI QHRRSIHGAI IGLQLDRYAP AEAWSSLPYH PVFVGDVSTG 
VIHGGVVTAM LDESCGMAVQ LALPGTTAIA TLDLRIDYLR PATPGQVMRA HAHCYHLTRS
IAFVRATAYQ DAEDVPIATA TAMFMVGANR TDMLRQTPKV TMDSAPELVA PEDPDGGPLA
ISPYPRFLGI RVDGDAQAMM PYHPKLVGNP ILPALHGGVI GAFLETAAIV SVRREIGLAT
APKPIGLTVN YLRSGRPLDT FAKVSIVKQG RRVVAFEAQA YQRDPAEPIA SCYGHFKLRS
GPAE