Gene Rpal_3936 details

Gene Information

Locus tag: Rpal_3936
Symbol:
ID: 6411617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4224067
End bp: 4225140
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 66%
IMG OID: 642713817
Product: 3-hydroxyisobutyryl-CoA hydrolase
Protein accession: YP_001992907
Protein GI: 192292302
COG category: [I] Lipid transport and metabolism
COG ID: [COG1024] Enoyl-CoA hydratase/carnithine racemase
TIGRFAM ID:
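
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the terminal stop codon; both assumptions are consistent with the values listed but are not stated by the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 4224067
end_bp = 4225140

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not counted
# as an amino acid residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1074
print(protein_length)  # 357
```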


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGGTG CCGTGACCGA ACCCGATCTG ATCGTCCGGC GCGAAGGTGC CGCCGGTGTA 
ATCCGTCTCA ACCGGCCGAA GGCGATCAAC GCGCTGACGC TGGAGATGGC GCGCGAGCTC
GATCGGGCGC TCGACACATT CGCTGCTGAT CCGGATGTCG CTCTGATCCT GGTCGAGGGC
GCCGGTGAGC GCGGGCTGTG CGCCGGCGGG GACATCGTCG GCCTCTACAA CAGCGCGCGC
GAAGGCGGCG ATCTCGGCGA GGTGTTCTGG CGCGAGGAAT ACATCACCAA CGCTCGAATC
GCGAAGTATC CGAAGCCCTA TGTGGCGTTC ATGGACGGCT TGGTGATGGG CGGCGGCGTC
GGCATCGCCG GCCACGGCAG CCACCGCATC GTCACCGACA AGACCAAGAT CGCAATGCCG
GAAGTCGGCA TCGGCTTTTT CCCCGATGTC GGCGGCACCT GGCTGCTGAC GCGCTCGCCC
GGCGAGATCG GAACGTATTT CGGCCTGACC GGCGACATCA TGAACGGCGC CGACGCGATC
CACGCGCAAT TCGCTGATGC CTACGTGCCA ACCGCGAACT GGGCGGCGCT GCGTCAGGCG
CTCACCGAGG CACCGAGCGG GGCTTCGGCC GAGCAGGTCA GCGGCATCAT CGCGCGTTTT
GCGGAACAAC CGGCCCAAGG CCCCGCCGAA CTGCATCAGG CCGACATCGA CCGCTGGTTC
GGGCACGATA CGATCGAACA GATCGTCGCC GCGCTGGAGA GCGATGCGTC GGAGTTCGCG
CAAGCCGCGC TGAAGACGCT GCGGATCAAA TCACCGACCA GCCTCAAAGT GACGCTGAAG
CTGCTGCGCG CCGCGCGGCA CAGGGCGTCG CTGGAAGAGT GTCTGGTGCA TGAATATCGC
GCCGCACTGC AGGTGTTCGT CAGCGACGAT TTCGTCGAGG GCGTCCGTGC CGCGGTGATC
GACAAGGACC GCCAGCCGAA ATGGCGGCCT GCGACGATCG CAGAGGTGAC GCCGGAGATC
GTGGCGCGCT ACTTTGAAGA TCGCGGCGAC GACGAGCTGA AATTTCCGGG CTGA
 
Protein sequence
MNGAVTEPDL IVRREGAAGV IRLNRPKAIN ALTLEMAREL DRALDTFAAD PDVALILVEG 
AGERGLCAGG DIVGLYNSAR EGGDLGEVFW REEYITNARI AKYPKPYVAF MDGLVMGGGV
GIAGHGSHRI VTDKTKIAMP EVGIGFFPDV GGTWLLTRSP GEIGTYFGLT GDIMNGADAI
HAQFADAYVP TANWAALRQA LTEAPSGASA EQVSGIIARF AEQPAQGPAE LHQADIDRWF
GHDTIEQIVA ALESDASEFA QAALKTLRIK SPTSLKVTLK LLRAARHRAS LEECLVHEYR
AALQVFVSDD FVEGVRAAVI DKDRQPKWRP ATIAEVTPEI VARYFEDRGD DELKFPG
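
The sequences above are displayed in space-separated blocks of 10, so they need to be joined before any downstream use. A minimal sketch of that cleanup, with a simple GC-fraction helper for comparison against the GC content field, might look like the following. The function names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tooling.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence above, used as a short sample.
sample = "ATGAATGGTG CCGTGACCGA ACCCGATCTG ATCGTCCGGC GCGAAGGTGC CGCCGGTGTA"
seq = clean_sequence(sample)

print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of the sample line only
```

Applied to the full gene sequence, `clean_sequence` should yield 1074 bases, matching the Gene Length field, and `gc_fraction` can then be compared with the listed GC content.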