Gene Rpal_3952 details


Gene Information

Locus tag: Rpal_3952
Symbol:
ID: 6411633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4239801
End bp: 4240937
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 68%
IMG OID: 642713833
Product: putative thiolase
Protein accession: YP_001992923
Protein GI: 192292318
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID:
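
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a complete CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4239801, 4240937)
print(length_bp)                  # 1137
print(protein_length(length_bp))  # 378
```

Both results agree with the Gene Length and Protein Length fields of this record.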


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.130005
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTACA TCACCGGCGT GGGTCTCACG CCGTTCGGCA AGATCGATGG TTCGACCACG 
CTCGGCCTGA TGCGGGAGGC GGCGGAGGCG GCGATCGCGG ATGCGGGGCT GAAGCGCGGC
GACATCGACG GGCTGCTGTG CGGCTATTCG ACCACGATGC CGCACATCAT GCTGGCGACG
GTGTTCGCCG AGCATTTCGG CATCCGGCCG AGCTATTGCC ACGCGGTGCA GGTCGGCGGC
GCCACCGGCA TGGCGATGGC GATGCTGGCG CATCAGCTGG TCGAGAGCGG GGCGGCCAAG
AACATCCTGG TGGTGGGCGG CGAGAACCGG CTGACCGGGC AGAGCCGCGA CGCCTCGGTG
CAGGCCCTGG CGCAGGTTGG TCACCCGATC TACGAGGTGC CGCTGGGGCC GACCATCCCG
GCCTATTACG GCCTGGTGGC GTCGCGCTAC ATGCACGACC ACGGCGTCAC CGAGGAAGAC
CTCGCCGCGT TCGCGGTGCT GATGCGCAGC CACGCGATCA CCCATCCCGG CGCGCAGTTT
CACGAGCCGA TCAGCGTCGC CGAGGTGATG GCGTCGAAGC CGATTGCCTC GCCTCTGAAG
CTGCTCGATT GCTGCCCGGT GTCCGATGGC GGCGCCGCGC TGGTGATCAG CCGCGAGCCG
ACTACCGCGC ATCAGATCAA GGTGCGCGGC TGCGGCCAGG CTCATACCCA TCAGCACGTC
ACAGCAATGC CGGCGGATGG GCCGTCTGGA GCGGAGCTGT CGATCGCGCG CGCCTGGGCC
ACAAGCGGTG TCGGAATTGC CGACGTGAAA TATGCTGCCG TGTACGACAG CTTCACCATC
ACGCTGCTGA TGCTGCTCGA AGACCTCGGG CTCGCAGGCC GAGGCGAGGC GGCGGCGCGG
GCGCGGGACG GCCACTTCTC GCGAACCGGC GCGATGCCGC TGAACACCCA TGGCGGCCTA
TTGTCCTACG GCCATTGCGG CGTCGGCGGC GCGATGGCGC ATCTGGTCGA GACGCATCTG
CAGATGACCG GCCGGGCCGG CGACCGTCAG GTGCGTGATG CGTCGCTGGC GCTGCTGCAC
GGCGATGGCG GCGTGTTGTC GTCGCATGTC AGCATGATCC TGGAGCGGGT GCGATGA
 
Protein sequence
MSYITGVGLT PFGKIDGSTT LGLMREAAEA AIADAGLKRG DIDGLLCGYS TTMPHIMLAT 
VFAEHFGIRP SYCHAVQVGG ATGMAMAMLA HQLVESGAAK NILVVGGENR LTGQSRDASV
QALAQVGHPI YEVPLGPTIP AYYGLVASRY MHDHGVTEED LAAFAVLMRS HAITHPGAQF
HEPISVAEVM ASKPIASPLK LLDCCPVSDG GAALVISREP TTAHQIKVRG CGQAHTHQHV
TAMPADGPSG AELSIARAWA TSGVGIADVK YAAVYDSFTI TLLMLLEDLG LAGRGEAAAR
ARDGHFSRTG AMPLNTHGGL LSYGHCGVGG AMAHLVETHL QMTGRAGDRQ VRDASLALLH
GDGGVLSSHV SMILERVR
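
The gene and protein sequences can be cross-checked against each other with a short sketch like the one below. It assumes the gene sequence is given in the coding orientation, strips the spaces used to group bases in blocks of ten, and uses the codon assignments of translation table 11, which are the same as those of the standard code (table 11 differs only in its permitted start codons). The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    this gene starts with ATG, so that does not matter for this record.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCTACA TCACCGGCGT GGGTCTCACG"))  # MSYITGVGLT
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) checks the whole record, and `gc_percent` on the full gene sequence can be compared with the GC content field.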