Gene RPD_3314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3314 
Symbol 
ID4023824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3670884 
End bp3672098 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content68% 
IMG OID637963518 
Productthiolase 
Protein accessionYP_570439 
Protein GI91977780 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCCG GGCCCGCGCC AAGGGGCGCG TCCCGGAATG ACGGACGATG GGGTGATGCT 
AAGCGCCGGG CGCGCTTCAT GAGTTACATC TCCGGCACAG GCCTCACCCC GTTCGGCAAG
ATCGAAGGCT CGACCACGCT GTCGCTGATG CGCGAGGCGG CCGAGCTGGC CGTCGCCGAC
GCCGGGCTCG CGCGGAGCGA TATCGACGGG CTGCTGTGCG GCTATTCGAC GACGATGCCG
CACATCATGC TGGCTACGGT GTTCGCCGAG CATTTCGGCA TCCGGCCGAG TTACTGCCAC
GCGATACAGG TCGGCGGCGC CACCGGGATG GCAATGACGA TGCTGGCGCA TCAGCTCGTC
GAAAGCGGCG CGGCGAAGAA CATCCTGGTT GTCGGCGGCG AGAACCGGCT GACCGGCCAG
AGCCGCGACG CTTCCGTGCA GGCGCTGGCG CAGGTCGGCC ATCCGACTTA CGAAGTGCCG
CTGGGGCCGA CCATTCCTGC GTATTACGGC CTGGTCGCGT CGCGCTACAT GCACGATCAC
GGCGTCACCG AAGAGGACCT CGCCGAATTC GCGGTGCTGA TGCGCGCACA TGCGGCGACC
CATCCCGGCG CGCAGTTTCG CGATTCCATC AGCGTCGCCG AGGTGATGGC GTCGAAGCCG
ATTGCCTCGC CGCTGAAGCT GCTCGATTGC TGCCCGGTGT CGGACGGCGG CGCGGCGCTG
GTGATCAGCG CCGAGCCGAC CACGGCGCAT CGCGTCAAGG TGCGCGGCTG CGCCCAGGCG
CATACTCATC AGCACGTCAC CGCGATGCCG GCCGCGGGGC CATCGGGAGC GGAGCTTGCA
GTGGAGCGGG CGAAAGCGGC GAGCGGTGTG GCGATCGGCG ATGTCCGCTA CGCCGCGGTC
TATGACAGCT TCACCATCAC GCTGTTGATG CTACTCGAAG ACCTCGGCCT CGCCAAACGC
GGCGAAGCCG CCGCGCAAGC GCGCAGTGGG AATTTCTCGC GCGCTGGCGT GATGCCCCTG
AATACCCATG GCGGGCTGCT GAGCTACGGC CATTGCGGCG TCGGCGGCGC GATGGCGCAT
CTGGTCGAGA CCCATCTGCA GATGACCGGT CGCGCCGGCG ATCGTCAGGT CCGCGACGCC
TCGGTGGCGC TGCTGCACGG CGACGGCGGC GTGCTGTCGT CGCATGTCAG CATGTTTCTG
GAGCGGGTGC GATGA
 
Protein sequence
MDSGPAPRGA SRNDGRWGDA KRRARFMSYI SGTGLTPFGK IEGSTTLSLM REAAELAVAD 
AGLARSDIDG LLCGYSTTMP HIMLATVFAE HFGIRPSYCH AIQVGGATGM AMTMLAHQLV
ESGAAKNILV VGGENRLTGQ SRDASVQALA QVGHPTYEVP LGPTIPAYYG LVASRYMHDH
GVTEEDLAEF AVLMRAHAAT HPGAQFRDSI SVAEVMASKP IASPLKLLDC CPVSDGGAAL
VISAEPTTAH RVKVRGCAQA HTHQHVTAMP AAGPSGAELA VERAKAASGV AIGDVRYAAV
YDSFTITLLM LLEDLGLAKR GEAAAQARSG NFSRAGVMPL NTHGGLLSYG HCGVGGAMAH
LVETHLQMTG RAGDRQVRDA SVALLHGDGG VLSSHVSMFL ERVR