Gene RPB_3607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3607 
Symbol 
ID3911409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4138945 
End bp4140456 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content69% 
IMG OID637885509 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_487213 
Protein GI86750717 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.103572 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGCC AACTCCCGCC CGAGCGCATT CCCGTCATCG CCGGAATCGG CGAAATCGCC 
GATCACCCCA AGGACATTGC GCAAGGGCTG GAGCCGCTGG CGCTGCTCGA ACAGGCGGCG
CGACGCGCCG GCGACGACAG CTCTGTGCAT CTGCTGCGCG AGATCGACTC GCTCGATATC
GTCAACTTCC TGAGCTGGCG CTATCACGCG CCCGAGCAGC AGCTCGCCGC GAAACTCGGC
GTTTCGCCGC GGCACTGCTA CTACGGGCCG GTCGGCGGCG AGAGCCCGAT CCGCTTCATC
CACGAAGCGG CGCTGCGGAT CGCGCGCGGC GAAGCGCACG TCGCCGTGGT CTGCGGCGCC
GAGGCGCAAT CGACCGTCAC CAAGGCGGCG CGCGCCAAGC TCGAATTGCC GTGGACGCCG
TTCGCGAGCG ACGCGCCCGA ACCGAAGCGC GGCGCCGCGT TCCAGAAGCC GATCGCCACG
CAGCTCGGCG TCGCGCGGCC GATCACCGTG TATCCGCTGT ACGAGGCGGC GACGGCCGCG
CATTGGGGCC AGACGCCGCG GCAGGCGCTC GACGAATCCG GCGTGCTGTG GTCGCGCTAC
GCGCAGGCCG CCGCGGCCAA TCCGAACGCC TGGATCAAGC GCGCCTTCGC ACCGAGCGAG
ATCACCACGC CCTCGCCCGA CAACCGGCTG ATCGCCTGGC CCTATACCAA GCTGATGGTC
GCCAATCCGA GCGTCAATCT CGGCGCGGCA GTGCTGCTGA CCTCGCTGGC GAAGGCACGC
GAGGCAGGCA TCGCCGAGGA AAAACTGATC TACATCCATG GCGGCGCCTC GGCCGAAGAG
CCGCGCGATT ATCTCGCCCG CGATCAATTC CACCAGAGCC ACGCCCAGAA CGCGGTGTTG
GAGACGATAA AGGCGATGGT CGGCGGCGAC GGCCGCGTGT TCGACGCGAT CGAGCTGTAT
TCCTGCTTCC CTGTGGTGCC GAAAATGGCG CGGCGCACGC TTGGGCTCGG CGACGACGTG
CAGCCGACGG TGACCGGCGG CCTCACCTTC TTCGGCGCGC CGCTCAACAC CTATATGACC
CACGCGGCCT GCGCGATGGT GCGCAGGCTG CGGGGCGGCG CCAGGCTCGG CCTGCTGTAT
GGACAGGGGG GCTTCGTCAC CAAGCACCAC GCGCTGGTGC TGTCGCGCAC GCCATCGCAA
CAAGCGCTGA GCGAGAGCGT CAGCGTACAG ACGAAGGCCG ATGCGGCTTA CGGCGACGTC
CCGCCGTTCG TGACAGACGC CTCGGGCGAC GGCACGGTCG AGAGCTTCAC CGTGATCTTC
ACCGGCAAGG GCGACGTCGA ACACGGCGTC GTGGTGCTAC GCACCTCGGA CGGCGCGCGC
ACGCTGGCGC GGGTGCCGGC GCAGGATCAG GCGACGCTGG CCGTGCTGAC GAACATGGAT
CGCAGTCCGG TCGGCACGAA CGGTCCGATC ACGACGAGCG CCGATGGCGT GCTGGAGTGG
CGTGCTGTCT AG
 
Protein sequence
MASQLPPERI PVIAGIGEIA DHPKDIAQGL EPLALLEQAA RRAGDDSSVH LLREIDSLDI 
VNFLSWRYHA PEQQLAAKLG VSPRHCYYGP VGGESPIRFI HEAALRIARG EAHVAVVCGA
EAQSTVTKAA RAKLELPWTP FASDAPEPKR GAAFQKPIAT QLGVARPITV YPLYEAATAA
HWGQTPRQAL DESGVLWSRY AQAAAANPNA WIKRAFAPSE ITTPSPDNRL IAWPYTKLMV
ANPSVNLGAA VLLTSLAKAR EAGIAEEKLI YIHGGASAEE PRDYLARDQF HQSHAQNAVL
ETIKAMVGGD GRVFDAIELY SCFPVVPKMA RRTLGLGDDV QPTVTGGLTF FGAPLNTYMT
HAACAMVRRL RGGARLGLLY GQGGFVTKHH ALVLSRTPSQ QALSESVSVQ TKADAAYGDV
PPFVTDASGD GTVESFTVIF TGKGDVEHGV VVLRTSDGAR TLARVPAQDQ ATLAVLTNMD
RSPVGTNGPI TTSADGVLEW RAV