Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3607 |
Symbol | |
ID | 3911409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4138945 |
End bp | 4140456 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637885509 |
Product | acetyl-CoA acetyltransferase |
Protein accession | YP_487213 |
Protein GI | 86750717 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.103572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGCC AACTCCCGCC CGAGCGCATT CCCGTCATCG CCGGAATCGG CGAAATCGCC GATCACCCCA AGGACATTGC GCAAGGGCTG GAGCCGCTGG CGCTGCTCGA ACAGGCGGCG CGACGCGCCG GCGACGACAG CTCTGTGCAT CTGCTGCGCG AGATCGACTC GCTCGATATC GTCAACTTCC TGAGCTGGCG CTATCACGCG CCCGAGCAGC AGCTCGCCGC GAAACTCGGC GTTTCGCCGC GGCACTGCTA CTACGGGCCG GTCGGCGGCG AGAGCCCGAT CCGCTTCATC CACGAAGCGG CGCTGCGGAT CGCGCGCGGC GAAGCGCACG TCGCCGTGGT CTGCGGCGCC GAGGCGCAAT CGACCGTCAC CAAGGCGGCG CGCGCCAAGC TCGAATTGCC GTGGACGCCG TTCGCGAGCG ACGCGCCCGA ACCGAAGCGC GGCGCCGCGT TCCAGAAGCC GATCGCCACG CAGCTCGGCG TCGCGCGGCC GATCACCGTG TATCCGCTGT ACGAGGCGGC GACGGCCGCG CATTGGGGCC AGACGCCGCG GCAGGCGCTC GACGAATCCG GCGTGCTGTG GTCGCGCTAC GCGCAGGCCG CCGCGGCCAA TCCGAACGCC TGGATCAAGC GCGCCTTCGC ACCGAGCGAG ATCACCACGC CCTCGCCCGA CAACCGGCTG ATCGCCTGGC CCTATACCAA GCTGATGGTC GCCAATCCGA GCGTCAATCT CGGCGCGGCA GTGCTGCTGA CCTCGCTGGC GAAGGCACGC GAGGCAGGCA TCGCCGAGGA AAAACTGATC TACATCCATG GCGGCGCCTC GGCCGAAGAG CCGCGCGATT ATCTCGCCCG CGATCAATTC CACCAGAGCC ACGCCCAGAA CGCGGTGTTG GAGACGATAA AGGCGATGGT CGGCGGCGAC GGCCGCGTGT TCGACGCGAT CGAGCTGTAT TCCTGCTTCC CTGTGGTGCC GAAAATGGCG CGGCGCACGC TTGGGCTCGG CGACGACGTG CAGCCGACGG TGACCGGCGG CCTCACCTTC TTCGGCGCGC CGCTCAACAC CTATATGACC CACGCGGCCT GCGCGATGGT GCGCAGGCTG CGGGGCGGCG CCAGGCTCGG CCTGCTGTAT GGACAGGGGG GCTTCGTCAC CAAGCACCAC GCGCTGGTGC TGTCGCGCAC GCCATCGCAA CAAGCGCTGA GCGAGAGCGT CAGCGTACAG ACGAAGGCCG ATGCGGCTTA CGGCGACGTC CCGCCGTTCG TGACAGACGC CTCGGGCGAC GGCACGGTCG AGAGCTTCAC CGTGATCTTC ACCGGCAAGG GCGACGTCGA ACACGGCGTC GTGGTGCTAC GCACCTCGGA CGGCGCGCGC ACGCTGGCGC GGGTGCCGGC GCAGGATCAG GCGACGCTGG CCGTGCTGAC GAACATGGAT CGCAGTCCGG TCGGCACGAA CGGTCCGATC ACGACGAGCG CCGATGGCGT GCTGGAGTGG CGTGCTGTCT AG
|
Protein sequence | MASQLPPERI PVIAGIGEIA DHPKDIAQGL EPLALLEQAA RRAGDDSSVH LLREIDSLDI VNFLSWRYHA PEQQLAAKLG VSPRHCYYGP VGGESPIRFI HEAALRIARG EAHVAVVCGA EAQSTVTKAA RAKLELPWTP FASDAPEPKR GAAFQKPIAT QLGVARPITV YPLYEAATAA HWGQTPRQAL DESGVLWSRY AQAAAANPNA WIKRAFAPSE ITTPSPDNRL IAWPYTKLMV ANPSVNLGAA VLLTSLAKAR EAGIAEEKLI YIHGGASAEE PRDYLARDQF HQSHAQNAVL ETIKAMVGGD GRVFDAIELY SCFPVVPKMA RRTLGLGDDV QPTVTGGLTF FGAPLNTYMT HAACAMVRRL RGGARLGLLY GQGGFVTKHH ALVLSRTPSQ QALSESVSVQ TKADAAYGDV PPFVTDASGD GTVESFTVIF TGKGDVEHGV VVLRTSDGAR TLARVPAQDQ ATLAVLTNMD RSPVGTNGPI TTSADGVLEW RAV
|
| |