Gene Information |
Locus tag | RPB_1212 |
Symbol | |
ID | 3910147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1385868 |
End bp | 1386953 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637883106 |
Product | chorismate synthase |
Protein accession | YP_484833 |
Protein GI | 86748337 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
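The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function name `check_cds_lengths` is hypothetical, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def check_cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes inclusive coordinates and exactly one stop codon at the end.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # the stop codon encodes no residue
    return gene_length, protein_length


# Start bp and End bp as listed for RPB_1212.
gene_len, protein_len = check_cds_lengths(1385868, 1386953)
print(gene_len, protein_len)  # 1086 361
```

Under these assumptions the result agrees with the listed Gene Length (1086 bp) and Protein Length (361 aa).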
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.535517 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTCA ATACCTTCGG CCACATGTTT CGTGTCACCA CGTTCGGCGA AAGCCACGGG GTGGCGATCG GCTGCGTGGT CGACGGCTGT CCGCCGCTGA TCCCGCTGAC CGAGGCCGAC ATCCAGGGCG ACCTCGACCG CCGCCGGCCG GGGCAGTCGC GCTTCACCAC GCAGCGCCAG GAAGCCGACC AGGTGAAGAT CCTGTCCGGC GTGATGGCGC ATCCGGAGAC CGGCGTGCAG GTCACCACCG GGACGCCGAT CGCGCTCTTG ATCGAGAACA CCGACCAGCG CTCCAAGGAT TATTCCGAGA TCCAGAACAA GTTTCGGCCC GGCCATGCCG ACTTCACCTA TGAGGCGAAA TACGGCATCC GCGACTATCG CGGCGGCGGC CGCTCCTCGG CGCGCGAGAC CGCGACCCGC GTCGCCGCCG GCGCGGTGGC GCGCAAGGTG ATCGCCGGCA TGACCGTGCG CGGCGCGCTG GTGCAGATCG GCCCGCACCA GATCGACCGC GACAAATGGG ACTGGGCCGA GATCGGCAAC AACCCGTTCT TCTGCCCCGA CAAGGACAAG GCGGCGTTCT TCGCCGATTA TCTCGACGGC ATCCGCAAGA GCGGCTCGTC GATCGGCGCG GTGATCGAAG TGGTCGCCGA AGGCGTGCCC GCGGGCCTCG GCGCGCCGAT CTACGCCAAG CTCGACACCG ACCTCGCCGC GGCGCTGATG AGCATCAACG CGGTCAAGGG CGTCGAGATC GGCGACGGCT TCGCCACCGC GGCGCTGACC GGCGAGGAGA ACGCTGACGA GATGCGGATG GGCAATGCCG GCCCGCAATT TCTGTCGAAC CATGCGGGCG GCATTTTGGG CGGCATCTCC ACGGGGCAGC CGGTGGTGGC GCGGTTCGCG GTGAAGCCGA CCTCGTCGAT CCTGTCGCCG CGCAAGACCA TCGATCGCGC CGGCCACGAC ACCGATATCC TGACCAAGGG CCGCCACGAC CCCTGCGTCG GCATCCGCGC GGTCCCGGTC GGCGAGGCGA TGGTCGCCTG CGTGCTGGCC GATCATCTGC TGCGCCACCG CGGGCAGGTC GGCTAG
Protein sequence | MSFNTFGHMF RVTTFGESHG VAIGCVVDGC PPLIPLTEAD IQGDLDRRRP GQSRFTTQRQ EADQVKILSG VMAHPETGVQ VTTGTPIALL IENTDQRSKD YSEIQNKFRP GHADFTYEAK YGIRDYRGGG RSSARETATR VAAGAVARKV IAGMTVRGAL VQIGPHQIDR DKWDWAEIGN NPFFCPDKDK AAFFADYLDG IRKSGSSIGA VIEVVAEGVP AGLGAPIYAK LDTDLAAALM SINAVKGVEI GDGFATAALT GEENADEMRM GNAGPQFLSN HAGGILGGIS TGQPVVARFA VKPTSSILSP RKTIDRAGHD TDILTKGRHD PCVGIRAVPV GEAMVACVLA DHLLRHRGQV G
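
The sequence fields are printed in space-separated blocks of ten characters. As a rough sketch of how a figure such as the listed GC content could be recomputed from the Gene sequence field, the hypothetical helper `gc_fraction` below strips that grouping and counts G and C bases. It is an illustration only, and it assumes the field contains nothing but bases and whitespace.

```python
def gc_fraction(sequence_field: str) -> float:
    """Return the fraction of G and C bases in a whitespace-grouped sequence."""
    # Remove the spaces that split the sequence into blocks of ten bases.
    bases = "".join(sequence_field.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# The first two blocks of the gene sequence, used as a small example.
print(gc_fraction("ATGTCGTTCA ATACCTTCGG"))  # 0.45
```

Passing the complete Gene sequence field instead of the two-block excerpt gives the value to compare with the GC content reported in the record.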