Gene Information |
Locus tag | RPD_1314 |
Symbol | |
ID | 4021791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1477174 |
End bp | 1478259 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637961507 |
Product | chorismate synthase |
Protein accession | YP_568453 |
Protein GI | 91975794 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
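
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values copied from the record above (RPD_1314).
length = gene_length(1477174, 1478259)
print(length, protein_length(length))  # 1086 361
```

Under these assumptions the computed values agree with the listed gene length (1086 bp) and protein length (361 aa).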
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTTCA ATACATTCGG CCACATGTTT CGCGTCACCA CCTTCGGCGA GAGCCATGGG GTGGCGATCG GTTGCGTGGT CGACGGCTGC CCGCCGCTGA TCCCGCTGAC CGAGGCCGAT ATCCAGGGCG ATCTCGACCG CCGCCGGCCC GGCCAATCGC GCTTCACCAC CCAGCGCCAG GAAGCCGATC AGGTAAAGAT CGTGTCCGGC GTGATGGCGC ATCCGGAGTC CGGTGCGCAG GTCACCACCG GCACGCCGAT CGCGCTGATG ATCGAGAACA CCGACCAGCG CTCGAAGGAC TATTCCGACA TCAAGGACAA GTATCGGCCC GGCCACGCCG ACTTCACCTA TGAGGCCAAA TACGGCATCC GCGACTATCG CGGCGGCGGC CGTTCCTCGG CGCGCGAGAC CGCGAGCCGG GTCGCCGCTG GGGCGATTGC GCGAAAAGTG ATCACCGGCA TGAGTGTGCG CGGCGCGCTG GTGCAGATCG GGCCGCACAA GATCGATCGC GAGAAGTGGG ATTGGGACGA GATCGGCAAC AATCCGTTCT TCTGCCCCGA TAAGGACGCC GCCTCGGTGT GGGAGGCCTA TCTCGACGGC ATCCGGAAGA GCGGCTCGTC GATCGGCGCG GTGATCGAGG TGATCGCCGA GGGCGTGCCC GCCGGGCTCG GCGCGCCGAT CTACGCCAAG CTCGACGGCG ACATCGCCGC GGCGCTGATG AGCATCAACG CGGTCAAGGG CGTCGAGATC GGCGACGGCT TTGCCACCGC CGCGCTGACC GGCGAGGAGA ACGCTGACGA GATGCGGATG GGCAATCACG GCCCAGCGTT TCTCTCGAAC CACGCCGGCG GCATTCTCGG CGGCATCTCC ACCGGCCAGC CGGTGGTGGC GCGGTTCGCG GTGAAGCCGA CCTCGTCGAT CCTGTCGCCG CGCAGGACCG TCGATCGCGA AGGCCATGAC ACCGACATCC TCACCAAGGG CCGTCACGAC CCCTGCGTCG GTATCCGCGC GGTGCCGGTC GGCGAGGCGA TGGTCGCCTG CGTGCTGGCC GATCATCTGC TGCGCCATCG CGGCCAGGTG GGCTAG
|
Protein sequence | MSFNTFGHMF RVTTFGESHG VAIGCVVDGC PPLIPLTEAD IQGDLDRRRP GQSRFTTQRQ EADQVKIVSG VMAHPESGAQ VTTGTPIALM IENTDQRSKD YSDIKDKYRP GHADFTYEAK YGIRDYRGGG RSSARETASR VAAGAIARKV ITGMSVRGAL VQIGPHKIDR EKWDWDEIGN NPFFCPDKDA ASVWEAYLDG IRKSGSSIGA VIEVIAEGVP AGLGAPIYAK LDGDIAAALM SINAVKGVEI GDGFATAALT GEENADEMRM GNHGPAFLSN HAGGILGGIS TGQPVVARFA VKPTSSILSP RRTVDREGHD TDILTKGRHD PCVGIRAVPV GEAMVACVLA DHLLRHRGQV G
|
| |
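
The GC content and protein sequence listed above can in principle be recomputed from the gene sequence. The following is a minimal sketch using only the Python standard library, not the method the database itself uses. It assumes the sequence is given in blocks separated by whitespace, as in this record, and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these codon assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: the first codon is ATG, as in this record. Alternative
    start codons allowed by table 11 (for example GTG) are not
    special-cased here.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTCCTTCA ATACATTCGG CCACATGTTT"))  # MSFNTFGHMF
```

The ten residues printed match the first block of the protein sequence in the record. Passing the full gene sequence to `gc_percent` and `translate` would allow the listed GC content and protein sequence to be checked in the same way.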