Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1142 |
Symbol | |
ID | 3969537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1241910 |
End bp | 1242998 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637924252 |
Product | chorismate synthase |
Protein accession | YP_531024 |
Protein GI | 90422654 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.579597 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.271094 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTCA ACACCTTCGG CCACATGTTT CGCGTCACCA CCTTCGGCGA GAGCCATGGC GTGGCGATCG GCTGCGTGGT CGACGGCTGT CCGCCCTTGA TCGCGCTTAC CGAGGCCGAC ATCCAGCGCG ACCTCGACCG CAGGCGGCCG GGGCAGTCGC GCTTCACCAC CCAGCGCCAG GAAGCCGACC AGGTGAAGAT CCTGTCCGGG GTGATGGTGC ATCCGCAGAG CGGCTTGCAG GTCACCACCG GCGCGCCGAT CGCGCTCTTG ATCGAGAACA CCGACCAGCG CTCGAAAGAC TATTCCGAGA TCAAGGACAA GTTTCGCCCC GGCCACGCCG ACTTCACCTA TGAGGCGAAA TACGGCATCC GCGATTATCG CGGCGGCGGC CGTTCCTCGG CGCGCGAGAC CGCGACCCGC GTCGCCGCCG GTGCGATCGC CCGCAAAGTG GTGCCCGGCA TCACCGTGCG CGCCGCTTTG GTGCAGATGG GGCCGCACCA GATCGACCGC GACAACTGGG ATTGGGAGGA GGTCGGCAAC AATCCGTTCT TCTGCCCGGA CAAGGACAAG GCGAAATTCT TCGAGGACTA TCTCGACGGC ATCCGCAAGA ACGGCTCCTC GATCGGCGCG GTGATCGAGG TGGTCGCCGA CGGCGTGCCG GCGGGGTGGG GCGCGCCGAT CTACGCCAAG CTCGACACCG ACATCGCCGC GGCGCTGATG AGCATCAACG CGGTGAAGGG CGTCGAGATC GGCGACGGCT TCGCCACCGC AGCACTCACC GGCGAGCAGA ACGCCGACGA AATGCGCGCC GGCAATGATG GCCCGAGCTT CCTGTCGAAC CACGCCGGCG GCATTTTGGG CGGCATCTCC ACCGGGCAGC CGGTGGTGGC GCGGTTTGCG GTGAAGCCGA CCTCCTCGAT CCTGGCGCCG CGCAAGACCG TGGATCGCGA CGGCCACGAC ACCGACATTC TCACCAAGGG CCGCCACGAC CCCTGCGTCG GCATCCGCGC GGTGTCGGTG GCCGAAGCCA TGGTCGCCTG CGTGCTCGCC GATCACCTGA TCCGCCACCG CGGCCAGATC GGCGGGTAG
|
Protein sequence | MSFNTFGHMF RVTTFGESHG VAIGCVVDGC PPLIALTEAD IQRDLDRRRP GQSRFTTQRQ EADQVKILSG VMVHPQSGLQ VTTGAPIALL IENTDQRSKD YSEIKDKFRP GHADFTYEAK YGIRDYRGGG RSSARETATR VAAGAIARKV VPGITVRAAL VQMGPHQIDR DNWDWEEVGN NPFFCPDKDK AKFFEDYLDG IRKNGSSIGA VIEVVADGVP AGWGAPIYAK LDTDIAAALM SINAVKGVEI GDGFATAALT GEQNADEMRA GNDGPSFLSN HAGGILGGIS TGQPVVARFA VKPTSSILAP RKTVDRDGHD TDILTKGRHD PCVGIRAVSV AEAMVACVLA DHLIRHRGQI GG
|
| |