Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_4388 |
Symbol | |
ID | 3970442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4892572 |
End bp | 4894734 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637927497 |
Product | anthranilate synthase |
Protein accession | YP_534230 |
Protein GI | 90425860 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I [COG0512] Anthranilate/para-aminobenzoate synthases component II |
TIGRFAM ID | [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase [TIGR01815] anthranilate synthase, alpha proteobacterial clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0144881 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGAA CCGTATTCGC GCTGCCCGCG CGCAGCGATT ACGCCACCGC CGGCGGGCTT TCGGTGACGC GCAGCGTGCA GCAATTCACC GGCGGCGACG CGCTCGACAA TCTGATCGAC CTGCTCGACC ACCGCCCCGG CGTGATGCTG TCGTCCGGCA CCACGGTGCC CGGCCGCTAC GAGAGCTTCG ACCTCGGCTT CGCCGATCCG CCGCTGCGGC TGGTTTCCAC CGGCGACAAC TTCGCGTTGA CCGCGCTGAA CGCCCGCGGC GAGGTGCTGC TGGCGTTCCT CGCCGCCACC CTGCAGGAGC CCTGCGTGGT GATCGACAAA GTCGCCGGCC GCCAGATCGA CGGCCATATC ATCCGCGGCG AAGCGCCGGT CGACGAGGAC CAGCGCACCA GGCGCGCCAG CATGATGTCG CTGGTGCGGG CGCTGGTGGC GGCGTTCGCC TCGCCGGGCG ATCCGATGCT CGGGCTGTTC GGCGCCTTCG CCTATGACCT GGTGTTTCAG TTCGAAGATC TGAAGCCAAA GCGCGCCCGC GAAGCCGACC AGCGCGACAT CGTGCTGTAC GTCCCGGACC GGCTGTTGGC CTATGACCGC GCCACCGGCC GCGGCGTGCA TCTGGCCTAC GAATTCTCCT GGAACGGCCG CTCCACCGAG GGGCTGTCGC ACGACACCCC GGACAGCGTC TACGCCAAGA GCCCACGGCA GGGCTTTGCC GATCACGCCC CCGGCGAATA TCAGGCCACG GTGGAGGTCG CCCGCGCGGC GTTCGCCCGC GGCGATCTGT TCGAGGCGGT GCCCGGGCAA TTGTTCGCCG AGCCGTGCGA GCGCTCGCCG GCGGAAGTAT TTCAGCGGCT CTGCCGGATC AATCCGTCGC CCTACGGCGC CTTGATGAAT CTCGGCGACG GCGAATTTTT GGTGGCGGCG TCGCCGGAAA TGTTCGTGCG CTCGGACGGG CGCCGCATCG AGACCTGCCC GATTTCCGGC ACCATCGCGC GCGGCGTCGA CGCCATCGGC GACGCCGAGC AGATCCGGCA ATTGCTGAAT TCGGAGAAAG ACGAATTCGA GCTCAACATG TGCACCGACG TCGACCGCAA CGACAAAGCG CGGGTCTGCG TGCCGGGCAC AATTAAAGTT CTCGCGCGCC GCCAGATCGA AACCTATTCA AAACTGTTCC ACACCGTCGA CCACGTCGAG GGCATGTTGC GGCCGGGCTT TGATGCGCTC GACGCCTTCC TGACCCACGC CTGGGCGGTG ACGGTGACCG GGGCGCCGAA ATTATGGGCG ATGCAGTTCG TCGAGGACCA CGAGCGCTCG AGCCGCCGCT GGTACGCCGG CGCGATCGGC TGCGTGAATT TCGACGGCAG CATCAACACC GGGCTGACCA TCCGCACCAT CCGGATGAAG GACGGCCTCG CCGAAGTCCG CGTCGGCGCC ACGCTATTGT TCGATTCCGA TCCGGTCGCC GAAGAGAAGG AATGTCAGAC CAAGGCCGCG GCGCTGTTCC AGGCGCTGCG CGGCGATCCG CCGAAGCGGC TGTCGGCGCT GGCGCCGGAC GCCTCGGGCT CCGGCAAGAA GGTGCTGCTG ATCGATCACG ACGACAGCTT CGTGCACATG CTGGCGGATT ACTTCCGCCA GGTCGGCGCT CAGGTCACCG TGGTGCGCTA CATCCACGCG CTGCCGATGC TGGCGAACAA CGACTACGAT CTGCTGGTGC TGTCGCCCGG CCCCGGCCGG CCGGAGGACT TCAAGATCAA GGCGACGATC GACGCTGGGC TGCAGAAGAA CATGCCAATC TTCGGGGTGT GCCTGGGCGT GCAGGCGATG GGCGAGTATT TCGGCGGCCA GCTCGGGCAA TTGGCGCAGC CGGCGCATGG CCGGCCGTCG AAGATCCAGG TCCGCGGCGG CACGCTGATG CGCGGCCTGC CGGACGAGAT CGTGATCGGC CGCTATCACT CGCTCTATGT CGAGCAGGAC AGCATGCCCG AGGTGTTGGC CGTCACCGCC GCCACCGAGG ACGGCATCGC CATGGTGATC GAGCACAAGA CCTTGCCGGT CGGCGGCGTG CAGTTTCATC CCGAATCGCT GATGTCGCTC GGTGGCGAGG TCGGCCTGCG GATCGTCGAA AACGCCTTCC GGCTTGGTCT TCCGGCCAAT TGA
|
Protein sequence | MNRTVFALPA RSDYATAGGL SVTRSVQQFT GGDALDNLID LLDHRPGVML SSGTTVPGRY ESFDLGFADP PLRLVSTGDN FALTALNARG EVLLAFLAAT LQEPCVVIDK VAGRQIDGHI IRGEAPVDED QRTRRASMMS LVRALVAAFA SPGDPMLGLF GAFAYDLVFQ FEDLKPKRAR EADQRDIVLY VPDRLLAYDR ATGRGVHLAY EFSWNGRSTE GLSHDTPDSV YAKSPRQGFA DHAPGEYQAT VEVARAAFAR GDLFEAVPGQ LFAEPCERSP AEVFQRLCRI NPSPYGALMN LGDGEFLVAA SPEMFVRSDG RRIETCPISG TIARGVDAIG DAEQIRQLLN SEKDEFELNM CTDVDRNDKA RVCVPGTIKV LARRQIETYS KLFHTVDHVE GMLRPGFDAL DAFLTHAWAV TVTGAPKLWA MQFVEDHERS SRRWYAGAIG CVNFDGSINT GLTIRTIRMK DGLAEVRVGA TLLFDSDPVA EEKECQTKAA ALFQALRGDP PKRLSALAPD ASGSGKKVLL IDHDDSFVHM LADYFRQVGA QVTVVRYIHA LPMLANNDYD LLVLSPGPGR PEDFKIKATI DAGLQKNMPI FGVCLGVQAM GEYFGGQLGQ LAQPAHGRPS KIQVRGGTLM RGLPDEIVIG RYHSLYVEQD SMPEVLAVTA ATEDGIAMVI EHKTLPVGGV QFHPESLMSL GGEVGLRIVE NAFRLGLPAN
|
| |