Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_4213 |
Symbol | |
ID | 4024734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4678953 |
End bp | 4681115 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637964419 |
Product | anthranilate synthase |
Protein accession | YP_571331 |
Protein GI | 91978672 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I [COG0512] Anthranilate/para-aminobenzoate synthases component II |
TIGRFAM ID | [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase [TIGR01815] anthranilate synthase, alpha proteobacterial clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.848003 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGGA CCGTTTTCTC GCTACCCAAG CACAGCGACC ACGCCACCGC CGCGGGCCTT GGCGTGTCGC GCACGGTGGA GGCCTTCACC GGCGGCCAGG CGCTCGATGA CCTGATCGAC CTGCTGGATC GTCGCCGCGG CGTGATGCTG TCGTCCGGCA CCACGGTGCC GGGGCGCTAC GAGAGCTTCG ATTTCGGCTT CGCCGATCCG CCGCTGATGC TGACTACCCG AGCCGATCAG TTTTCGATCG AGGCGCTGAA CGCACGCGGC CAGGTGCTGA TCGCGTTCCT GTCCGACCGG CTCGAAGAGC CCTGCGTCGT GGTCGAGCAG GCCTGCGCGA CCAAGATCAG GGGCCACATC GTCCGCGGCG AGGCGCCGGT GGACGAGGAC CAGCGCACCC GCCGCGCCAG CGCGATGTCG CTGGTGCGCG CGCTGGTCGC GGCGTTCGCC TCGCCGGCCG ATCCGATGCT CGGGCTGTAC GGCGCCTTCG CCTATGATCT GGTGTTTCAG ATCGAGGATC TGAAGCAGAA ACGTGCGCGC GAGGCCGACC AGCGCGACAT CGTGCTGTAT GTGCCGGATC GCCTGCTGGC CTACGACCGC GCCACCGGCC GCGGCGTCAA TATCGCCTAT GAATTCGCCT GGAAAGGCAA GTCCAGCTCG GGCCTGTCGA CCGCGACCGC GGAGAGCGTC TACACCCAGA CCGGCCGGCA GGGTTTCGCC GACCATGCGC CGGGCGAATA CGCCAAGGTG GTCGAGACGG CGCGCGAATC CTTCGCCCGC GGCGACCTGT TCGAGGCGGT GCCCGGGCAG TTGTTCGGCG AGCCGTGCGA GCGCTCGCCG GCCGAAGTGT TCAAGCGGCT GTGCCGGATC AACCCGTCTC CTTACGGTGG CCTGCTCAAT CTCGGCGACG GCGAATTCCT GGTGTCGGCC TCGCCGGAAA TGTTCGTCCG CTCCGACGGC CGCCGGGTCG AGACCTGCCC GATCTCCGGC ACCATCGCCC GCGGTGTCGA CGCCATCGCG GACGCCGAGC AGATCCAGAA GCTCCTGAAC TCCGAGAAGG ACGAGTTCGA GCTGAACATG TGCACCGATG TCGATCGCAA CGACAAGGCG CGGGTCTGCG TGCCGGGCAC GATCAAGGTT CTGGCGCGAC GCCAGATCGA GACCTACTCG AAACTGTTCC ACACCGTCGA TCACGTCGAG GGCATGCTGC GGCCCGGCTT CGACTCGCTC GACGCCTTCC TGACCCACGC CTGGGCGGTG ACGGTGACCG GCGCGCCGAA ATTATGGGCG ATGCAGTTCG TCGAGGATCA CGAGCGGACG CCGCGGCGCT GGTACGCCGG CGCATTCGGG GTCGTCGGCT TCGACGGTTC GATCAACACC GGCCTGACCA TCCGCACCAT CCGGATGAAG GACGGCCTCG CCGAGGTCCG TGTCGGCGCG ACCTGTCTGT TCGATTCCGA TCCGGTCGCC GAGGACAAGG AATGCCAGGT CAAGGCCGCG GCGCTGTTCC AGGCGCTGCG CGGCGATCCG CCGAAGCCGC TGTCGGCGGT GGCGCCGGAC GCCACCGGCT CCGGCAAGAA GGTGCTGCTG ATCGATCACG ACGACAGCTT CGTCCACATG CTGGCGGATT ACTTCCGCCA GGTCGGCGCC CAGGTCAGCG TCGTGCGCCA CATCCATGCG CAGAAGATGC TGGCCGACAA TGCCTATGAT CTGCTGGTGC TGTCGCCTGG CCCCGGCCGG CCGGAGGACT TCAAGATCAA GTCCACGATC GACGCCGCGC TGGCGCGGAA GCTGCCGATC TTCGGCGTCT GCCTCGGCGT TCAGGCGATC GGTGAGTACT TTGGCGGCCA TCTCGGCCAG CTCGCGCAGC CGGCGCATGG CCGGCCGTCG CGGATTCAGG TGCGCGGCGG CACGCTGATG AACGGCCTCC CGAACGAGAT CACGATCGGC CGCTATCACT CGCTGTTCGT CGAGATGCAG GACATGCCCG ACACGCTCAA GGTCACCGCC TCGACCGAGG ACGGCATCGC GATGGCGATC GAGCACAAAT CGCTTCCGGT GGGCGGCGTG CAGTTCCACC CGGAGTCGCT GATGTCGCTG GGCGGCCAGG TGGGCCTGCG GATCGTCGAA AACGCCTTCC GCCTCGGCCT GACGCCGAAC TGA
|
Protein sequence | MNRTVFSLPK HSDHATAAGL GVSRTVEAFT GGQALDDLID LLDRRRGVML SSGTTVPGRY ESFDFGFADP PLMLTTRADQ FSIEALNARG QVLIAFLSDR LEEPCVVVEQ ACATKIRGHI VRGEAPVDED QRTRRASAMS LVRALVAAFA SPADPMLGLY GAFAYDLVFQ IEDLKQKRAR EADQRDIVLY VPDRLLAYDR ATGRGVNIAY EFAWKGKSSS GLSTATAESV YTQTGRQGFA DHAPGEYAKV VETARESFAR GDLFEAVPGQ LFGEPCERSP AEVFKRLCRI NPSPYGGLLN LGDGEFLVSA SPEMFVRSDG RRVETCPISG TIARGVDAIA DAEQIQKLLN SEKDEFELNM CTDVDRNDKA RVCVPGTIKV LARRQIETYS KLFHTVDHVE GMLRPGFDSL DAFLTHAWAV TVTGAPKLWA MQFVEDHERT PRRWYAGAFG VVGFDGSINT GLTIRTIRMK DGLAEVRVGA TCLFDSDPVA EDKECQVKAA ALFQALRGDP PKPLSAVAPD ATGSGKKVLL IDHDDSFVHM LADYFRQVGA QVSVVRHIHA QKMLADNAYD LLVLSPGPGR PEDFKIKSTI DAALARKLPI FGVCLGVQAI GEYFGGHLGQ LAQPAHGRPS RIQVRGGTLM NGLPNEITIG RYHSLFVEMQ DMPDTLKVTA STEDGIAMAI EHKSLPVGGV QFHPESLMSL GGQVGLRIVE NAFRLGLTPN
|
| |