Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4318 |
Symbol | |
ID | 3912131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4901468 |
End bp | 4903630 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637886222 |
Product | anthranilate synthase |
Protein accession | YP_487916 |
Protein GI | 86751420 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I [COG0512] Anthranilate/para-aminobenzoate synthases component II |
TIGRFAM ID | [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase [TIGR01815] anthranilate synthase, alpha proteobacterial clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.717639 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGGA CCGTTTTCTC GCTTCCCGCG CAGAGCGACT ACAAGACCGC CGCGGGCCTC GCCGTGTCGC GCACGGTGGC GGCCTTCACC GGCGGCAGGG CGCTCGATGA TCTGATCGAC CTTCTGGACC GCCGCCGCGG CGTGATGCTG TCGTCCGGCA CTACGGTGCC CGGGCGCTAC GAGAGCTTCG ATTTCGGCTT CGCCGATCCG CCGCTGGCGC TGACCATCCG CGCCGAGCAG TTTTCGATCG AGGCGCTCAA TCCGCGCGGC CATGTGCTGG TCGCGTTCCT GTCCGACAGG CTCGACGAGC CCTGCGTGGT GGTCGAGCAG GCCTGCGCGA CGAAGATCAG GGGCCACATC GTTCGCGGCG AGGCGCCGGT CGATGAGGAG CAGCGCACCC GCCGCGCCAG CGCGATCTCG CTGGTGCGCG CGCTGGTGGC GGCGTTCGCC TCACCGGCGG ACCCGATGCT CGGCCTGTAC GGCGCCTTCG CCTACGATCT GGTGTTCCAG TTCGAGGATC TGGCGCAGAA GCGCGCGCGC GAGGCCGACC AGCGCGACAT CGTGCTGTAT GTGCCGGATC GGCTGTTGGC CTATGACCGC GCCACCGGCC GCGGCGTCAA TATCGCCTAT GAATTCGCCT GGAAGGGCAA ATCCACCCAG GGCCTGCCCA ACGACACCGC CGAGAGCGCC TACACCCAGA CCGGCCGGCA GGGCTTCGCC GACCACGCGC CGGGCGAATA CGCCAAAGTC GTCGAGGTCG CCCGCGAGCA TTTCGCCCGC GGCGATCTGT TCGAGGCGGT GCCCGGGCAG TTGTTCGGCG AACCCTGCGA GCGCTCGCCG GCCGAAGTGT TCAAGCGGCT GTGCCGGATC AATCCGTCGC CCTATGGCGG GTTGCTCAAT CTCGGCGACG GCGAATTCCT GGTGTCGGCC TCGCCGGAGA TGTTCGTCCG CTCCGACGGC CGCCGGATCG AGACCTGCCC GATCTCCGGC ACCATCGCCC GCGGCGTCGA CGCCATTGCG GATGCCGAGC AGATCCAGAA GCTTTTGAAC TCCGAGAAGG ACGAGTTCGA GCTCAACATG TGCACCGACG TCGACCGCAA CGACAAGGCG CGGGTCTGCG TGCCGGGCAC CATCAAGGTG CTGGCGCGGC GCCAGATCGA GACCTATTCG AAGCTGTTCC ACACGGTGGA CCACGTCGAG GGCATGCTCC GCCCCGGCTT CGACGCGCTC GATGCCTTCC TCACCCACGC CTGGGCGGTG ACGGTGACCG GCGCGCCGAA ACTCTGGGCG ATGCAGTTCG TCGAGGATCA CGAGCGCTCG CCGCGGCGCT GGTATGCCGG CGCGTTCGGC GTCGTCGGCT TCGACGGCTC GATCAACACC GGCCTCACCA TCCGTACCAT CCGGATGAAG GACGGCCTCG CCGAAGTCCG CGTCGGGGCG ACTTGTCTGT TCGATAGCGA CCCGGTCGCC GAGGACAAGG AGTGCCAGGT GAAAGCCGCC GCGCTGTTCC AGGCGCTGCG CGGCGATCCG CCGAAGCCGC TGTCGGCGGT GGCACCGGAT GCCACCGGCT CCGGCAAGCG CGTGCTGCTG GTCGATCACG ACGACAGCTT CGTGCACATG CTGGCGGACT ATTTCCGCCA GGTCGGCGCC CAGGTCAGCG TGGTGCGCCA CATCCACGCG CAGAAGATGC TGGCCGAGAA TGCCTATGAT CTGCTGGTGC TGTCGCCCGG CCCGGGCCGG CCGGCGGATT TCAAGATTGC GTTGACGATC GACACCGCTC TGGCGAAGCA ACTGCCGATC TTCGGGGTCT GCCTCGGCGT GCAGGCGATG GGCGAATATT TCGGCGGTAC GCTCGGCCAG CTTAAGCAGC CGGCGCATGG CCGGCCGTCG CGGATCCAGG TGCGCGGCGG CACGCTGATG CACGGCCTGC CGAACGAGAT CACCATCGGT CGCTATCATT CACTCTATGT CGACATGCAG GACATGCCGG ATGCGCTCGA CGTCACCGCC TCGACCGAGG ACGGCATCGC GATGGCGATC GAGCACAAGA CGCTGCCGGT CGGCGGCGTC CAATTCCACC CGGAATCGCT GATGTCGCTC GGCGGCGAGG TCGGGCTGCG GATCGTCGAA AACGCCTTCC GGCTGGGCCG GCCGATGGCC TGA
|
Protein sequence | MNRTVFSLPA QSDYKTAAGL AVSRTVAAFT GGRALDDLID LLDRRRGVML SSGTTVPGRY ESFDFGFADP PLALTIRAEQ FSIEALNPRG HVLVAFLSDR LDEPCVVVEQ ACATKIRGHI VRGEAPVDEE QRTRRASAIS LVRALVAAFA SPADPMLGLY GAFAYDLVFQ FEDLAQKRAR EADQRDIVLY VPDRLLAYDR ATGRGVNIAY EFAWKGKSTQ GLPNDTAESA YTQTGRQGFA DHAPGEYAKV VEVAREHFAR GDLFEAVPGQ LFGEPCERSP AEVFKRLCRI NPSPYGGLLN LGDGEFLVSA SPEMFVRSDG RRIETCPISG TIARGVDAIA DAEQIQKLLN SEKDEFELNM CTDVDRNDKA RVCVPGTIKV LARRQIETYS KLFHTVDHVE GMLRPGFDAL DAFLTHAWAV TVTGAPKLWA MQFVEDHERS PRRWYAGAFG VVGFDGSINT GLTIRTIRMK DGLAEVRVGA TCLFDSDPVA EDKECQVKAA ALFQALRGDP PKPLSAVAPD ATGSGKRVLL VDHDDSFVHM LADYFRQVGA QVSVVRHIHA QKMLAENAYD LLVLSPGPGR PADFKIALTI DTALAKQLPI FGVCLGVQAM GEYFGGTLGQ LKQPAHGRPS RIQVRGGTLM HGLPNEITIG RYHSLYVDMQ DMPDALDVTA STEDGIAMAI EHKTLPVGGV QFHPESLMSL GGEVGLRIVE NAFRLGRPMA
|
| |