Gene Information |
Locus tag | Hhal_0086 |
Symbol | |
ID | 4710517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 100258 |
End bp | 101337 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639854544 |
Product | anthranilate phosphoribosyltransferase |
Protein accession | YP_001001683 |
Protein GI | 121996896 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR01245] anthranilate phosphoribosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGG AGGAGGCGCT ACGCCGCTTC GGTCGGATGA TCAGCCGGGT GGTGTCCGGG GCGAGGCTCT CCCGCGAGGA AGCGGCGGAG GCGTATCGGC AGGTCATCCT GAACGAACAG CCCGAGTTGC AGCAGGGCGC CTTTCTGGCG GCCCACCAAG CCCGCGGACC GACGACCGAG GAGTTCTCCG GTGCCTGGGA TGCCCTCGAC GCCTACGACA CGGCGAAGAT CCACCCGCAG ATCGAAGGGC CTGTGGGGGA TATCGTGGGC ACTGGCTCGG ACTCCCTGAA GACCGTCAAC GCGTCGACCC CCGCGGCGCT CATCGCTGCC GGGTGTGGGT TGCCAGTGGC GAAGAAGGGG GCCCGGCTGG TGACCGGAGT ATCCGGGGCG TCGGACATCT TCGAGCGCCT CGGCGTGGAT TTGGAGGCCC CGCTGGAGCG CGCCGAGGCC TGCCTGGAAG CCCACGGGAT CGGCTACCTG CCCGGCGAGG CCTTCCTGCG GGCCGGGTGG GCGCGGCTGA TCCAGCGCAT GCGTTTTACT TCCGCCTTCA ACTTCATCGG GCCGCTGGCC ATGCCCTGCG CCAGGACCAG CCATGTCGTC ATCGGCGCCT ACTCCCCGCG GCTGTGTGAT CAGCTGGTCG CCATCCTGCG CGAGATCGGT ATGCCGGCGG CGCTCGCGCC CTTCGGCCGC GCCGAGGGCG AGGATCCGCA GCTGGGGATG GATGAGATCT CGCCGTGCGG GCCGACGCGG CTGGTTCTGC TCAAGAGTGG GTGGATCGAT ACCTTCGAAG TGACCCCGGC GGATTTCGGT CTGCGGACTC GGTCCCTGGG CGAGGTCGCC AGCAGCAAGC GGGCCGGGGA TAACGCGCAA CGCATCCTAG CGACCCTGGA GGGCCGCTAC GACACACCGG AGGCCGACTT CTTCGCCATG AACGCCGCCG CGCTTCTCTG GCTTGCCGAT CAGGCCCCCA ACCTCGCCCG GGCCACCGAT CAGGCGAAGG AGGCGCTGGC TACGGGGCGC GCGCTGGACA AGCTCGACGC CCTGCGGCGT GTCCAGGGGG TGGAAGGGGC TGTCGCTTAG
|
Protein sequence | MSEEEALRRF GRMISRVVSG ARLSREEAAE AYRQVILNEQ PELQQGAFLA AHQARGPTTE EFSGAWDALD AYDTAKIHPQ IEGPVGDIVG TGSDSLKTVN ASTPAALIAA GCGLPVAKKG ARLVTGVSGA SDIFERLGVD LEAPLERAEA CLEAHGIGYL PGEAFLRAGW ARLIQRMRFT SAFNFIGPLA MPCARTSHVV IGAYSPRLCD QLVAILREIG MPAALAPFGR AEGEDPQLGM DEISPCGPTR LVLLKSGWID TFEVTPADFG LRTRSLGEVA SSKRAGDNAQ RILATLEGRY DTPEADFFAM NAAALLWLAD QAPNLARATD QAKEALATGR ALDKLDALRR VQGVEGAVA
|
| |
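The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is not part of the original record: it assumes the start and end positions are inclusive, and that the final codon of this non-spliced CDS is the stop codon (the gene sequence above ends in TAG). The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 100258
end_bp = 101337
protein_length_aa = 359

# Assumption: both coordinates are inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1080
print(remainder)                # 0
print(expected_protein_length)  # 359
```

Under these assumptions the computed values agree with the listed Gene Length (1080 bp) and Protein Length (359 aa).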