Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2962 |
Symbol | |
ID | 5734834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3736977 |
End bp | 3738494 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280106 |
Product | anthranilate synthase component I |
Protein accession | YP_001545728 |
Protein GI | 159899481 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00182854 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCGC CAACATTTGA ACAAGTGCAA GCATGGGCTG CGGCTGGCTA CACCCAATGT GCAGTCTATC GTGAGTTAGT CGCCGATTTG GAAACGCCAG TGTCGGCGTA CTTGAAGGTT GCTCAGGGTC ATTATAGTTT CTTGCTTGAA AGTGTTGAGG GTGGCGAGCA AATTGGTCGC TATTCATTTA TTGGCTGTGA GCCTCATTTA ATTATTCGTG GCCTTGGCCA GCAAAGTATT ATTGAAACTG CCAATGGCGA ACGTACCAGC TTTGATGATC TGACGACGCT TGATCAATTA GAACGCTTGG TGGTTGGCAC GCAACGGGCT AACCCTGCGC CTCAACCTGA TTTGCCGCGC TTTACGGGCG GGGCGGTTGG CTTTTTGGGC TATGAAACTG TGCGCACGTT TGAACGTTTG CCCGCGCCAA CCCTGCGCCC ATTGCAAATT CCCGATGGCG TGTGGATGGT GGTCAAAACG GTTTTGGCTT TCGACCATGT GCGCCATACG ATCAAAATTA TGAGCACACT GGTGTTTGAT TCGGCGGTAG ATTTGGCAAC TCAATTTGCC GAAGCCAACC AAGCGATCGA AGCCATGACC AAAAAATTGG TACGGCCATT AGCACCCGAA GTCTATAGCT CAAGCGCCGC CTCGCCCAGC TTACCCGAAC TCAACGAGCA ATTGCAATCA AATCAATCGT TTAGCGAATT TAGCACCGCG ATTGAAAAGG CTCGCGAATA TATTCGGGCT GGCGATATTT TTCAAGTGGT GTTATCGCAG CGTTTTCAGC GTGAAACCGA TGCCGAGCCA TTTGCGGTCT ATCGAGCACT GCGCACGGTC AATCCATCAC CATACATGTT CTTTTTGAAT GTGCCTGATG CGGCGATTAT TGGGGCATCG CCCGAAATGT TGGTGCGGGT TGAAGATGGC ATTATTGAGA CCCACCCAAT TGCTGGTACG CGCCGCCGTG GCCGCGATGC CGATGACGAA GCTCGCATGC AGGCTGAATT ATTAGCCGAC GAAAAAGAAC GGGCTGAGCA TTTGATGCTG GTTGATCTTG GGCGCAACGA TGTGGGGCGG GTTTCGCTGC CTGGCACCGT CCACGTGCCC AAATTTATGC AAATTGAAAA ATATTCGCAT GTGATGCACT TAGTTTCGGT GGTCAAAGGC ACGCTGGATA CCTCGCGCTA CTCGCCATTG CATGCCTTAC GCGCCTGCTT CCCTGCTGGC ACGCTGACTG GTGCGCCCAA AGTGCGGGCC ATGGAAATTA TCGCTGAATT AGAGCCAAGC CAACGCGGGC CATATGGTGG TTGCGTTGGC TATGTCTCGT TTGGCGGGTT GTCGCTTGAC ACAGCGATTA CTATTCGCAC AATGGTTATC AAAGATGGCG TAGCCTATAT GCAAGCTGGC GCGGGGATTG TCGCCGATAG CGATGTTAAA TTGGAAGATC TCGAAACCCG CAACAAAGCT GGTTCGCTGA TTCGCGCCTT GCACGTCGCC GAGATGTTGG AGTTGTAA
|
Protein sequence | MASPTFEQVQ AWAAAGYTQC AVYRELVADL ETPVSAYLKV AQGHYSFLLE SVEGGEQIGR YSFIGCEPHL IIRGLGQQSI IETANGERTS FDDLTTLDQL ERLVVGTQRA NPAPQPDLPR FTGGAVGFLG YETVRTFERL PAPTLRPLQI PDGVWMVVKT VLAFDHVRHT IKIMSTLVFD SAVDLATQFA EANQAIEAMT KKLVRPLAPE VYSSSAASPS LPELNEQLQS NQSFSEFSTA IEKAREYIRA GDIFQVVLSQ RFQRETDAEP FAVYRALRTV NPSPYMFFLN VPDAAIIGAS PEMLVRVEDG IIETHPIAGT RRRGRDADDE ARMQAELLAD EKERAEHLML VDLGRNDVGR VSLPGTVHVP KFMQIEKYSH VMHLVSVVKG TLDTSRYSPL HALRACFPAG TLTGAPKVRA MEIIAELEPS QRGPYGGCVG YVSFGGLSLD TAITIRTMVI KDGVAYMQAG AGIVADSDVK LEDLETRNKA GSLIRALHVA EMLEL
|
| |