Gene Information |
Locus tag | Acel_1071 |
Symbol | |
ID | 4485319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 1181193 |
End bp | 1182650 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639729845 |
Product | anthranilate synthase, component I |
Protein accession | YP_872829 |
Protein GI | 117928278 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
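The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
# Values copied from the record above.
start_bp = 1181193
end_bp = 1182650
gene_length_bp = 1458
protein_length_aa = 485


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus the terminal stop codon.

    Assumes an unspliced, complete CDS (the record lists
    'Is gene spliced | No' and 'Is pseudo gene | No').
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons should hold if the record is internally consistent.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions, 1182650 - 1181193 + 1 gives 1458 bp, and 1458 / 3 - 1 gives 485 codons, which agrees with the listed gene and protein lengths.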
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.14675 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.387124 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGACGC TCGCCACCCG TCCGCCGACC GATTCGGGCA TTCGCCGGCT GACCCGGCGG TTATTGGCGG ACGGCGAAAC CGCAATCGGC CTCTACCGGA AGCTCGCCGG CGACCGTCCG GGGACGTTCC TGCTGGAATC CGCCGAGAGC GGCAGATCCT GGTCGCGTTA TTCCTTCGTC GGGGTGCGCA GCGCCGCTGT GCTGACGGAG CGCGGGGGCC GAGGGCACTG GCTGGGCACG CCGCCGCCCG GCGTTCCCAC CGACGGCGAC CCGCTGGCCG TCCTGGCGGG GACGCTCGAC GCGCTGCGTA CGCCGCGCGA TCCGCAGTTG CCGCCGCTTG CCGGCGGGTT GGTCGGTTTC ATCGGCTACG ACATGGTCCG GCGCCTCGAG CGCCTTCCCC AGACGACAGT CGACGACCTC GGCCTGCCGG AGCTGCTCCT CCTTCTGGTC ACGGACCTTG CCGTCTTGGA CCACTACGAC GGCAGTGTGC TGCTCATCGC CAACGTGTTG CCGGGTGATT CGGTGGACGC CGCGGCGGCG CGGCTTGACG AGATGACGTC CGCGCTGCGG CGTCCCGCAC CATCGACGGT CACCGTGGTC GACCAGGTGC CGGCACGCGA GCCGCTGCGG CGCACCGCCA GCGACGAGTA CTGCGCATGG GTGGAGCGGG CCCGGGAGTA CATCCGGGCC GGCGACATTT TCCAGGTCGT TCTCAGTCAG CGTTTCGAGA TGACGACGAC GGCGTCGGCA TTGGACATTT ACCGGGTGCT GCGGACCCGC AATCCCAGTC CGTATCTTTT TCTCCTGCGG TTCGAGGGTT TCGACGTCGT CGGATCCAGT CCCGAGGCGC ACGTGACCGT GAAGGACGGC CGCGCGACCA TGCACCCGAT CGCTGGCAGC AATCCCCGTG GGGCTACTCC CGAGGAAGAC GCTGCGCTTG CCGCTGCGCT GCTTGCGGAT CCGAAAGAAC GTGCGGAACA CGTGATGCTC GTCGATTTGG CCCGTAATGA CCTGGGCAGG GTGTGTGCGC CCGGCACGGT CGAGGTCGTG GATTTCATGG CGGTCGAGCG CTACAGCCAC ATCATGCACT TGGTCTCCAC CGTGGTTGGG CAGGTTGCGC CGGGACGGAA TGCGCTCGAC GTTCTAACGG CGACATTTCC AGCCGGCACG CTCTCCGGTG CGCCCAAGGT ACGGGCGATG GAAATCATCG AGGAATTGGA GCCGACCCGC CGCGGCCTCT ACGGCGGCGT TGTCGGGTAC GTCGATTTTG CCGGCGACCT CGACACCGCG ATTGCGATTC GTACCGCGTT GCTGCGCGAC GGCACCGTCT ACGTTCAGGC CGGAGCGGGA TTGGTTGCCG ATTCCAACCC GGTGAGAGAA GACCAGGAAT GTTGCAACAA GGCCAACACG GTGCTGTCCG CGGTGACGAT CGCCGAAACG CTGCATCCCG CCGGCTGA
|
Protein sequence | MTTLATRPPT DSGIRRLTRR LLADGETAIG LYRKLAGDRP GTFLLESAES GRSWSRYSFV GVRSAAVLTE RGGRGHWLGT PPPGVPTDGD PLAVLAGTLD ALRTPRDPQL PPLAGGLVGF IGYDMVRRLE RLPQTTVDDL GLPELLLLLV TDLAVLDHYD GSVLLIANVL PGDSVDAAAA RLDEMTSALR RPAPSTVTVV DQVPAREPLR RTASDEYCAW VERAREYIRA GDIFQVVLSQ RFEMTTTASA LDIYRVLRTR NPSPYLFLLR FEGFDVVGSS PEAHVTVKDG RATMHPIAGS NPRGATPEED AALAAALLAD PKERAEHVML VDLARNDLGR VCAPGTVEVV DFMAVERYSH IMHLVSTVVG QVAPGRNALD VLTATFPAGT LSGAPKVRAM EIIEELEPTR RGLYGGVVGY VDFAGDLDTA IAIRTALLRD GTVYVQAGAG LVADSNPVRE DQECCNKANT VLSAVTIAET LHPAG
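The sequence fields can be checked against the GC content and the translation table listed in the record. The sketch below is one way to do this in plain Python, under stated assumptions: the sequence is read in frame from its first base, translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and an alternative initiation codon such as GTG in the first position is read as Met. The helper names `clean`, `gc_content`, and `translate` are hypothetical. The start codon set should be checked against the NCBI genetic code listing before relying on it.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# assumed here to be shared by translation table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons assumed for table 11; each is read as Met at position 0.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as Met regardless of its usual meaning
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence field, copied from the record.
prefix = "GTGACGACGC TCGCCACCCG TCCGCCGACC"
print(translate(prefix))  # MTTLATRPPT
```

Passing the full gene sequence field to `translate` and `gc_content` gives values that can be compared with the protein sequence field and the listed GC content of 67%.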