Gene Information |
Locus tag | Acel_1828 |
Symbol | |
ID | 4485435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 2071477 |
End bp | 2073033 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639730618 |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | YP_873586 |
Protein GI | 117929035 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
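The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS includes its stop codon; both assumptions are consistent with the values in this record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Acel_1828.
length = gene_length_bp(2071477, 2073033)
print(length)                     # 1557
print(protein_length_aa(length))  # 518
```

Both results match the Gene Length (1557 bp) and Protein Length (518 aa) fields listed in the table.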
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.388219 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGCCC GAAACCCGAT CGTCGTCCGG CGCTGGGACG CGGTTCCCGA TCCCGCAGCC GCCTACGACG CCCTCTTCGC GGGCCGGCTT GGCGCGTTCT GGTTGGACGG CGAATATTCC GCTGTGCTGG ACGGCGACCC GATGTCCACG CTGGCGGCCG ATGACCGCCC GGGCTGGGGC TGCTCAATCC TCGGTGCCGC GGACGGACCA TTGGGCGAAT TCCTCAGTTA CGACGTGGAA ACCGGCACCC TCGCGGTCCG GACGGCGCAA ACAACCCGCA CGTATCGCGT GGCGGACATC TTCTCCTACC TCAACGAACA GATCGCGGCA CGACATATCA CCGCGCCGTG CGGCATTCCC GGTGATATCG CCCCCGGCTA CGTCGGATAC CTCGGGTATG AGCTGAAAGC GCTGACCGGC GGCCGTGCCG TCCACCGCTC ACCCTTTCCG GACGCCGCTC TTACCTTCGC GGATCGGCTT CTCCTCCTCG ACCATCACAG CGGAGCAACC TACGCGCTCG CGCTTACCGG CTCCGCCTCC GAGACGTGGT TTGCCCGGGT CGAGCGCGCA CTCGCCGATG CGCCGACCCG TCATGTCCGC GGCCCCCGGA GTCCGTCCCA GCTCGGCCCT GCCTCCGTCC CGACACGGCA GGACGCCGTC GTGAGGCTGG CCGCCCTTGC GGTGACCGAA GAATTCAGCA GCCCGTACCC GGTCACACCC CGCCACGATG CGGCCGCCTA CCGGGCACGC ATCGAGGAGT GCCGTCGCGA CATCGCGGCT GGTGACGCAT ACGAACTCTG TTTGACCACG ATGCTGACCA CGCCGCGGGT CGATCCACTG CGGTTGTTTC ACACGGTCCG AGCCGTCACA CCGGCGCCGT ATGCGGCCCT CCTTGAATTC CCGGACGCCG CTGTGGTCTG CGCATCACCT GAGCTCTTCT GTACGGTCGA CGCCGCCGGA TGGGTCACCT CAGGGCCGAT CAAAGGCACG CGCCGCCGAA GCGCCGATCC CCTCGTCGAC GCCGCACTGC GCCGTGACCT CGCCACCTGC GCCAAAGACC GGGCAGAGAA CGTAATGATT GTTGACTTGC TGCGCAACGA CCTCGGCCGG GTATGCCGGC CCGGAACGAT CGACGTAAGC CGACTCTGCG CGGTCGAGAC GTATCCAACG GTGCATCAAC TCGTCTCCAC CATTCGCGGC CGGCTCAACC CCGACGCCTC GGCGATCGAC GCGGTACGCG CGGCGTTCCC ACCGGGTTCC ATGACCGGTG CGCCGAAAAT CCGGTCCATG GAGATTCTCG ACGCCCGGGA AGGCGGTCCG CGGGGGGTCT ATTCCGGCGC GCTCGGCTGG TTCTCCCTCA CCGGCACGGC CCGCCTCGCT GTCGTCATCC GCACCGCGGT CATCGACGCC GACACGGTTC ACATAGGCAC CGGCGGTGCT ATCACCATCG ATTCCGATCC CGACGCCGAA ATCGCCGAAA CAGCGGCCAA AGCAGCCGGG ATGCTGCACG CCATCCGGCT CAGTCGAAGA CCAGCGGCTC CGTACGGCTC CGTTTGA
|
Protein sequence | MSARNPIVVR RWDAVPDPAA AYDALFAGRL GAFWLDGEYS AVLDGDPMST LAADDRPGWG CSILGAADGP LGEFLSYDVE TGTLAVRTAQ TTRTYRVADI FSYLNEQIAA RHITAPCGIP GDIAPGYVGY LGYELKALTG GRAVHRSPFP DAALTFADRL LLLDHHSGAT YALALTGSAS ETWFARVERA LADAPTRHVR GPRSPSQLGP ASVPTRQDAV VRLAALAVTE EFSSPYPVTP RHDAAAYRAR IEECRRDIAA GDAYELCLTT MLTTPRVDPL RLFHTVRAVT PAPYAALLEF PDAAVVCASP ELFCTVDAAG WVTSGPIKGT RRRSADPLVD AALRRDLATC AKDRAENVMI VDLLRNDLGR VCRPGTIDVS RLCAVETYPT VHQLVSTIRG RLNPDASAID AVRAAFPPGS MTGAPKIRSM EILDAREGGP RGVYSGALGW FSLTGTARLA VVIRTAVIDA DTVHIGTGGA ITIDSDPDAE IAETAAKAAG MLHAIRLSRR PAAPYGSV
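The sequences above are displayed in space-separated blocks of 10 letters, so they need to be joined before any analysis. The following is a minimal sketch of how one might do that and then run two basic checks: GC percentage, and whether the string looks like a complete CDS under translation table 11. All function names are hypothetical, and the start codon set is a simplified subset (ATG, GTG, TTG) assumed for illustration, not the full list of table 11 initiation codons.

```python
# Simplified subset of start codons assumed for this sketch (not the full table 11 list).
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """True if the length is a multiple of 3 and the first and last codons are a start and a stop."""
    return (
        len(dna) >= 6
        and len(dna) % 3 == 0
        and dna[:3] in START_CODONS
        and dna[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence in this record, used as a small input.
fragment = clean_sequence("GTGTCCGCCC GAAACCCGAT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 65.0
```

Applied to the full gene sequence, `looks_like_complete_cds` would be expected to pass, since the record begins with GTG (read as methionine when used as an initiation codon in table 11) and ends with TGA.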