Gene Acel_1828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1828 
Symbol 
ID4485435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2071477 
End bp2073033 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content67% 
IMG OID639730618 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_873586 
Protein GI117929035 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.388219 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGCCC GAAACCCGAT CGTCGTCCGG CGCTGGGACG CGGTTCCCGA TCCCGCAGCC 
GCCTACGACG CCCTCTTCGC GGGCCGGCTT GGCGCGTTCT GGTTGGACGG CGAATATTCC
GCTGTGCTGG ACGGCGACCC GATGTCCACG CTGGCGGCCG ATGACCGCCC GGGCTGGGGC
TGCTCAATCC TCGGTGCCGC GGACGGACCA TTGGGCGAAT TCCTCAGTTA CGACGTGGAA
ACCGGCACCC TCGCGGTCCG GACGGCGCAA ACAACCCGCA CGTATCGCGT GGCGGACATC
TTCTCCTACC TCAACGAACA GATCGCGGCA CGACATATCA CCGCGCCGTG CGGCATTCCC
GGTGATATCG CCCCCGGCTA CGTCGGATAC CTCGGGTATG AGCTGAAAGC GCTGACCGGC
GGCCGTGCCG TCCACCGCTC ACCCTTTCCG GACGCCGCTC TTACCTTCGC GGATCGGCTT
CTCCTCCTCG ACCATCACAG CGGAGCAACC TACGCGCTCG CGCTTACCGG CTCCGCCTCC
GAGACGTGGT TTGCCCGGGT CGAGCGCGCA CTCGCCGATG CGCCGACCCG TCATGTCCGC
GGCCCCCGGA GTCCGTCCCA GCTCGGCCCT GCCTCCGTCC CGACACGGCA GGACGCCGTC
GTGAGGCTGG CCGCCCTTGC GGTGACCGAA GAATTCAGCA GCCCGTACCC GGTCACACCC
CGCCACGATG CGGCCGCCTA CCGGGCACGC ATCGAGGAGT GCCGTCGCGA CATCGCGGCT
GGTGACGCAT ACGAACTCTG TTTGACCACG ATGCTGACCA CGCCGCGGGT CGATCCACTG
CGGTTGTTTC ACACGGTCCG AGCCGTCACA CCGGCGCCGT ATGCGGCCCT CCTTGAATTC
CCGGACGCCG CTGTGGTCTG CGCATCACCT GAGCTCTTCT GTACGGTCGA CGCCGCCGGA
TGGGTCACCT CAGGGCCGAT CAAAGGCACG CGCCGCCGAA GCGCCGATCC CCTCGTCGAC
GCCGCACTGC GCCGTGACCT CGCCACCTGC GCCAAAGACC GGGCAGAGAA CGTAATGATT
GTTGACTTGC TGCGCAACGA CCTCGGCCGG GTATGCCGGC CCGGAACGAT CGACGTAAGC
CGACTCTGCG CGGTCGAGAC GTATCCAACG GTGCATCAAC TCGTCTCCAC CATTCGCGGC
CGGCTCAACC CCGACGCCTC GGCGATCGAC GCGGTACGCG CGGCGTTCCC ACCGGGTTCC
ATGACCGGTG CGCCGAAAAT CCGGTCCATG GAGATTCTCG ACGCCCGGGA AGGCGGTCCG
CGGGGGGTCT ATTCCGGCGC GCTCGGCTGG TTCTCCCTCA CCGGCACGGC CCGCCTCGCT
GTCGTCATCC GCACCGCGGT CATCGACGCC GACACGGTTC ACATAGGCAC CGGCGGTGCT
ATCACCATCG ATTCCGATCC CGACGCCGAA ATCGCCGAAA CAGCGGCCAA AGCAGCCGGG
ATGCTGCACG CCATCCGGCT CAGTCGAAGA CCAGCGGCTC CGTACGGCTC CGTTTGA
 
Protein sequence
MSARNPIVVR RWDAVPDPAA AYDALFAGRL GAFWLDGEYS AVLDGDPMST LAADDRPGWG 
CSILGAADGP LGEFLSYDVE TGTLAVRTAQ TTRTYRVADI FSYLNEQIAA RHITAPCGIP
GDIAPGYVGY LGYELKALTG GRAVHRSPFP DAALTFADRL LLLDHHSGAT YALALTGSAS
ETWFARVERA LADAPTRHVR GPRSPSQLGP ASVPTRQDAV VRLAALAVTE EFSSPYPVTP
RHDAAAYRAR IEECRRDIAA GDAYELCLTT MLTTPRVDPL RLFHTVRAVT PAPYAALLEF
PDAAVVCASP ELFCTVDAAG WVTSGPIKGT RRRSADPLVD AALRRDLATC AKDRAENVMI
VDLLRNDLGR VCRPGTIDVS RLCAVETYPT VHQLVSTIRG RLNPDASAID AVRAAFPPGS
MTGAPKIRSM EILDAREGGP RGVYSGALGW FSLTGTARLA VVIRTAVIDA DTVHIGTGGA
ITIDSDPDAE IAETAAKAAG MLHAIRLSRR PAAPYGSV