Gene Acel_1071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1071 
Symbol 
ID4485319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1181193 
End bp1182650 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content67% 
IMG OID639729845 
Productanthranilate synthase, component I 
Protein accessionYP_872829 
Protein GI117928278 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.14675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.387124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACGC TCGCCACCCG TCCGCCGACC GATTCGGGCA TTCGCCGGCT GACCCGGCGG 
TTATTGGCGG ACGGCGAAAC CGCAATCGGC CTCTACCGGA AGCTCGCCGG CGACCGTCCG
GGGACGTTCC TGCTGGAATC CGCCGAGAGC GGCAGATCCT GGTCGCGTTA TTCCTTCGTC
GGGGTGCGCA GCGCCGCTGT GCTGACGGAG CGCGGGGGCC GAGGGCACTG GCTGGGCACG
CCGCCGCCCG GCGTTCCCAC CGACGGCGAC CCGCTGGCCG TCCTGGCGGG GACGCTCGAC
GCGCTGCGTA CGCCGCGCGA TCCGCAGTTG CCGCCGCTTG CCGGCGGGTT GGTCGGTTTC
ATCGGCTACG ACATGGTCCG GCGCCTCGAG CGCCTTCCCC AGACGACAGT CGACGACCTC
GGCCTGCCGG AGCTGCTCCT CCTTCTGGTC ACGGACCTTG CCGTCTTGGA CCACTACGAC
GGCAGTGTGC TGCTCATCGC CAACGTGTTG CCGGGTGATT CGGTGGACGC CGCGGCGGCG
CGGCTTGACG AGATGACGTC CGCGCTGCGG CGTCCCGCAC CATCGACGGT CACCGTGGTC
GACCAGGTGC CGGCACGCGA GCCGCTGCGG CGCACCGCCA GCGACGAGTA CTGCGCATGG
GTGGAGCGGG CCCGGGAGTA CATCCGGGCC GGCGACATTT TCCAGGTCGT TCTCAGTCAG
CGTTTCGAGA TGACGACGAC GGCGTCGGCA TTGGACATTT ACCGGGTGCT GCGGACCCGC
AATCCCAGTC CGTATCTTTT TCTCCTGCGG TTCGAGGGTT TCGACGTCGT CGGATCCAGT
CCCGAGGCGC ACGTGACCGT GAAGGACGGC CGCGCGACCA TGCACCCGAT CGCTGGCAGC
AATCCCCGTG GGGCTACTCC CGAGGAAGAC GCTGCGCTTG CCGCTGCGCT GCTTGCGGAT
CCGAAAGAAC GTGCGGAACA CGTGATGCTC GTCGATTTGG CCCGTAATGA CCTGGGCAGG
GTGTGTGCGC CCGGCACGGT CGAGGTCGTG GATTTCATGG CGGTCGAGCG CTACAGCCAC
ATCATGCACT TGGTCTCCAC CGTGGTTGGG CAGGTTGCGC CGGGACGGAA TGCGCTCGAC
GTTCTAACGG CGACATTTCC AGCCGGCACG CTCTCCGGTG CGCCCAAGGT ACGGGCGATG
GAAATCATCG AGGAATTGGA GCCGACCCGC CGCGGCCTCT ACGGCGGCGT TGTCGGGTAC
GTCGATTTTG CCGGCGACCT CGACACCGCG ATTGCGATTC GTACCGCGTT GCTGCGCGAC
GGCACCGTCT ACGTTCAGGC CGGAGCGGGA TTGGTTGCCG ATTCCAACCC GGTGAGAGAA
GACCAGGAAT GTTGCAACAA GGCCAACACG GTGCTGTCCG CGGTGACGAT CGCCGAAACG
CTGCATCCCG CCGGCTGA
 
Protein sequence
MTTLATRPPT DSGIRRLTRR LLADGETAIG LYRKLAGDRP GTFLLESAES GRSWSRYSFV 
GVRSAAVLTE RGGRGHWLGT PPPGVPTDGD PLAVLAGTLD ALRTPRDPQL PPLAGGLVGF
IGYDMVRRLE RLPQTTVDDL GLPELLLLLV TDLAVLDHYD GSVLLIANVL PGDSVDAAAA
RLDEMTSALR RPAPSTVTVV DQVPAREPLR RTASDEYCAW VERAREYIRA GDIFQVVLSQ
RFEMTTTASA LDIYRVLRTR NPSPYLFLLR FEGFDVVGSS PEAHVTVKDG RATMHPIAGS
NPRGATPEED AALAAALLAD PKERAEHVML VDLARNDLGR VCAPGTVEVV DFMAVERYSH
IMHLVSTVVG QVAPGRNALD VLTATFPAGT LSGAPKVRAM EIIEELEPTR RGLYGGVVGY
VDFAGDLDTA IAIRTALLRD GTVYVQAGAG LVADSNPVRE DQECCNKANT VLSAVTIAET
LHPAG