Gene Information |
Locus tag | Acel_0478 |
Symbol | |
ID | 4484800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 511729 |
End bp | 513030 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639729245 |
Product | glycosyltransferase family 28 protein |
Protein accession | YP_872237 |
Protein GI | 117927686 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR00661] conserved hypothetical protein |
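
The coordinate and length fields above are consistent with each other, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length_bp(511729, 513030)
print(length)                     # 1302
print(protein_length_aa(length))  # 433
```

Both values match the Gene Length and Protein Length fields of this record.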
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCTGT CACCGGACGA TTTTCTTCAG CCGCTTGACC TCGACTCTAC GCTTCGGTCC GAGGAAGAGC GGGCGTTGTC CACGTTTGCC GGTGGGAGCG CCGGTGGGGT CAACGCGACA ATCTCCGACC CTCTCGTGGT TGCGATATTG ACGTTCTTCG ACGGTGTCGG TCACGTCGTG CGGTCATGCA AGGTCGGTGC TGAATTGGCG GCGCGCGGCC ACAACGTCGT GGTCGGTTGT GCCGAAGGAG CGAAACAAAT CGTCGAGCGC TTCAACCTCA CCCACTGGCC GATACATGAA GTCGCACCGT TACCGCCCGA AGGGCATCCA CCCGGACGCA GCAGGCTCAG CAACCCGCAG TACTTAGCGG CGTGCCTCGA CAGCGAGGCA CGCCTGCTGG CCGCCGTCCG GCCGGCTGTC GTCCTCGTAG ATTTTCGGGT CACGGGAACC GTCAGCAGCG CTGCCGCGGG AGTCCCCTGT GCGTGGCTGG CGAACCTCAG TTTTCTTCAT TACCCGTTGA GCGCTATTTG GCCACAGATC TGCCAGGGAT TCGCAGAGAT CGGTGTGGCT CCTCCGAGCC GTCCGTTTGG TGAGGCGTTG TTGGTCCCCA GCTCGGCATG GCTGGAACCG CTGGACGGCG TACCCGCGGA GGTTCGGCAG TGGGTGACGG GCGTCGTTTC AGACGTCCGC TTCGTTGGTC CGATTCCGAG TATCGACGGG TCGCCTGGGG CGTCCGATAG GCGTGACGCC CGCCGGTCCG AGGAGTACGT CTACGTGACG TTTGGCGGTC TTGCTGCGGG TTCCACGATT GTGCAGAGGC TGCTCCCAGC GCTGGCTGAG TTGGACATGC ACGTGGTGGT CAGCATGGGC CCGCATACCG CCGCCTCGGC ATTGAGCGAC GTTCCCCCGA ATGTGGCGAT CCGGCCTTTC GACAACAACT ATCCGGCTCT GATCCGTAAC GCCAGTGTCG TCGTACACCA CGGCGGTCAC ACCACCCTCA TCGACGCGAT GTACGCGGGA ATTCCGGCGA TATGCCTGCC GCAGCACGAT GAGCAACGTC GAAACGGCCG GCTGCTTGAG GAGCTCGGCA CCGGTAAAGT CGTCGAACTC GACGAGGTCG GCTCGATCGC GCGAGAAATT AAGAATCTGT TGTCTGACAC CGTAGTGCGC GGCAACTGCG AAAAGGTGGC CCGTCTTCTC TCCGTGGAGC GCGGGGCCGC CGATGCTGCG GACGCACTCG AGCGCATCGC TCGTTGGCGT CGCGTCCTCG CAGCAGACAG GCCGTCGGTC TCGGGAGTGT GA
|
Protein sequence | MSLSPDDFLQ PLDLDSTLRS EEERALSTFA GGSAGGVNAT ISDPLVVAIL TFFDGVGHVV RSCKVGAELA ARGHNVVVGC AEGAKQIVER FNLTHWPIHE VAPLPPEGHP PGRSRLSNPQ YLAACLDSEA RLLAAVRPAV VLVDFRVTGT VSSAAAGVPC AWLANLSFLH YPLSAIWPQI CQGFAEIGVA PPSRPFGEAL LVPSSAWLEP LDGVPAEVRQ WVTGVVSDVR FVGPIPSIDG SPGASDRRDA RRSEEYVYVT FGGLAAGSTI VQRLLPALAE LDMHVVVSMG PHTAASALSD VPPNVAIRPF DNNYPALIRN ASVVVHHGGH TTLIDAMYAG IPAICLPQHD EQRRNGRLLE ELGTGKVVEL DEVGSIAREI KNLLSDTVVR GNCEKVARLL SVERGAADAA DALERIARWR RVLAADRPSV SGV
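
The gene sequence starts with GTG while the protein starts with M, which is what translation table 11 (bacterial, archaeal and plant plastid code) allows for an alternative initiation codon. The following is a minimal, dependency-free sketch of how the protein and the GC content could be recomputed from the space-separated sequence blocks shown above. The helper names are hypothetical, and the set of start codons is an assumption based on the NCBI definition of table 11; a library such as Biopython would normally be used instead.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; the first codon is always read as M."""
    s = clean(seq)
    if len(s) == 0 or len(s) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [s[i:i + 3] for i in range(0, len(s), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not an accepted start codon")
    protein = "M" + "".join(CODON_TABLE[c] for c in codons[1:])
    # Drop a single terminal stop symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence from this record.
print(translate_cds("GTGTCCCTGT CACCGGACGA TTTTCTTCAG"))  # MSLSPDDFLQ
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate_cds` gives values that can be compared against the GC content and Protein sequence fields.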