Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0460 |
Symbol | |
ID | 4484781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 493087 |
End bp | 496224 |
Gene Length | 3138 bp |
Protein Length | 1045 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639729227 |
Product | glycosyl transferase family protein |
Protein accession | YP_872220 |
Protein GI | 117927669 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0791075 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACC GACGTCCGTA TCTCCCGCAT CCGCGGCACG TCGTGACGGC GATCCTCGTC GCCCACGACG GCGGTCGGTG GCTGCCCACC ACGCTGCAAG CGGTGAAGAT GCAGCGGCGG CCGGTGCAAC GCTTCGTCGC CGTGGACACC GGTAGCCGCG ACGACACCCG CCAACTCCTC GAGGAATCGG TCGGGGCGGC CAGCGTCCTC TCCGCGCCGC GGACCGTCGG TTTCGGGGAC GCCGTCCACC AGGCCGTCGC CGCGTTCGCC GGTGTGCCCG GCTGGGCGGC GAGTCAGACG GACGCCGGAT CCCCCGTCGA GTGGCTCTGG CTCCTGCACG ACGACTCGGC ACCCGATCCA GCCGCTCTGG ACGCGATGCT TGCCCTTGCC GACGAGATGC CGTCCGCCGC GGTCATCGGG CCGAAGATCG TCGACTGGGA GCGGCCCGGG GTGCTCCGGG AGGTCGGCTT CACCGTCGAC CGAGGCGGCC ATCGCCAGAC CGGATTGGAA CCCGACGAGC TCGATCACGG CCAGCATGAC GGCGACCGCG ACGTCTTCGC GGTCAGCAGC GCCGGCATGC TGATCCGGCG GGATCTGTGG GACCTGCTTG GCGGTTTTGA CCGGAACTTT CCGCTGCTGC GCGACGATCT CGACTTCTGC TGGCGGGCGC ACCTCGCCGG CGAGCGGGTC GTCGTCTGCA CGCGTGCCGT CGTCCGGCAC GCTCGGGCGG CGACCAGCGG CTACCGGCGG GTGAGCTGTA CGCCGCCGGG GGCTCGGCGT TCGGTGTCGC TGCGCCGCCT CGACCGGCAA GCCGCCCTGT TCGCCTGGCT CGCCAACTGC AGCCGGAGCA GCTTTCCGCT GGTGGCGGTT CGGCTGGCCC TGGCCGCCCT GCTCCGCGCG CTTGCGTTCG TCGCCGCGAA GAGCGTGGAC CGGGCCGTCT CGGAGCTCGC GGCGGCTGCG GCGGTCTTCG GCCGGCCCGG CCGGTTGCTG GCGGCCCGGC GGCAACGACG TCCATTCCGG CGGGTGCCGT ACTCGGAGGT CCGGGCCCTG CTGGCCCCGC GCGGGTCGCG GCTGCGGCAG GTCATCGACC AGTGGACGGC GGCGTCCGCG CCCGACGACA GGCGCCGCAG CCTGCGGCGG CTGCTCACCA ACCCCGGGGT GCTGCTGGCC GTCGGCCTGC TCGCGGTCAC GCTTGCCGCC GAGCGCACGG TGCTTACCAC CCAGCTCTGG GGCGGAGCCC TGCTGCCCGC GCCCGCCGGG GCGGGCGACC TGTGGCAGCG GTACGTCGAG AGCTGGCATG CCACCGGGTA CGGCAGTGCA GCGCCCGCTC CGCCGTACCT CGCGGTGCTG GCGTTCCTGG GCAGCCTGCT CGTCGGCAAC GCCCGGTTCG CCGTCGCGAT CCTCCTGCTT GGAGCCGCCC CGCTTGCCGG GCTGGTCGCG TATCGGTCCA GCCGCTGGGC GTTCACCTCG CCGCGGCTGC GGGTGACCGC CGGGGCGCTG TACGGCCTGG CTCCGGTGGT CACCGGGGCG GTGAGCACGG GACGGCTCGG GGTCGCCGTC GCGGCCATCG TCCTGCCGGC GCTGCTCGTC CAACTGGCCC GGGCGTTGGT GCCGGAACAG CTGCTTGCGG TCCCCGCCGG CGTCCGGCAT GGACGGCGAC CGGCGTCCGT GCGGCACGCG TGGGCTGCCG GCTTGCTGCT CGCCGTCCTG ACCGCGTTTG ACCCCGCGGG GTATCTCCTC ACCGCCGCCT TGTTGGTCAT CGCTCTTGTC ATCGCACTGG TACGTCGGCA GGTCGGCGAG GCGGCACGCT GCGTCTTGAT CGCGGGCATT CCGGCCCTGC TTCTCGTTCC GTGGACCGGC TGGCTCTGGT CGCACCCGGC CCTGTTTGTC ACCGGCCTGG GCCAGGTGGC TCAGGCACTG CAGGATTCCA ACCTCCACCC GGTTGACCTG GTGCTCGCAC ATCCGGGCGG GCCGGGGTCG CCGCCGTACT GGTTGATGGC GGGGGTGGCG GCTGCCGCGC TCTTCGGCCT GTTCCGCAGC CGCACGGCGC TTCTCGGCTG GCTGCTGGCG GCGGTCGGGG TCGCCGGCGG CCTGGTCGTC ACCCGGGTGC ACGTTGTCCC CGTCGCCGAG CCGGGCGGCA GTGCGGTCGC CGGGTGGCCG GGGGCGGCGA CGGCGTTCGT CGCGGCCGGG CTCGCGCTGG CCGCCGCCGG TGGCTTGGCC GGCCTGCGGG GTTGGCTGCG CCGGTCCAGC TTTGGCCTGC GGCACCCTGC GACGCTGCTC CTCGCCGCCG CCGTCCTTGC CACGCCGCTC GTCGCGGCGG GAACCTGGAT TGCGCGCGGC ACCGGACACC TGCTGCGGAC CGGGTCGGCC GACATCCTGC CGATCTTCGT CACCGATCCG ACGACCGTGC ACGGGCAACC GCGCACGGTC ATCCTCCGGC CTGCGGGGAA CAGCGTCCGC TACACCGTGT TGCGGGACCG GTCGCCCGAA CTGGGGGACG CCGACCTGCC GCCGAGCCCC GCGCAGGTTG CCGCGGTGGG CAGTGCGGTC GGCGACCTGG CAAGCGGGCT CGGCGGGCCG GCCGCCGACG CGCTTGCCCG CACGGGTGTG CGGTTCGTGC TCATCCCGGC GTCGGCGCAG CGGCTGGACC AGACCATCGC CGCCGCCGGC GGGTTGCTGC GCCGCGGCAC GGTGGGCGGC TGGCAGGTCT GGGAGATCAC CCCGGGCGGC GCCAGGCTTG CGATCACTGA CGGCACCGCG TGGCAACCGG TGACGGCCGG CGGTGTCGGG TGGAACGCAG CTCCGGTGAT GGTGAGTTCC GGTTCGCCGT CCCGGTTGCT CGTGCTCGCT GAGGCAGCCT CGCCGCGGTG GCGCGCTGTT CTCGGCAGCC CGGGGAGCAC CGTGCCGCTC GCGCCGGTGA ATTATCAGGG CTGGCAGGCG TTCCGTCTGC CGGCCACCGG CGGGGAGGTT CGGGTCTACC GCGTTCCGGA CCGCCGCAGC ACGTGGCTGA CCGTGGAGCT TGCCGCGACG GCGATCGTGA TCTTCATCGC GCTGCCGGGC GCCCCGCGGC GTCGCGCCAC CGCCGCGCCC GCCGCAACGC CGTCCGAGGC CCGTGTCCCC GTTGGAGCTG CGCCATGA
|
Protein sequence | MTNRRPYLPH PRHVVTAILV AHDGGRWLPT TLQAVKMQRR PVQRFVAVDT GSRDDTRQLL EESVGAASVL SAPRTVGFGD AVHQAVAAFA GVPGWAASQT DAGSPVEWLW LLHDDSAPDP AALDAMLALA DEMPSAAVIG PKIVDWERPG VLREVGFTVD RGGHRQTGLE PDELDHGQHD GDRDVFAVSS AGMLIRRDLW DLLGGFDRNF PLLRDDLDFC WRAHLAGERV VVCTRAVVRH ARAATSGYRR VSCTPPGARR SVSLRRLDRQ AALFAWLANC SRSSFPLVAV RLALAALLRA LAFVAAKSVD RAVSELAAAA AVFGRPGRLL AARRQRRPFR RVPYSEVRAL LAPRGSRLRQ VIDQWTAASA PDDRRRSLRR LLTNPGVLLA VGLLAVTLAA ERTVLTTQLW GGALLPAPAG AGDLWQRYVE SWHATGYGSA APAPPYLAVL AFLGSLLVGN ARFAVAILLL GAAPLAGLVA YRSSRWAFTS PRLRVTAGAL YGLAPVVTGA VSTGRLGVAV AAIVLPALLV QLARALVPEQ LLAVPAGVRH GRRPASVRHA WAAGLLLAVL TAFDPAGYLL TAALLVIALV IALVRRQVGE AARCVLIAGI PALLLVPWTG WLWSHPALFV TGLGQVAQAL QDSNLHPVDL VLAHPGGPGS PPYWLMAGVA AAALFGLFRS RTALLGWLLA AVGVAGGLVV TRVHVVPVAE PGGSAVAGWP GAATAFVAAG LALAAAGGLA GLRGWLRRSS FGLRHPATLL LAAAVLATPL VAAGTWIARG TGHLLRTGSA DILPIFVTDP TTVHGQPRTV ILRPAGNSVR YTVLRDRSPE LGDADLPPSP AQVAAVGSAV GDLASGLGGP AADALARTGV RFVLIPASAQ RLDQTIAAAG GLLRRGTVGG WQVWEITPGG ARLAITDGTA WQPVTAGGVG WNAAPVMVSS GSPSRLLVLA EAASPRWRAV LGSPGSTVPL APVNYQGWQA FRLPATGGEV RVYRVPDRRS TWLTVELAAT AIVIFIALPG APRRRATAAP AATPSEARVP VGAAP
|
| |