Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0073 |
Symbol | |
ID | 4484665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 78012 |
End bp | 79358 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639728835 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_871835 |
Protein GI | 117927284 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGCGTT GGCTGTCGGC TCCCGAGCAT GCACGGCACC GCACGGTCGC TGCTTCGGCG GGTGCTCCGT GGACGCCGCG ACGCGTCGCG ATGATCGCCG TCCACACCTC GCCGCTGGAG ATCCCTGGGT GTGGCGATGC CGGCGGCCTC AACGTGTACG TGGCGCAAAT CGCCCGGCGG TTGGCATCCC GCGGTATTGA CGTTGACGTG TTCACCCGGG CCACCCGGCG GGACCTACCG CCGCAGCAAC GACTCGCTCC GGGGGTCACC GTCCGCAATG TCGTCGCCGG GCCGTTGGAA CCGCTGCCCA AAGACGAGCT TCCGGTGCAT CTGTGTGCGT TCACCGCCGC GGTGCTTCGT GCGGAGGCGA TGCGTGAACC CGGCTGGTAC GACGTCATCC ACTCCCATTA CTGGCTCTCC GGTGAGGTTG GCCGGGTGGC GAGCCAGCGG TGGGGGGTGC CCCTCGTCCA CACCATGCAC ACCTTGGCGA AAGTCAAGAA CGCCGCGCTC GCCGAAGGCG ATGTGCCGGA ACCGGGCCGA CGTGTCATCG GAGAAGCCGA CGTTGTCGCC GCCGCGGATC GGCTGGTGAC TAACACCTGG ACTGAGGCGA GGCAGCTCGT TGACCTTTAC GGCGCCGAGC CCGATCGGAT CCGCGTGGTA CCGCCCGGCG TCGAAACGGC GATCTTCCGG CCCGGTGACA GCGCCCGGGC TCGCCGTCGG CTGGGCCTTC CCATCGACGG GTGCGTCGTC TTGTTCGTGG GACGCCTCCA ACCGCTCAAA GGGCCGGACA TCGCGGTTCG CGCCGCAGCC GAATTTCTCT CCACCCATCC GGGAATGCGT TCGACGTTCC GCCTCGTCAT TGTGGGAGGT CCGAGCGGGT CGCGCAGCAC GGAGCCGGAA CGGTTGCGCG CGCTCGCTGC CGATCTCGGG GTTGCCGATG CCGTCATTTT CGCCCCTCCG ATGCCGCCGG ATAGGCTCGT CGAGTTCTAT CGCGCCGCGA CGGTGACGAT TGTTCCGTCC CACAGCGAAT CGTTTGGCTT GGTGGCACTG GAATCGCAAG CCTGCGGCAC GCCGGTGGTG GCCGCCCGGG TCGGCGGTCT GACGACGGCG GTGCGGGACG GCGAGAGCGG TCTGCTCGTC GACGGCCACG ACCCCGCGAG GTACGCCGGC GCAATCGGCC GCCTTCTCGA CCCGGGTCTG CGGGCCGAGC TCGTCCGCGG CGCGGTCGCT CACGCCATGC GGTTTCACTG GGACAACACC GTCGAGGGCA TTCTGGGCGT GTACCGGGAT GCCCTGGCTG AGCGCCGGAC CGCGGCGACA CAGCGGCTGG CGAGCCGCGT GGGCTAG
|
Protein sequence | MPRWLSAPEH ARHRTVAASA GAPWTPRRVA MIAVHTSPLE IPGCGDAGGL NVYVAQIARR LASRGIDVDV FTRATRRDLP PQQRLAPGVT VRNVVAGPLE PLPKDELPVH LCAFTAAVLR AEAMREPGWY DVIHSHYWLS GEVGRVASQR WGVPLVHTMH TLAKVKNAAL AEGDVPEPGR RVIGEADVVA AADRLVTNTW TEARQLVDLY GAEPDRIRVV PPGVETAIFR PGDSARARRR LGLPIDGCVV LFVGRLQPLK GPDIAVRAAA EFLSTHPGMR STFRLVIVGG PSGSRSTEPE RLRALAADLG VADAVIFAPP MPPDRLVEFY RAATVTIVPS HSESFGLVAL ESQACGTPVV AARVGGLTTA VRDGESGLLV DGHDPARYAG AIGRLLDPGL RAELVRGAVA HAMRFHWDNT VEGILGVYRD ALAERRTAAT QRLASRVG
|
| |