Gene Acel_0422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0422 
Symbol 
ID4485657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp443130 
End bp447023 
Gene Length3894 bp 
Protein Length1297 aa 
Translation table11 
GC content66% 
IMG OID639729189 
Productglycosyl transferase family protein 
Protein accessionYP_872182 
Protein GI117927631 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGTCA ACCTCGGGTG CCGCGGCGGC TATTTCGACG GTTGGACCAA CGTGGATGCG 
GACGCCGGAG TCCGCGCGGA CATTCATCTC GATCCGCTCG ATTTTCTCCG CCGGTACGCC
GACGACATTG ACACGTTGTA CGTCGGTGCG CTGCTGGAGC AGCATCCGGT CGAGTACGCC
TTGTCGTTGC TGCGCCTCGC CAATGACCGG CTGTGTCCCG GCACGGTCGT CGTGGCGGTG
ACGCACGACG TCAAGGCGAT TTTGCGCGCC GCCGTCACCG GCGCCCTGAC TGCCGCAGAA
TCGGCGGCTG CGGATGTGTC CCGGTATGTG GAGCAACCGG AGAACGTGTC GTACTACGAC
ACACACTCGC TCGCGGCCCT GTTCCACCGC GCCGGTTTTG CCGATGTGAC GCCGATTGGC
GACCTGGACG CCGAACTGTC TGCGCTCGGC GTCGGTCCGG TCGATGCCCG GTGGCGCTGC
GCGGTGCGCG GTGTCGCGGT AGGCCGGCGC TATCGCAGCG TCGATGCCAT TGTCGAAGCC
GTTACCGAGG ACACCGCCGG TGAAACCGCC AGGGAAGCGG CTGAGCCGGC AGCCGAGGAA
ACGGCTGGTG TTTCCCGGGC TGGTGTGTCC CGGGCGGCAG GCGGTGCTGA CGCGGGCAGA
CCGCTGCGGC CTGAGGTGCT CACCGAGGCG CATTCCGCGT TGCGTGAGGT GCAGAAGCTG
CGCAGCGCGC TGATTCGTGA GCATGAGCGG CGGGTGCTTG CCGAGCAGTA CCTGGAGGAG
CTGACCGAGC GGACTGGCGA GGACACCGCG CGACGCGACG AGCCGGTGGA ACGCGTTCCC
GAAGTGGGCG GTCGTGAACC CGACACCGCC GCTGCGGAGA ACGCCGCGGC CCTCACCTGG
CGGGAACGGG CCAAGCGTTT CGCCAAGACT GTGCTCCCTC CCGGATCCCG TGGACGTATT
GCGGCGCTGG GCGCGCTCCA CCTGTACCGC GAAGGGCGGA CCGTGCTGCG GAGCCTCCGG
CAGGTCGCGG CGGAAGCGGG ATTTGCCCGG CCGCCGAGTT ATTCGCGGTG GTACCGATAT
CACGACGCGA CACCGCATCA CCTGAGACTC CAGCGGCGTG CGTCCGAGCG CGCGACGATG
CCGCCGACGT TTCTCGTCTG CGTCCTGGCC GACACCGCCG ACCGCCAGCC GATCGCAACG
ACCATTGCGA GCATTCGCAG TCAATCCTGG CAGCACGCGA AGCTGGTCAT TTGCGCTGCG
GAACCGATCG CGCGTGCGCT CCAGGGGGTT TTCTCTGACG TCGACGTCAT CGGCGCGCCG
ACGGTCGCGG ACGCCGTGAT GCGGGCGGTG GATGGCAGCG ACCGGGATTT TGTGCTGCTG
CCGGACGCCG GGGACGTGCT TGCCGCGGAC TGCCTCTATC GCGTGGCACA GGCGGCGTGG
CGAAATCCCT TGCTTGACCT CACCTACTGG GACGACGACC AGCTCGGTGC CGACGGACGC
CGGCATGACC CGCTCTTCCG CCCGTCCTGG TCGCCCGAGA TGCTCTTCAG CGTCAACTAT
CTCGCCCAGT CCTTCGCCGT CCGGCGCCGG CGGATCCGTG CCATTGGGTC GCTCCACGGT
GATTCCGCGG AGGTCATGCG CTGGGATCTC CTGCTGCGCG GCAATTTCAC CGGCGAGCAG
GTGGAACGAC TCGCGCACGT CCTTGGGCAC GTGCGTCGCC GTGTCTTCGG CGTGACTGAG
GCCGGCCGGC AGGTTGTGCA GGCCGAACTG GCGCGTCGCG GTCTTCCGGC GGCTGCGGAA
CTCGATGCGC ACGGCGTACG GCTGCGCTGG CAGCTTCCGG AGTGGCCGAC GGTCAGCATC
ATCATCCCCA CCCGGCACAA CCGCCGACTG CTCTCCACTG TGCTCGACGG GCTGCGGGCC
ACGGATTATC CGTCGTTCGA GGTGAGAATC GTCGACAACG GCGGCTACTC AACGGAGAAC
GAGGCGTGGT ACGCCGAACA GCTGCGCGGG CTGGACGCGC ACGTCACGTG GTGGACCGAG
GAGCCGTTCA ACTACTCGCG GGTCAACAAT GCCGGCGCCG CTGACGCCCG CGGTGACGTG
CTGGTTTTTC TCAACGACGA CATCGAACTG ACCGATCCGT CCTGGCTGCG TGAACTCGTC
GGATGGACCT CGGTGTCCGA CATCGGGCTC GTCGGCCTGC AATTGCTTGA CGGCAACGGA
CGCATCCAGC ACGGCGGTGT CATCCTCGGT CTCGGCGGTT TCGCCGACCA TCTCTTCCAG
GGCATGGCGC CCGGCACCAT GACGATGTTC GGCCACACCG GGTGGTATCG CAACCTCTTG
TCGGTGACCG GGGCGTGTGT CGCGGTACGC CGTGAGGTGT TTCGCACGAT CGGCGGCTTC
GACGAACGGT TCATCCTCTG CGGCAGTGAC GTCGCGTTGG GACTCGACCT CGTCGAAGCC
GGTTACCGGA ACATCTGCTC GCCGTACGGC GGGGTTCGGC ATTTGGAGTC AGCCACCCGC
GGCACCGACA TTCCGCGGCA GGATTTCTTC ACCAGCTACT GGCGCTACAA CACGTGGCTC
TTCGGTGGTG ACCCGTACTT CTCACCGAAT CTCTCGCTGT ACAGCCGGGA GCCGCGGTTG
CGGAATCCGT TCGATCCGCC GATTTTGAAG CGGGTGTCAG CGGTTCTCGG CCGTGAGCTT
CGGGTGTTCC GGCAAAAGAG CGATGCATCC GAGGCGGCTG GACTCGCCGC GGTGTCCCGG
GCGGACGACG TCGACGTCGC CGCGGTCCGC TCATTGCACG AGCGCAATCA GGGTCACATC
GACGTCCGGA CCATCAACTG GTACATCCCG GAAATTGATT CACCGTTTTA CGGCGGCATC
AACACCGCAT TCCGGATGGC CGACTACCTC GCGCGTCGGC ACGGAGTGCA GAACCGCTTT
GTCGTGTGGG CAAAACCGGC TGAGGAGTTC ATGCGCTCGG CACTTGCCGC TGCGTTTCCC
ACGCTGGCCG ACTCCGAGAT CGTATTCTTC GATGCGGTCG ACAGCGGTGC CGCCGAGCAG
ATCCCACCTG CCGACGTCGC GATCGCCACG CTCTGGCTGA CGGCGTACGC GGTCTTGCAC
GCGCGTAACG TGCGCCGGAA GTTCTACCTC GTGCAGGATT TCGAGCCGAT GTTCTATCCC
GCGGGCACGT TGTATGCCGT GGCCGAGGAG ACGTACCGGT TCGGCTTGTA CGGCCTGTGC
AACACCGACA ACCTTCGCCG CATGTATGTC GAGGAGTACG GCGGAAAGGC GATGGCGTTC
ACTCCGGCGG TGGATCCGGC GGTCTTCCAC GCCGTCGGGC GGTCCTGGCG GAGTGAGGAC
GATCCGGTGA CCGTGTTCGT CTACGCCCGT CCCGGTCACT GGCGCAATTG CTGGGAAGTC
GCGTCGCTCG CGTTGCGGGA GTTGAAGAAC CGGCTGGGTG ACCGGGTGCG CATCGTCACC
GCTGGATCGT GGGCCATCGA CCCGGCTGCC GCCGACAGCA TGCAGCAGCT CGGTCTGCTG
AGCTATAAGG GCACCGGGAA TCTCTACCGC ACCTGCGATG TCGGGCTGGC GCTGACGGTC
TCCAAGCATC CGTCGTACCT GCCGCTGGAG CTGATGGCCT GCGGCGTTCC GGTGGTCGCC
TTCGATAATC CCTGGGGGTA CTGGATTCTT CGCGACGGCG AGAATGCGCT GCTCGCCCGG
CGTACCGTCG ATGGTCTGGC GGATGCGCTC GAGCGGCTCT GCACCGACCA TTTGCTGCGC
GAGAAGCTCG CGCAGAATGC CCTGGCGACC ATCGCCGAGG GGTATACGAA CTGGGATCAC
GCGTTTCGCG ACATCTACCG GTATCTGTGC GATCCGGAGG CCGGGTCGGC CTGA
 
Protein sequence
MKVNLGCRGG YFDGWTNVDA DAGVRADIHL DPLDFLRRYA DDIDTLYVGA LLEQHPVEYA 
LSLLRLANDR LCPGTVVVAV THDVKAILRA AVTGALTAAE SAAADVSRYV EQPENVSYYD
THSLAALFHR AGFADVTPIG DLDAELSALG VGPVDARWRC AVRGVAVGRR YRSVDAIVEA
VTEDTAGETA REAAEPAAEE TAGVSRAGVS RAAGGADAGR PLRPEVLTEA HSALREVQKL
RSALIREHER RVLAEQYLEE LTERTGEDTA RRDEPVERVP EVGGREPDTA AAENAAALTW
RERAKRFAKT VLPPGSRGRI AALGALHLYR EGRTVLRSLR QVAAEAGFAR PPSYSRWYRY
HDATPHHLRL QRRASERATM PPTFLVCVLA DTADRQPIAT TIASIRSQSW QHAKLVICAA
EPIARALQGV FSDVDVIGAP TVADAVMRAV DGSDRDFVLL PDAGDVLAAD CLYRVAQAAW
RNPLLDLTYW DDDQLGADGR RHDPLFRPSW SPEMLFSVNY LAQSFAVRRR RIRAIGSLHG
DSAEVMRWDL LLRGNFTGEQ VERLAHVLGH VRRRVFGVTE AGRQVVQAEL ARRGLPAAAE
LDAHGVRLRW QLPEWPTVSI IIPTRHNRRL LSTVLDGLRA TDYPSFEVRI VDNGGYSTEN
EAWYAEQLRG LDAHVTWWTE EPFNYSRVNN AGAADARGDV LVFLNDDIEL TDPSWLRELV
GWTSVSDIGL VGLQLLDGNG RIQHGGVILG LGGFADHLFQ GMAPGTMTMF GHTGWYRNLL
SVTGACVAVR REVFRTIGGF DERFILCGSD VALGLDLVEA GYRNICSPYG GVRHLESATR
GTDIPRQDFF TSYWRYNTWL FGGDPYFSPN LSLYSREPRL RNPFDPPILK RVSAVLGREL
RVFRQKSDAS EAAGLAAVSR ADDVDVAAVR SLHERNQGHI DVRTINWYIP EIDSPFYGGI
NTAFRMADYL ARRHGVQNRF VVWAKPAEEF MRSALAAAFP TLADSEIVFF DAVDSGAAEQ
IPPADVAIAT LWLTAYAVLH ARNVRRKFYL VQDFEPMFYP AGTLYAVAEE TYRFGLYGLC
NTDNLRRMYV EEYGGKAMAF TPAVDPAVFH AVGRSWRSED DPVTVFVYAR PGHWRNCWEV
ASLALRELKN RLGDRVRIVT AGSWAIDPAA ADSMQQLGLL SYKGTGNLYR TCDVGLALTV
SKHPSYLPLE LMACGVPVVA FDNPWGYWIL RDGENALLAR RTVDGLADAL ERLCTDHLLR
EKLAQNALAT IAEGYTNWDH AFRDIYRYLC DPEAGSA