Gene Athe_0854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0854 
Symbol 
ID7407429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp949238 
End bp951319 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content36% 
IMG OID643715232 
ProductAlpha-glucuronidase 
Protein accessionYP_002572742 
Protein GI222528860 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3661] Alpha-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTTAT CAAGGAGCAG TAACCCAAAC TATTCTATGT GTTGGCTTTC TTATAAACCT 
ATAGGTAAGA AAGAATATGT ACATGAAGTT GAAAAATTTT TAGGGCAAAT AGTTTTATTG
GAGAAAAATA TTTATTTCGA AAATGCGGCG AATGAACTTA AAAAGGCTTT ATGTGTATTG
TTTGAAACTG AACTAAGATT GAACAATGCT TTAAGTCTTT ATGTTGACAG TGGAATTATT
TTAGGTAAAG TGACAAATGA AAATCTTAGA GGTTTTATAA CCGATGTTGA AAAAGAAGCA
GTAGGTGAGG AAGGGTTTAT AATAAAACTT GTAGATAAAA GTAAGAAAAA ATACATTATT
GTTGCTTCAA AGGGTGAAAA AGGAATAATA TATGGGATAT TTCATTTGAT AAACAAATTT
AGACTTAAAA CAGGATTAAA AGAACTCAAT TGTATAGAAA ATCCAAAGGC CTCGTTACGA
ATTATTAACC ATTGGGATAA TATGGATGGA AGTATTGAAA GAGGATATGC GGGTAAATCA
ATATTTTTTA CAAATGGTAG AATAAAACGC AATTATAAAC GTATATGGGA TTATGCAAGG
CTTCTTGCCT CAATTGGAAT AAACGGTGTT GTAATAAATA ATGTGAATGT AAGAGATAAG
GCTATATGGT TAATTACGCC AAAATATCTA AATGACCTCT CGAAAATAGC AGAAATTTTT
AGACTCTATG GGATAAAACT TTACCTTAGC ATAAACTTTG CAAGCCCAAT TTATATAGGA
GGTCTTGACA CTGCAGACCC ACTTGACAAA AACGTTCAAA AGTGGTGGAA GGACACTGTA
AAAACTATTT ACAGCTACAT ACCAGACTTT GGTGGATTTT TGGTAAAAGC CGATTCTGAG
TTCAATCCAG GGCCGTATGT ATACGGTAGA ACACATGCAG ATGGAGCAAA CATGCTTGCA
GAGGCACTTT TGCCTTATGG AGGAGTTGTT ATATGGCGTG CGTTTGTTTA CAACTGCTTG
CAGGATTGGA GAGATACAAA GACAGACAGG GCAAAGGCTG CATATGACAA TTTTAAACCA
CTTGATGGGA TGTTCTCTAA AAATGTCATT TTACAGATAA AGTATGGTCC GATGGATTTT
CAGGTAAGAG AACCTGTTTC ACCTCTTTTT GGCGCTATGG AAAAGACAAA CCAGATGATA
GAGTTTCAAA TAACCCAAGA ATATACGGGG CAACAAATTC ATCTGTGCTA TTTGGGGACG
CTATGGAAAG AGATTTTAGA GTTTGACACA TATTGTAAAG GAAAAGGTTC GTACGTAAAG
AGAATAGTGG ATGGAAGTCT TTTTGGAATG AAATATGCAG GATTTGCAGG TGTTTCGAAT
ATTGGGGATA GCATCAACTG GACAGGTCAT GACCTTGCAC AGGCGAATCT GTGGACGTTT
GGAAAACTTG CATGGGACCC AGATAAAAAG ATTGAAGATA TAGCAAGAGA GTGGGCCATT
TTAACATTTG GAGATGACAA AAAAGTGGTT GACAACATTT TATGGATGCT TCTTAATTCT
CACGGGATCT ACGAAAAATA TACAACTCCG CTTGGGCTTG GCTGGATGGT AAATCCAGGT
CATCACTATG GTCCAAACCC GGAAGGGTAT GAGTATTCAA AGTGGGGAAC GTATCATCGG
TCAGATACAA AAGCAATTGG AGTTGACAGA ACTTCAAGAG GGACAGGTTA TACTTTGCAA
TATCACAAGC CCTGGCAGGA AATATTCGAT GATATAAATA AATGTCCTGA AGAACTTCTT
CTATTTTTCC ACAGAGTGCC GTATGATTTT AGACTGAAAA ATGGAAAAAC GCTCCTGCAG
TTTATGTATG ACTCTCACTT TGAAGGGGCT GATATGGTAG ATAAACTTAT AGAAAAGTGG
GAGGAACTGA GAGGAAAGAT TGATGAGGAG ATCTTCAACA GAGTATATGA AAGATTGAAG
ATGCAAAAAG AACATGCAAT GGAATGGAGA GATGTTATCA ACACATATTT TTATAGAAAG
ACAGGAATAC CTGATGAAAA GGGAAGACTA ATATATCCGT AA
 
Protein sequence
MILSRSSNPN YSMCWLSYKP IGKKEYVHEV EKFLGQIVLL EKNIYFENAA NELKKALCVL 
FETELRLNNA LSLYVDSGII LGKVTNENLR GFITDVEKEA VGEEGFIIKL VDKSKKKYII
VASKGEKGII YGIFHLINKF RLKTGLKELN CIENPKASLR IINHWDNMDG SIERGYAGKS
IFFTNGRIKR NYKRIWDYAR LLASIGINGV VINNVNVRDK AIWLITPKYL NDLSKIAEIF
RLYGIKLYLS INFASPIYIG GLDTADPLDK NVQKWWKDTV KTIYSYIPDF GGFLVKADSE
FNPGPYVYGR THADGANMLA EALLPYGGVV IWRAFVYNCL QDWRDTKTDR AKAAYDNFKP
LDGMFSKNVI LQIKYGPMDF QVREPVSPLF GAMEKTNQMI EFQITQEYTG QQIHLCYLGT
LWKEILEFDT YCKGKGSYVK RIVDGSLFGM KYAGFAGVSN IGDSINWTGH DLAQANLWTF
GKLAWDPDKK IEDIAREWAI LTFGDDKKVV DNILWMLLNS HGIYEKYTTP LGLGWMVNPG
HHYGPNPEGY EYSKWGTYHR SDTKAIGVDR TSRGTGYTLQ YHKPWQEIFD DINKCPEELL
LFFHRVPYDF RLKNGKTLLQ FMYDSHFEGA DMVDKLIEKW EELRGKIDEE IFNRVYERLK
MQKEHAMEWR DVINTYFYRK TGIPDEKGRL IYP