Gene Acel_0894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0894 
Symbol 
ID4485726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp985623 
End bp986840 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content67% 
IMG OID639729669 
Producthypothetical protein 
Protein accessionYP_872653 
Protein GI117928102 
COG category[S] Function unknown 
COG ID[COG5276] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.707456 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGACG GTCCGGACGG CGGATATCGC CCTTCCGAGC GGGGTTTACG GCTGACCGGA 
CACCATGATC TCGGCGGTCG GGGCGATGGG ATGCAGGTGA TGCGCTGGGG ATCGGCGGTG
TACGTCGGAC ACGTCGGCAC CAGCCGCGCC GGCACGTCCG TCCTTGACGC CTCGGATCCG
GAACGACTTC GACTCGTCGA GCAGTGGCCG GCCCCCAACC GTTCGCATAC ACACAAGGTT
CAGGTCGCGG ACGACGTGCT CCTCGTCAAT CACGAGAAGT TCCCCTACCG CGTACCGGCC
GACGGACCGG TCTCCGCGGG CGTAGCCATT TACGACGTCA GCCGCCCTCT GGAGCCCCGG
CGGATCGCCT TCTGGGAGTG CGGCGGCATC GGGGTGCACA GGATCGTCTG GACCGGGGGA
CGGTATGCGC ACATGTCAGC GACGCCGGCG GGTTTCCGGG ACCGCATCTG GATCGTGCTC
GACCTCGCTG ATCCATTCCA TCCGGTGGAG GCCGCGCGAT GGTGGTGGCC CGGTCAACGG
GACGACGAGG TGCCCGACTG GCCGGCGGAG CTGCGTTACG CCGCGCACCA TGCGTTGATC
GCCGACGGAC GGGCGTTCGT TGGCTATGGC GATGCGGGCA TGGTGATCCT GGATGTCGCC
GATATCACCC GGCCACGGTT GCTTCACCGT GTTTCCTGGC CGGACGGCGG CGATACCCAC
ACCTGCCTGC CGCTGCCGGG GCGCCGTCTC ATTGCTGTGA CGGACGAGCA GGTCCGGGAC
GGTCCCGGCG CGCCGCCACG GAAGATTCGG CTGTTCACGA TGGACGATCC ACCCCGGCTG
GTGAGCGTGC TGCCGGCACC GGACGACGAG TTCGCGAGCC TGCCGCTGCG CTACGGTGCA
CATAATCTTC ACGAGAACCG GCCGCGTTCC TACCAGAGCG AGGACATCCT CTTCGCGACG
TATTTCAGCG CCGGGCTCCG TGTTTATGAC ATCAGCGATC CCGGGCAGCC GGTGGAAATC
GCCCATTGGT GTCCGCCGGT GCCGCCGGGT CAGGCGGTGC CGCAGATCAA CGATGTATTC
GTCGACCATG AGGGGCTCAT CTGGGTGACG GATCGGCTGA ACGGCGGCCT CTACGTGCTC
GAGCCGGAAC CGGAACTGCG GCGGCGGATG CTCCATCACG CGCCACCCAC CCGGTCCGGC
GGCCCGAGAG GATGGTGA
 
Protein sequence
MIDGPDGGYR PSERGLRLTG HHDLGGRGDG MQVMRWGSAV YVGHVGTSRA GTSVLDASDP 
ERLRLVEQWP APNRSHTHKV QVADDVLLVN HEKFPYRVPA DGPVSAGVAI YDVSRPLEPR
RIAFWECGGI GVHRIVWTGG RYAHMSATPA GFRDRIWIVL DLADPFHPVE AARWWWPGQR
DDEVPDWPAE LRYAAHHALI ADGRAFVGYG DAGMVILDVA DITRPRLLHR VSWPDGGDTH
TCLPLPGRRL IAVTDEQVRD GPGAPPRKIR LFTMDDPPRL VSVLPAPDDE FASLPLRYGA
HNLHENRPRS YQSEDILFAT YFSAGLRVYD ISDPGQPVEI AHWCPPVPPG QAVPQINDVF
VDHEGLIWVT DRLNGGLYVL EPEPELRRRM LHHAPPTRSG GPRGW