Gene Acel_0190 details

Gene Information

Locus tag: Acel_0190
Symbol:
ID: 4485503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 202044
End bp: 203336
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 67%
IMG OID: 639728953
Product: Tat-translocated enzyme
Protein accession: YP_871950
Protein GI: 117927399
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2837] Predicted iron-dependent peroxidase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
    [TIGR01412] Tat-translocated enzyme
    [TIGR01413] Dyp-type peroxidase family
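
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (an assumption that is consistent with the listed numbers, not something the record states) and that the unspliced CDS includes its stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminating stop codon.
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(202044, 203336)
print(length)                     # 1293
print(protein_length_aa(length))  # 430
```

Both results agree with the Gene Length and Protein Length fields listed for Acel_0190.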


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0695897
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAGG CATCTGGCGG CGTCACGCGA CGGCGTTTTC TCGGCGGCCT CGGACTTGCC 
GGCGGAGGTC TCGGGATCGC AGCCGCCTTC ACGGGCGGAA GCGCAGCCGG CGCCGTCCTC
GGCCTGGTCG ACCGATCCCC TGACGGGCGA TCTGAGCAGG CGGCGGGTGC GGTGTCGACA
CCGGGTCGCG TGGTCCCGTT TCGGGGGCGC CATCAAGCCG GCATCATCAC GCCGGCCCAG
GATCGGCTGG TCTTTGGCGC GTTTGACGTG GTGACGGATT CAGCGGGTGC GCTACGTGAC
GTGCTGGACG CGTGGACGAT TGCCGCTGAA GCAATGACAC AGGGTCGCTC CGTGCCAGCA
GATGATGGGA GCCCGCAAGC ACCACCCACC GACACCGGCG AGGCACTCGG TTTACCGCCG
GCCGCGTTGA CGGTGACGAT CGGGTACGGG CCGCGACTCT TCGCGCGATT TGGGCTCGCC
ACCCGTCGCC CGTCAGCATT GGCCGAGCTT CCTCCGCTTC CCGGCGATCG ACTCGATCCG
GCCCGGTGTG GCGGGGACAT CTGCATCCAG GCGTGCTCCG ACGATCCGCA GGTCGCATTC
CACGCGGTGC GTAATGTGAC CCGGCTGGGC CGGGGTGTCG TGGTCCTCCG GTGGCTGCAG
CTCGGTTTCG GACGCACCTC GTCGACGACG ACCGGTCAGC AGACGCCTCG TAACCTGATG
GGCTTCAAAG ACGGCACGCG GAATCTCCGA GCCGATGACA CGGCCGCATT GGATCGGCAC
GTGTGGATTG GTACGGAGAC CGACCAACCG TGGCTCATCG GCGGCAGCTA CCTCGTCGCG
CGGCGGATCC GGATGTTGAT CGAAGCATGG GACCGGGCGT CGCTGAGCGA GCAGGAACGG
GTGATCGGCC GGCGTAAAGT ATCGGGCGCT CCGCTCACCG GATCCAGGGA ATACGACCCG
CCGGATTTCT CCGCCGTGCG GGCCGGCGAA CTGGTCATCC CGTCGGACGC CCACATCCGG
CTGGCCAGTC CCGAGCACAA CGCCGGACGC CGAATGCTTC GTCGCGGCTA CTCCTATACC
GACGGCATCG ATCTCTCAAC CGGGGAGCTG GACGCTGGTC TGTTCTTCAT CTCCTTCCAT
AAGGATCCCG CAACATTCAT CGCCGTCCAG CGCATTCTCG GCACCCAGGA CGCGCTGCGG
GAGTACATCG TGCACACCGG GAGCGCACTT TTCGTCTGCC CGCCGGGTTT GCAGGACGGC
GAATCGTGGG GGCGCCAACT CTTCGGCGCG TGA
 
Protein sequence
MTEASGGVTR RRFLGGLGLA GGGLGIAAAF TGGSAAGAVL GLVDRSPDGR SEQAAGAVST 
PGRVVPFRGR HQAGIITPAQ DRLVFGAFDV VTDSAGALRD VLDAWTIAAE AMTQGRSVPA
DDGSPQAPPT DTGEALGLPP AALTVTIGYG PRLFARFGLA TRRPSALAEL PPLPGDRLDP
ARCGGDICIQ ACSDDPQVAF HAVRNVTRLG RGVVVLRWLQ LGFGRTSSTT TGQQTPRNLM
GFKDGTRNLR ADDTAALDRH VWIGTETDQP WLIGGSYLVA RRIRMLIEAW DRASLSEQER
VIGRRKVSGA PLTGSREYDP PDFSAVRAGE LVIPSDAHIR LASPEHNAGR RMLRRGYSYT
DGIDLSTGEL DAGLFFISFH KDPATFIAVQ RILGTQDALR EYIVHTGSAL FVCPPGLQDG
ESWGRQLFGA
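
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how one might do that and then compute a GC fraction to compare against the GC content field. The helper names are hypothetical, and the example runs only on the final line of the gene sequence, not on the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Final line of the gene sequence, copied from the record above.
tail = clean_sequence("GAATCGTGGG GGCGCCAACT CTTCGGCGCG TGA")
print(len(tail))             # 33
print(tail.endswith("TGA"))  # True: the CDS ends with a stop codon
```

Passing the full cleaned gene sequence to `gc_fraction` should give a value that can be compared with the 67% listed in the record.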