Gene Acel_1019 details

Gene Information

Locus tag: Acel_1019
Symbol:
ID: 4484560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 1125485
End bp: 1126780
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 63%
IMG OID: 639729794
Product: nickel-dependent hydrogenase, large subunit
Protein accession: YP_872778
Protein GI: 117928227
COG category: [C] Energy production and conversion
COG ID: [COG3259] Coenzyme F420-reducing hydrogenase, alpha subunit
TIGRFAM ID:
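
The start, end, gene length, and protein length values in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1125485, 1126780)
print(length_bp)                  # 1296
print(protein_length(length_bp))  # 431
```

Both results agree with the Gene Length and Protein Length fields of the record.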


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.00653003
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCGCTG AACGTCGGGT CCTGCGAACT GACTACCTTG CCCGGGTCGA GGGCGAAGGG 
GCGCTGTTCG TCGAATGTGA TGGCGACGTC GTCACCAAGG TGGAGTTGCG CATCTTCGAG
CCGCCGCGTT TCTTTGAGGC TCTCTTGCGC GGCCGGTCGT GCTTCGAGGC TCCCGACATC
ACCGCGCGTA TTTGTGGCAT CTGCCCGGTC GCCTATCAGA CCAGCGCGGT CAACGCCGTG
GAGAGTCTCG CCGGTGTGGA CGTACCCGAA TCCATTCACC ATCTGCGCCG ATTGTTGTAC
TGCGGTGAGT GGATCGAGAG TCACGCCTTG CACGTGTACC TGCTTCATGC ACCGGATTTC
CTTGGTTACC CTGATGCCAT CACGTTGGCC CGCGATTACC CGGACATCGT GCAGCGGGGC
CTACAGCTCA AGGCGGCAGG CAACCACCTC ATGCGGGTTC TCGGCGGCCG CGAAATTCAT
CCGATCAATG TACGAGTCGG CGGCTTCTAC CGCGTCCCTT CGGCGTCCGA ACTTCGCGCG
CTCCGGCCCG AGCTCGAGCA GGCGCGCGAA ATCGCAGTCG AGACCGTCCG GTGGGTCTCG
GGATTCTCCT TCCCCGAGCG GGTATTCGAG GGCGCGCTTG TCGCCCTCCA CCAGCCGGAC
AGTTACGCTA TTGAGCGCGG CCGTATTCGG TCGGATACCG GTCTGGATAT CGACGCTTCC
CGCTACGACG ACTACTTCGA AGAGGAGCAG GTCGGCCATT CCACTGCACT GCATTCACGG
TTGCGTGGCG CCGGGCGGTA CCTCTGCGGG CCGCTTGCCC GTTACAGCCT GAATTATCGG
CAACTGTCTC CGCTGGCCAA GGAATGTGCC CGCGAAGCCG GGCTCGGCGA GGTGTGCCGG
GACGTCTTTC GCAGCATCGT GGTGCGCAGC GTCGAACTCG TGTACGCCTG CGATGAAGCA
CTGCGGCTGA TTGACATTTA TGAGCGACCC GAAATTCCGG CCGTGCCCGT CGTCGTTCGG
CCAGGTACCG GTCACGGCGT CAGCGAGGCA CCGCGCGGTC TGCTCTACCA CCGTTACCGT
CTTGACGGCG ACGGGACGAT TCTCGACGCG GAGATCGTGC CTCCGACAGC GCAGAATCAG
GCAGCGATCG AAGGAGACGT GCACGACGTT GTCGTCCGTT ACCGCGACCT CGACGATGAG
CAGTTGCGCC ACCTCTGCGA GCAAGCGATT CGCAACTACG ACCCGTGCAT TTCGTGCGCT
ACTCACTTCC TCCGGCTGGA GGTGAACCGG CGATGA
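
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks need to be removed before any computation. The following is a minimal sketch of that cleanup together with a GC fraction calculation. The helper names are hypothetical, and it is an assumption that the GC content field of the record was computed over this gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGTCCGCTG AACGTCGGGT CCTGCGAACT GACTACCTTG CCCGGGTCGA GGGCGAAGGG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC share of this first line only
```

Passing the full 1296 bp sequence through the same two functions gives a value that can be compared with the GC content field.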
 
Protein sequence
MSAERRVLRT DYLARVEGEG ALFVECDGDV VTKVELRIFE PPRFFEALLR GRSCFEAPDI 
TARICGICPV AYQTSAVNAV ESLAGVDVPE SIHHLRRLLY CGEWIESHAL HVYLLHAPDF
LGYPDAITLA RDYPDIVQRG LQLKAAGNHL MRVLGGREIH PINVRVGGFY RVPSASELRA
LRPELEQARE IAVETVRWVS GFSFPERVFE GALVALHQPD SYAIERGRIR SDTGLDIDAS
RYDDYFEEEQ VGHSTALHSR LRGAGRYLCG PLARYSLNYR QLSPLAKECA REAGLGEVCR
DVFRSIVVRS VELVYACDEA LRLIDIYERP EIPAVPVVVR PGTGHGVSEA PRGLLYHRYR
LDGDGTILDA EIVPPTAQNQ AAIEGDVHDV VVRYRDLDDE QLRHLCEQAI RNYDPCISCA
THFLRLEVNR R
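
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of relying on a library. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which codons may act as start codons), so the standard assignments are used here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGTCCGCTG AACGTCGGGT CCTGCGAACT"))  # MSAERRVLRT
print(translate("CGGCGATGA"))                         # RR*
```

The first ten residues match the start of the listed protein sequence, and the final codon TGA is the stop codon that follows the last two arginines.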