Gene Acel_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_2022 
Symbol 
ID4486429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2296342 
End bp2297436 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content69% 
IMG OID639730818 
ProductNAD(P) transhydrogenase subunit alpha 
Protein accessionYP_873780 
Protein GI117929229 
COG category[C] Energy production and conversion 
COG ID[COG3288] NAD/NADP transhydrogenase alpha subunit 
TIGRFAM ID[TIGR00561] NAD(P) transhydrogenase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCG TCGCCGTCAG GGAGACTGCC CCCTACGAGC GTCGGGTCGC CGTCGTACCC 
GACACGGTGA CTCGACTTCG ATCCGCCGGA CACACCGTTG CCGTCGAACA GGGGGCCGGC
GAGGCCGCCG GGTATCCCGA CGAGGTGTAC CGCGACGCGG GCGCACAGAT CGTGCAGCGG
GAGGCATTGT CGGACGCAGA CGTCGTCCTT GCCGTGCAAC CGCTGCCAAC AGAAGATGCC
CGGCGATTGC GGGCGGGCTG TCTTGTGCTG AGCTTCCTGC AACCGGCGGC GTACGCAGAG
TTGCTTCACA TCCTCGCCGA GCGCAAGGCA AGCGCGATCT CCTTGGACCG CCTGCCGCGT
ATCTCCCGGG CGCAGAGCAT GGATGCGCTC TCCTCGCAGG CGCTGGTCGC CGGCTACCGG
GCGGCCCTCA TCGCAGCCGA GAAATTGCCG CGGTTCTTCC CCTTGCTCAT GACCGCCGCC
GGGACGGTTC CGCCGGCAAA GGTCCTGGTT CTCGGCGCCG GGGTCGCCGG GCTGCAAGCG
ATCGCCACGG CGCGTCGCCT CGGCGCAGTG GTCTCGGCGT ACGACGTCCG GGCCGCGGCG
GCCGAGGAGG TGCGCAGTCT CGGGGCGACA TTCATCGATC TCGGCTTGGA AACGCTCGAG
GGCGCCGGCG GTTACGCCCG GGAGATGACC GAGGAGCGGG CCGCCAAACA GCGGGAACTG
CTCACCCCTC ATCTGGCCGC GTCGGACGCG GTCATCACGA CCGCGGCGGT TCCCGGCCGG
CGGGCGCCTC TGCTCGTCGA CCGGCGCATG GTCGAGGCGA TGCGGCCCGG CACGGTCATT
GTGGACATCG CGGCGGAATC CGGTGGGAAC GTCGAACTCT CCAAGCCGGG TGAAGAGGTT
CTGCATAACG GCGTGCTCAT TTGGGGCGGC CGGAACGTGC CGAGCGGCAT GCCGTACGAC
GCCAGTCGGC TCTACGCGCG GAATCTCGCG AATTTGCTCG TCATGCTGAC GCGGGACGGG
GAGGTCGTCC TGGATCTCTC CGACGAGATC GTCGCAGCGT CGCTCGTCGT CCACGAAGGG
CAGGTGCGGA CGTGA
 
Protein sequence
MNIVAVRETA PYERRVAVVP DTVTRLRSAG HTVAVEQGAG EAAGYPDEVY RDAGAQIVQR 
EALSDADVVL AVQPLPTEDA RRLRAGCLVL SFLQPAAYAE LLHILAERKA SAISLDRLPR
ISRAQSMDAL SSQALVAGYR AALIAAEKLP RFFPLLMTAA GTVPPAKVLV LGAGVAGLQA
IATARRLGAV VSAYDVRAAA AEEVRSLGAT FIDLGLETLE GAGGYAREMT EERAAKQREL
LTPHLAASDA VITTAAVPGR RAPLLVDRRM VEAMRPGTVI VDIAAESGGN VELSKPGEEV
LHNGVLIWGG RNVPSGMPYD ASRLYARNLA NLLVMLTRDG EVVLDLSDEI VAASLVVHEG
QVRT