Gene Athe_1086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1086 
Symbol 
ID7409643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1180614 
End bp1181693 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content41% 
IMG OID643715452 
ProductNADH-ubiquinone oxidoreductase chain 49kDa 
Protein accessionYP_002572960 
Protein GI222529078 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00746179 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGGGAAGA GAACGATAGT TCCGTTTGGT CCACAGCATC CTGTTTTGCC GGAACCTCTG 
CAGCTTAGGC TTGTTTTAGA GGACGAAAAA GTTGTTGAAG CGATACCTGC AATAGGTTAT
GTTCACAGAG GACTTGAAAA GCTTGCTGAG GAAAAGGATA TAAACCAGAA CATATATGTA
GTCGAAAGAG TTTGTGGTAT ATGCAGTTTT CAACAAGCTT TGGCTTACTG TCAGGGAATT
GAAGAGCTAA TGGGCATTGA AGTGCCTGAC AGAGCAAGGT ATTTACGAGT TATTTGGGCA
GAACTTCACA GACTTCACAG CCACCATTTG TGGCTTGGGT TATTGGCTGA CGCTTTTGGT
TTTGAAAGTC TTTTTATGCA ATGCTGGAGA AATAGAGAGC TTGTGATGGA CCTTATGGAA
GCAACAGCAG GTTCAAGAGT TATAATTTCC ACAAATATAA TTGGTGGGGT TAGAAGAGAT
ATAGATGCTG ATAAACAAAA GTTCATTTTA GATAATCTTG CAAAACTGGA AGAGGAGCTA
AAGAAGATAG AAGGTTCGTT TTTGAACGAT TATACAGTCA AGAAAAGACT TGTAGGTGTA
GGTGTTCTGA GCAAACAAGA AGCTTACGAG CTTGGATGCG TTGGACCTAT GGCAAGGGCA
AGTGGAATTA GTATGGATTT GCGAACCCTT GGATATGCAG CATATGGCGA ACTTGACTTT
GAACCTGTCG TGGAAAATGA CGGGGACTGC TATGCAAGAC TTAAAGTAAG ACTTCGCGAG
TGCTATCAGT CAATTGACCT TATTCGTCAG GCTATATCTA AGATGCCAGA AGGTGAGATT
TCAACACCAG TCAAAGGATT TCCAAACGGT GAGGTTATTT CAAGGGTTGA ACAGCCACGA
GGAGAAGATG TATATTATAT AAAGGCAAAC GGGACAAAGA ATTTAGAAAG GCTTAGAATT
AGAACACCAA CATTTGCAAA TATTCCTGCG CTTGTGAAGA TGCTTCAGGG TGTGGATTTT
GCAGATGTTC CAATGCTTGT TTTGACAATT GATCCATGTA TCTCATGTAC TGAAAGGTAA
 
Protein sequence
MGKRTIVPFG PQHPVLPEPL QLRLVLEDEK VVEAIPAIGY VHRGLEKLAE EKDINQNIYV 
VERVCGICSF QQALAYCQGI EELMGIEVPD RARYLRVIWA ELHRLHSHHL WLGLLADAFG
FESLFMQCWR NRELVMDLME ATAGSRVIIS TNIIGGVRRD IDADKQKFIL DNLAKLEEEL
KKIEGSFLND YTVKKRLVGV GVLSKQEAYE LGCVGPMARA SGISMDLRTL GYAAYGELDF
EPVVENDGDC YARLKVRLRE CYQSIDLIRQ AISKMPEGEI STPVKGFPNG EVISRVEQPR
GEDVYYIKAN GTKNLERLRI RTPTFANIPA LVKMLQGVDF ADVPMLVLTI DPCISCTER