Gene Athe_1062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1062 
Symbol 
ID7409619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1156980 
End bp1158203 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content38% 
IMG OID643715428 
ProductMalate dehydrogenase (oxaloacetate-decarboxylating) (NADP(+)) 
Protein accessionYP_002572936 
Protein GI222529054 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00425454 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGTAA AAACGCTTGC ACTGCAACTT CATAGGCAAC ACAGAGGAAA GATTGCCCTC 
AAAAGCAAAG TTAGCGTAGA TAATCAGCAC GACCTTAGTA TATACTATAC ACCCGGTGTT
GCAGAACCTT GCAGAGAGAT TGTAAAGAAT AAGTTTCTTG TGTATGAGTA TACATCTAAA
TCCAACTGGG TTGCAGTGGT AACAAATGGT ACAGCTGTCT TGGGTTTGGG AGACATTGGT
GTGCATGCCT CTTTGCCTGT TATGGAAGGC AAGGCAATTT TGTTTAAACA GTTTGGTGGT
GTTGATGCAT TCCCCATTTG TATAGACTCA AAAGATGTTG AAGAGATTGT AAAGACAGTA
AAGCTTATTG AAACATCTTT TGGTGGAATA AATTTAGAAG ATATAGCTGC ACCAGCATGT
TTTGAGATTG AACAAAAGCT AATTGAAAGT CTTGATATTC CTGTGTTTCA TGACGATCAG
CATGGAACTG CTGTGGTAGC ACTGGCGGCT TTGATAAATT CTCTCAAAAT TGTAAGGAAA
AGAATATCAG ATGTAAAAAT AGTCATAAAC GGTGCTGGTG CTGCTGGAAT TGCAACTGCT
AAGCTTTTAT TAAAGTATGG AGCAAGAAAT ATTGTTATTT GTGATAAATG TGGTGCCATA
TATGAAGGAA GAGAAAAAGA TATGAATAAG TATAAAGAAG AAATAGCAAG AATTACCAAT
AAAGAAGGAA TAAAAGGTTC GCTACACAGA GCAATTGAAG GTGCTGATGT ATTTATTGGT
CTTTCTGTTG CAAATGTCCT GAATGAAGAT GATATTAAAA AGATGTCGAA TGACGCAATT
GTGATGGCAA TGGCAAATCC GATACCAGAG ATTATGCCAG ATATTGCAAA AAAGGCAGGA
GCAAGGATTG TTTGCACAGG AAGGTCTGAT TTTAATAACC AAGTCAATAA CGTATTGGCA
TTTCCAGGTA TTTTTAGAGG AGCACTTGAT GTTATGGCAA CAAGGATAAC AGATGAGATG
AAGATAGCAG CTGCTGAGGC TATAGCTAAG GTTGCAGAGG AGGAACTTTC AGAAGATTAT
GTAATTCCGA AGCCTTTTGA CAAGAAAGTA GCCTTTGAGG TGGCTTTAGC AGTTGCAAAA
AAGGCAGTGG AACAAAAGGT AGCGCGGCTG GATCTGAATG ATGAAGAGCT CAGAACAAGG
ATTTTGTCTA TGTTAAATAT ATAA
 
Protein sequence
MDVKTLALQL HRQHRGKIAL KSKVSVDNQH DLSIYYTPGV AEPCREIVKN KFLVYEYTSK 
SNWVAVVTNG TAVLGLGDIG VHASLPVMEG KAILFKQFGG VDAFPICIDS KDVEEIVKTV
KLIETSFGGI NLEDIAAPAC FEIEQKLIES LDIPVFHDDQ HGTAVVALAA LINSLKIVRK
RISDVKIVIN GAGAAGIATA KLLLKYGARN IVICDKCGAI YEGREKDMNK YKEEIARITN
KEGIKGSLHR AIEGADVFIG LSVANVLNED DIKKMSNDAI VMAMANPIPE IMPDIAKKAG
ARIVCTGRSD FNNQVNNVLA FPGIFRGALD VMATRITDEM KIAAAEAIAK VAEEELSEDY
VIPKPFDKKV AFEVALAVAK KAVEQKVARL DLNDEELRTR ILSMLNI