Gene Mlab_0388 details

Gene Information

Locus tag: Mlab_0388
Symbol:
ID: 4794706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 364537
End bp: 365880
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 51%
IMG OID: 640099040
Product: hypothetical protein
Protein accession: YP_001029831
Protein GI: 124485215
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
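
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends with a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default, that the
    final codon is a stop codon that does not contribute a residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length(364537, 365880)
print(length)                  # 1344
print(protein_length(length))  # 447
```

Both results agree with the Gene Length (1344 bp) and Protein Length (447 aa) fields.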


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0861429
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCATTG AAGCCCACCG GTTTTTTTTC GACACCGGAA ATACTCTCCC AATCCAGCGC 
AGACTTCTTG CTCTGCAGAA ACTCCGGACA TCCATCGAAA CCCATGAACC GGAAATCACG
GCGGCCTTAT TTTCGGATCT TGGAAAATGT CCCTTCGAGG CATACGCATT CGAGATCGCT
CCTGTCCTGC ACGAAATCGA TTATCTGATC AAGCACACGA ATAAAATCCT GAAACCGGAG
AAAGTACGCT CACCAATGAT GATTTTCCCG GCAAAAACGG TTATCCGTCA CGACCCATTC
GGTCTTGCCC TCCTTTTGTC CCCGTGGAAT TATCCGTTCC ATCTCTTCAT GCTCCCGCTT
GCAGGAATCG TCGCCGGCGG AAACGTCGTG ATCGGAAAAA CATCCCGGAG ATCACCCGAG
ACCGGAAAGA TCATCCGAAC GATCCTTGCC GAAGTATTCC CCGAAGAATG GGTTAGTGTA
GAGGATGAAG TCGATTTAGA TGCGCATTAT GACTACATCT TCTTCACCGG CGGAAAAGAT
ACGGGAAAAA TGATCGCCGA AAAGGCGGCT GCCCACTTAA CACCGGTGAC CCTGGAGCTC
GGCGGGAAAA ATGCCTGTAT CGTCGATGAG ACCGCAGATA TCCCTGTAGC CGCAAAACGG
ATCGCCTGGG GAAAGTTTGC AAATTCCGGA CAGACCTGCA TCGCCCCTGA CTATCTGCTG
GTACATAAAT CCGTTCGGGA TCAACTCGTA AATAAAGTCA AAGAAGAAAT CGTCACCCTC
TACGGAAGTA ATCCGGCAAC GAACAACGAC TACGGCAAGA TCGTCACCAA AGATGCATAC
GACCGTCTGG TCGCGTTCGA AACACCGGAG AACCTCATCT TCCGTGCCGG AGAGCACAAT
CCCGACGGAA GAAAAGTCGC GCCGACGATT CTGTCCGCAG ATATGTCCGA CCCCGTCATG
CAGAACGAGA TCTTCGGACC GATCCTCCCG GTCCTCGCGT GGGAGAGAAA GGATGAACTC
GAACAGCTGA TCAAAAAGGA GCCGCTGGCC CTCTACATCT TCTCCGAAAA CGAAACCTTC
CGCAATCATC TCATCGAACG AAACCCTTCA GGCGGGGTTT GCATCAATGA TGTCATGATG
CAGGTCGCAA ACCAGAACGC TCCTTTCGGC GGGGTTGGAA CCAGCGGGAT GGGAAAATAT
CACGGAAAGG ATTCGCTTGA AACCTACACG CGGAAACGCA CGGTCGTGAT CAAGAAAACA
AGACCCGACC CGAAGATCCG GTATCCTCCG TACACCGAAA AAACCCTGAA CATGGTCAAA
AAGTGGAGAA AACTGCTGTT TTAG
 
Protein sequence
MSIEAHRFFF DTGNTLPIQR RLLALQKLRT SIETHEPEIT AALFSDLGKC PFEAYAFEIA 
PVLHEIDYLI KHTNKILKPE KVRSPMMIFP AKTVIRHDPF GLALLLSPWN YPFHLFMLPL
AGIVAGGNVV IGKTSRRSPE TGKIIRTILA EVFPEEWVSV EDEVDLDAHY DYIFFTGGKD
TGKMIAEKAA AHLTPVTLEL GGKNACIVDE TADIPVAAKR IAWGKFANSG QTCIAPDYLL
VHKSVRDQLV NKVKEEIVTL YGSNPATNND YGKIVTKDAY DRLVAFETPE NLIFRAGEHN
PDGRKVAPTI LSADMSDPVM QNEIFGPILP VLAWERKDEL EQLIKKEPLA LYIFSENETF
RNHLIERNPS GGVCINDVMM QVANQNAPFG GVGTSGMGKY HGKDSLETYT RKRTVVIKKT
RPDPKIRYPP YTEKTLNMVK KWRKLLF
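
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of how one might do that and recompute the GC fraction of the gene sequence. The helper names are hypothetical and not part of the original record, and the record's reported 51% is not re-derived here.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from the record, as an illustration.
first_line = (
    "ATGTCCATTG AAGCCCACCG GTTTTTTTTC GACACCGGAA ATACTCTCCC AATCCAGCGC"
)
print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line) * 100), "% GC in this line")
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.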