Gene Mlab_0603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0603 
Symbol 
ID4795341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp573235 
End bp574803 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content50% 
IMG OID640099261 
Product2-isopropylmalate synthase 
Protein accessionYP_001030044 
Protein GI124485428 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0910176 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATCGG AGCAGTCACT GCGAAAGTGG CGGATTGCAT TCTATTGCGA TAAAAGTACA 
GTCAACACGA ACAGAAGAGT GACTATTTTA GACACCACAC TTCGTGATGG TGAACAGACT
CCCGGCGTGT CATTCACACT GGAGCAGAAA ATTGAAATTG CGCATCAGCT GTCCGATATC
GGCGTTGATG TGATTGAAGC AGGCTTTCCG GCATCTTCGG ATGTCGAATT TGAAACGGTC
AAGAGGATTT GCGCTGAAGA GGGTATCCGC CCCAAGATCT GCGGACTCGC ACGCTCTGTC
AAAGCGGATG TAGACCGTTG TATCGAGGCC GGCGTTGATA TGGTCCATGT GTTTATTCCC
ACATCTGAGA TTCAGAGAAC CTATACCATC AAAAAAAGTC ATGCAGAAGT CCTGGCGATC
ACCCGGGAGA TCATTACCTA TGCACGTTCG AAGTGCGATT ATGTGATGTT CTCGCCGATG
GATGCAACCA GAACTGAACC GTCGGAACTG ATTGAAATCT GCAAAGCAGC AGATGAAGCC
GGCACGACGA TAATCAACAT TCCCGACACA GTCGGTGTAA GCACCCCTTC GATGATAAAA
CCCCTCATCG CAATGATTCG CGAAAACGTC AAATGCAAAA TCGATGTGCA TTGTCACAAC
GATTTCGGCC TGGCGACCGC AAACACCATT GCAGCAGTTG AAGGAGGAGC TGACCAGATT
CAGGTCACCG TAAACGGTAT CGGCGAACGG GCAGGTAACG CGGATTTAGC CCAGACGGTT
ATGATCCTGA AATCGATCTA TGGAATCGAA ACCAATATTC GGACCGAAAA ACTGGTGGAG
ACTTCAAGAA TGGTTTCCAG ATTTTCACAG ATTGCCGTTC TGCCGATCCA GCCGGTCGTA
GGCGAAAATG CATTTTCGCA TGAAAGCGGC ATCCATTCCC ACGGCGTAAT GGCTAACCCC
GGAACATTCG AGCCGGGAAT CATGACGCCC GAAATGGTGG GTCACCGCCG GCGCTTAAAG
CTTGGAAAGC ATGTTGGAAA ACATGCGGTC CGGCAGATGC TGGAGGACAT AAACGTAAAT
CCGTCCGACC CCGAACTGGA CATGATCGTC GCTAAAGTCA AGGAGATCTC CGGACGCGGG
CGAAAAGTAA CCGAGTTCGA TCTCTTCGAG ATCGCAAAAA TCATCACCGG CAGTCATAAC
GACAAAAAAA TGATCGAGCT GGATGATATT TCGGTATTTA CCGGGAGCCA TGCGATTCCG
ACTGCAAGCG TACAGGCCGT AGTCCACGGT GAAAACAAGA TCTGTTCCAA AACGGGAGAC
GGACCGGTGG ATGCAGCAAT GAAGGCGCTT TTAGCGATCG CACCTGGAAA AGTTCAGTTG
AAGAGCTTTC AGATCGAGGC GATCTCCGGT GGAAGCGATG CGCTCGGATG CGTCACTATC
GAAGTCGAGG ATGAAAAAGG CAGGATCTTT GATGCAGCCT CGTCAAACAG TGACATCGTG
ATTGCATCGG CTGAAGCGAT GGTGAATGCC CTCAATGTCG TTTACCGCTC GGGTGGTTTC
GACAGATAA
 
Protein sequence
MGSEQSLRKW RIAFYCDKST VNTNRRVTIL DTTLRDGEQT PGVSFTLEQK IEIAHQLSDI 
GVDVIEAGFP ASSDVEFETV KRICAEEGIR PKICGLARSV KADVDRCIEA GVDMVHVFIP
TSEIQRTYTI KKSHAEVLAI TREIITYARS KCDYVMFSPM DATRTEPSEL IEICKAADEA
GTTIINIPDT VGVSTPSMIK PLIAMIRENV KCKIDVHCHN DFGLATANTI AAVEGGADQI
QVTVNGIGER AGNADLAQTV MILKSIYGIE TNIRTEKLVE TSRMVSRFSQ IAVLPIQPVV
GENAFSHESG IHSHGVMANP GTFEPGIMTP EMVGHRRRLK LGKHVGKHAV RQMLEDINVN
PSDPELDMIV AKVKEISGRG RKVTEFDLFE IAKIITGSHN DKKMIELDDI SVFTGSHAIP
TASVQAVVHG ENKICSKTGD GPVDAAMKAL LAIAPGKVQL KSFQIEAISG GSDALGCVTI
EVEDEKGRIF DAASSNSDIV IASAEAMVNA LNVVYRSGGF DR