Gene Mlab_0233 details


Gene Information

Locus tag: Mlab_0233
Symbol:
ID: 4795616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 218346
End bp: 219599
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 54%
IMG OID: 640098879
Product: hypothetical protein
Protein accession: YP_001029676
Protein GI: 124485060
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR01343] homoaconitate hydratase family protein
            [TIGR02086] 3-isopropylmalate dehydratase, large subunit
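
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and do not come from the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 218346
end_bp = 219599

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1254
print(protein_length)  # 417
```

Under these assumptions the results agree with the listed Gene Length and Protein Length.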


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATGG CAGACACACT TTCCGAACGG ATTCTCGGCT CGGCCGAAGG TACGTATGTC 
GACCGGATGG TTGATCGGGC ATTTGCCCAC GACGGGACCG GTGCCCAGGC ACTGGTCGCA
TTCGAAAATT TCCGGATCCA AAACAAGTCC GTTGTCAATC CTGAAAAATT ATCTATAATA
TATGATCACA TCTCGCCGGC GAACAACTCG GTTACGGCCG ATCTTCAGGG AGATCTTCGG
AAGTTTTCCC GACAGAACGG GATGCATTTC CACGAGGTCG GCTGCGGGAT CTGTCATCAG
ATAATGAGCG AAGGAGTCTG CCTACCGGGT GAGATCGTTG TCGGGGCGGA TTCGCATTCC
TGTACTCTCG GGGCACTCGG CGCATTTTCA ACCGGTGTCG GAGCAACCGA TATGGCGGGG
ATCTGGGCGA CCGGCGGGAC CTGGTTTAGA GTTCCCGAAT CTATTAGTAT AGTGCTGTCC
GGCAAACTTT CCGGTCATAC CGAGCCGAAG GATGTCGCAC TATCTTATGT AAAGGCACTT
GGGATGGACG GCGGGACCTA CAAAGCTCTG GAGTTTATCG GTGACGGAGC CGCCGGCATG
CCGGTCGAAG GAAGACTGAC GTTATCTAAC ATGGCCGTCG AGACCGGGGC AAAGACCGGA
TTATTCTATG CTGATGCGTT GACCCGCGAA CATTTGATAA CCTACGGAGC GGACGAGAAA
ACAATTTCTC TGCAGAAACC CGAAGACTGC AGTTATGAAT CCGAGATTTA CCTTGATCTT
GACGATATTG AACCGCTTCT TGCCATACCT CACCGGGTCG ATAACGCAGT ACCCGTTACA
GAGTATTCGG GCACCCAGAT CGATCAGGTA TTTATGGGTA CCTGTACAAA CGGACGGTTT
GAGGATCTCA AACGGTTCGC TGAAATCGTC AGAGGTAAAA AAGTCGCCGT CAGGACGATC
GTTACGCCTG CTTCGAAGGA TGCATATGCG AAGGCTCTGT CGACTGGTGT CCTGTCCGAC
ATACTTGAGG CGGGCTGCGT AATCTGTCCG CCCGGCTGCG GCCCCTGTCT TGGGGCACAT
ATGGGTGTCC TTGGGGGAGG CGAGGTGGGT CTGTCCACAG CGAACCGGAA TTTCAGGAAT
CGGATGGGGG TTGGTGCCGA GTATTATCTC TGTTCTCCGT CGACGGCTGC TGTTAGCGCT
CTTTGCGGCG AGATCAGGTC GCCGGATGAA TGGAAGGGAG GTTTGAACCG ATGA
 
Protein sequence
MKMADTLSER ILGSAEGTYV DRMVDRAFAH DGTGAQALVA FENFRIQNKS VVNPEKLSII 
YDHISPANNS VTADLQGDLR KFSRQNGMHF HEVGCGICHQ IMSEGVCLPG EIVVGADSHS
CTLGALGAFS TGVGATDMAG IWATGGTWFR VPESISIVLS GKLSGHTEPK DVALSYVKAL
GMDGGTYKAL EFIGDGAAGM PVEGRLTLSN MAVETGAKTG LFYADALTRE HLITYGADEK
TISLQKPEDC SYESEIYLDL DDIEPLLAIP HRVDNAVPVT EYSGTQIDQV FMGTCTNGRF
EDLKRFAEIV RGKKVAVRTI VTPASKDAYA KALSTGVLSD ILEAGCVICP PGCGPCLGAH
MGVLGGGEVG LSTANRNFRN RMGVGAEYYL CSPSTAAVSA LCGEIRSPDE WKGGLNR
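
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the sequence is given in the space-separated 10-base blocks used above, and it relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in their permitted start codons). The function names `clean`, `gc_percent`, and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard code assignments,
# which table 11 shares). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_30 = "ATGAAGATGG CAGACACACT TTCCGAACGG"
print(translate(first_30))  # MKMADTLSER
```

The first ten residues produced here match the start of the listed protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.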