Gene Mlab_0404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0404 
Symbol 
ID4794583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp382719 
End bp384536 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content52% 
IMG OID640099057 
Producthypothetical protein 
Protein accessionYP_001029847 
Protein GI124485231 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.737872 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0168695 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAA GCTACGCATG TGGGACTTCC CAGAAACCCC TTTTGGGAAG TACCGTAGGA 
GATGTCTTGA ACAGCATTGC GGCTAAATAT CCCCGCAACG ATGCTCTAGT CTCCTTTCAG
CCGAATCCCT ACAGCAAACG GTGTATGTCT GACGCGGCAA ACTACTGCCA CGAAGGGCTC
GTATCCGACG AGCCGAACTC GTATCGCGAG GCCAGACTAA CTCTAAACAC TGCCGGGTCT
AAGGTACTTC GCTATAATTG GACGGAGTTC CTTGCTGAGA CGAATGCGGT GGCCAAAGGT
CTTATGATGC TTGGCGTGGA ACACGGGACG CGTGTTGCCA TCTGGGCAAT GAATTATGCC
GAGTGGATCC TTGTCCAGTT TGCGACCGCC AAGATCGGTG CGGTGATGGT GAACATCAAT
CCTGCCTACC GTACGTTCGA ACTGGAATAC GCCCTCAAGC AGTCGGAAGT GGACACTCTG
ATCCTTCAGG GAAAGTTCAA GACCTCCGAT TACGTCGGTA TGTTCTACGA GGCATGCCCC
GAGGCATTCG AGGCAAAACC CGGCAAGATC CGGTCAGAGA AGTTCCCGTA TTTACGTAAC
GTCGTATTCA TGGGCGAGAT CATCTATAAC GGGATGTACC GCTGGAGCGA ACTTCTCGAG
ATGGGTGAGT ATGTCTCCGA CTTCGAACTG GAAAACCGTG AAGAGTCTGT CTCTTTCGAC
GATGCGCTCA ACATCCAGTA TACGTCCGGA ACGACCGGAT TTCCCAAAGG CGTCGTTTTG
TCCCATCATT CGGTCTTAAA CAACGGACTG TTTATCGGTG ACGGGATGAG TTTTACCGAG
AACGACAAAC TCTGTATTCC GGTTCCGTTC TATCACTGTT TCGGGATGGT CCTCTCGAAT
ATGGCCTGTG TGACGCACGG ATCCACGATG GTCATCCCCG GTCCGTTCTT CGATGCCGAG
GCCGTACTTC AGGCGGTCGA GGCGGAAAAA TGTACGGCTC TTCACGGCGT TCCAACCATG
TTCATCGCAG AACTGGAGCA CCCGAACTTC AACCGGTATG ATCTTTCCAG TCTTAGGACC
GGTATCATGG CAGGATCTCC ATGTCCTATC GAGAAGATGC GGGAGGTCGC TTCAAGGATG
AACATGAAAG ATATCGTTAT CGTTTACGGT CTGACTGAAA CGGCACCGGG TATTACCATG
TCCACCACCT CCGACACGCT TGAAAACCGT GTGGCGACCG TTGGTCGTGC CTTCCCGCAC
ACTGAAATCA AGATCACAGA TCCGAAGACA GGAAGAATCG TCCCTCTCGG CGAGAAAGGC
GAAATATGTG CCCGCGGTTA CATGAAGATG AAATGTTACT ACAATAATCC GAATGCAACC
AAGCAGGTCA TCGACAAAGA CGGCTGGCTC CACTCCGGCG ATCTTGGAAC GATGGATGAG
GAAGGCTATG TCCGGATGGC AGGCCGTCTG AAGGAGATGG TCATTCGCGG CGGAGAAAAC
CTTTATCCAA GAGAGATCGA GGAGTTCTTC CATCTTCACC CGAAGATTTC CGACATCTAT
GTCATCGGAG TTCCGGATGC GAAATACGGC GAGGAACTCT GTGCCTGGGT GAAAGCAGAA
CCTGGAACTA CGATCACGGA AGAAGAGATC AAGGCATTTG CCGACGGGAA GATTGCCCGC
CACAAGATCC CGCGTTATTA CAAGTTTGTG GATTCTTTCC CCATGACCGT TACAGGCAAA
ATCAAGAAAG GCGATATGCA GGAGATATCC ATCGCCGATC TTGGTCTCGC CGACGTTGCG
AAGATAAAAA CCGCGTAA
 
Protein sequence
MSESYACGTS QKPLLGSTVG DVLNSIAAKY PRNDALVSFQ PNPYSKRCMS DAANYCHEGL 
VSDEPNSYRE ARLTLNTAGS KVLRYNWTEF LAETNAVAKG LMMLGVEHGT RVAIWAMNYA
EWILVQFATA KIGAVMVNIN PAYRTFELEY ALKQSEVDTL ILQGKFKTSD YVGMFYEACP
EAFEAKPGKI RSEKFPYLRN VVFMGEIIYN GMYRWSELLE MGEYVSDFEL ENREESVSFD
DALNIQYTSG TTGFPKGVVL SHHSVLNNGL FIGDGMSFTE NDKLCIPVPF YHCFGMVLSN
MACVTHGSTM VIPGPFFDAE AVLQAVEAEK CTALHGVPTM FIAELEHPNF NRYDLSSLRT
GIMAGSPCPI EKMREVASRM NMKDIVIVYG LTETAPGITM STTSDTLENR VATVGRAFPH
TEIKITDPKT GRIVPLGEKG EICARGYMKM KCYYNNPNAT KQVIDKDGWL HSGDLGTMDE
EGYVRMAGRL KEMVIRGGEN LYPREIEEFF HLHPKISDIY VIGVPDAKYG EELCAWVKAE
PGTTITEEEI KAFADGKIAR HKIPRYYKFV DSFPMTVTGK IKKGDMQEIS IADLGLADVA
KIKTA