Gene Mlab_0520 details


Gene Information

Locus tag: Mlab_0520
Symbol:
ID: 4796033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 492670
End bp: 493731
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 59%
IMG OID: 640099178
Product: hypothetical protein
Protein accession: YP_001029961
Protein GI: 124485345
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR03282] putative methanogenesis marker 13 metalloprotein
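
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 492670
END_BP = 493731
GENE_LENGTH_BP = 1062
PROTEIN_LENGTH_AA = 353


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 493731 - 492670 + 1 gives 1062 bp, and 1062 / 3 - 1 gives 353 aa, matching the listed gene and protein lengths.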


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.268419
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.583611
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTACG TTCAGCCGCG TCCAAGCTCG ATTGTTGCCG CCCTCTATAC CCTTCGGGAT 
CTGAACGTCG ATCTCGCGAT ACTTCACGGC CCGTCAGGCT GTTCATTCAA GCACGCCCGA
CTCTTGGAAG AGGACGGGAT CCGCGTTCTG ACGACCTCGC TTGGCGATGA GGAGTTCATC
TTCGGCGGGC AGAAGATCCT CGAAGATGTC CTCCAATATG CTGAAAAGGA GTTTTCTCCC
CGCCGCATCG CCGTTGTCGG GACCTGTGTT TCGATGATCA TCGGCGAGGA TCTCGATGCC
GCGATCGAAG CCTCCGGCAT CACGACCCCT GCGATAGGGG TATCGATCCA CGCAGGATTT
CGCGAGAACA TCGACGGGGT CATCGCCACC CTCGAGCCGG CGGCAAAGAT CGGCTGGATC
TCCGAAGAGG AGTTCGAGCG GCAGAAACTG GTCCTTGCCT CGGCGAACAA AACCGAGCGG
GAACGCGGAG CTGCCTGTAA AACCTACATT GCCCCGTCCC GCGGCGATCT GAAGCACGTT
GCCGCCGCCG AACTCGCAGA GCTACTTCGT TCCGGCAAAA AAGGCATGGC GATCATGAAC
GCAAAGAAGG AGACGGCGTA TATGTTCGCC GATCATCTCT GCGCCGTGCA TGAATGTGCG
CCGGACGCGA ATGTCACCTT TGTCGCAAAC CTCGAAGCCC GCGGTCTGCC GAAAGTGAGA
GGGGACGCCG CCATGATCCT TGCCGAACTC AATGAACGCG GCATCCACCC CGAACTCATC
GGAGCTCTTG ACGAATACGG CGGAAACGGC CCGCGGATCG CAGAAAGGAT CGCGGAAGTC
AAACCGGAAT TCCTCCTGCT CGTCGGTGTC CCCCACGCGG TCTCGCCCGA AGCTCTTGCC
GGGATCAAAG TATTCTCCGT CACAAACGGA CCGCGGCAGG TCCTGCCCTT AAAAGAGCAG
GGGCATGCCC ATGTCATGGT CGAGGTTGAT CTTCATCCAA AGACGCTTGG CGTCCACAAC
ATCGTCGAAA GCGAGTTCGG AGCCGTTCTG CGGAGCATGT GA
 
Protein sequence
MRYVQPRPSS IVAALYTLRD LNVDLAILHG PSGCSFKHAR LLEEDGIRVL TTSLGDEEFI 
FGGQKILEDV LQYAEKEFSP RRIAVVGTCV SMIIGEDLDA AIEASGITTP AIGVSIHAGF
RENIDGVIAT LEPAAKIGWI SEEEFERQKL VLASANKTER ERGAACKTYI APSRGDLKHV
AAAELAELLR SGKKGMAIMN AKKETAYMFA DHLCAVHECA PDANVTFVAN LEARGLPKVR
GDAAMILAEL NERGIHPELI GALDEYGGNG PRIAERIAEV KPEFLLLVGV PHAVSPEALA
GIKVFSVTNG PRQVLPLKEQ GHAHVMVEVD LHPKTLGVHN IVESEFGAVL RSM
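
The sequences in this record are printed in space-separated blocks of ten characters. As a minimal sketch of how the listed GC content and protein sequence relate to the gene sequence, the Python below strips the block spacing, computes a GC fraction, and translates codons. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in permitted start codons), and all function names are hypothetical helpers written for this illustration.

```python
# Standard codon assignments, written in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
excerpt = "ATGCGCTACG TTCAGCCGCG TCCAAGCTCG"
print(translate(excerpt))                   # MRYVQPRPSS
print(round(gc_fraction("ATGCGCTACG"), 2))  # 0.6
```

The translated excerpt matches the first ten residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.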