Gene Mlab_0266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0266 
Symbol 
ID4795463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp247968 
End bp249149 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content49% 
IMG OID640098912 
Producthypothetical protein 
Protein accessionYP_001029709 
Protein GI124485093 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.423514 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.267621 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAAA AAATCATCTA TATGGACCAT GCTGCTACGA CTTACACCGC AAAAGAGGTC 
GTAGAGGCAA TGCTACCTTA TTTCTCCGAA CAATTCGGCA ATCCCTCTTC AGTTTATTCT
ATTGGGCAGT CCAACAAGAG TGTCATTGAT CTAGCCCGAA AACAGGTGGC ATCAGCGATC
AATGCTCAGC CTGATGAAAT TTTCTTTACC AACGGCGGGA CCGAATCGGA CAACTGGGCG
CTGAAAGGTG TGGCATTTGC CAATGAAAGA AAAGGAAAAC ATATCATCAC AACAGCGATT
GAGCATCATG CAATTCTGCA TTCCTGTGAA TGGCTCGCAT CACGCGGATT TGAAATAACC
TATCTTCCGG TAGACAAATA CGGTATGGTT TCTCCAGAGG CGGTCGAAAA AGCGATTCGC
CCGGACACGA TTCTCATCTC GGTTATGTAT GCAAACAACG AGGTCGGAAC GATTCAACCG
ATTGCCGAGA TCGGAAAGAT CGCGAGAGAT CATGGAATAT ATTTCCACAC CGACGCGGTT
CAGGCAGTTG GCCATGTTCC AATAGATGTG GTTGCCGAAA ATATTGATCT TCTCTCGCTC
TCCGGTCACA AATTCTACGG ACCGAAAGGT ACCGGAGCCC TTTACATCCG GAAAGGAACG
CGGATCCAGA ACTTTATCCA CGGCGGGGCT CAGGAGAAAA AACGCCGGGC AGGTACTGAA
AATGTTCCCG GCATCGTGGG CCTCGGCGCT GCGGTAGAAC GGGCAATCAA AATGATGCCG
GTGGAGACGA AACGACTTGC ATCTCTGAGT GATGAACTTA CTCGTGAACT TCTCAAAATC
CCGGCAACAC ACTTAAACGG CCATCCGACA AAACGTCTTC CAAACAACAC GAACATTATC
TTCGAGTATA TTGAAGGAGA ATCAATTCTT CTGTTTCTGA ACATGAAGGG CATCTGTGCT
TCGACCGGAA GTGCCTGTAA CTCGGCATCC CTTGAACCAT CTCATGTCTT AACCGCCATG
GGAATATCCC ACGAGATCGC ACACGGGTCC ATCCGGCTTA CCGTCGGCGA ACGGACGACG
GAAGACGATG TGAAATATGT GATCACCGCA CTCACAGAAA CGGTGGCAAA ACTCAGGGCA
ATGTCCCCCC TTACTCCAAA GGAGCTGAGA AATGTACAGT GA
 
Protein sequence
MGKKIIYMDH AATTYTAKEV VEAMLPYFSE QFGNPSSVYS IGQSNKSVID LARKQVASAI 
NAQPDEIFFT NGGTESDNWA LKGVAFANER KGKHIITTAI EHHAILHSCE WLASRGFEIT
YLPVDKYGMV SPEAVEKAIR PDTILISVMY ANNEVGTIQP IAEIGKIARD HGIYFHTDAV
QAVGHVPIDV VAENIDLLSL SGHKFYGPKG TGALYIRKGT RIQNFIHGGA QEKKRRAGTE
NVPGIVGLGA AVERAIKMMP VETKRLASLS DELTRELLKI PATHLNGHPT KRLPNNTNII
FEYIEGESIL LFLNMKGICA STGSACNSAS LEPSHVLTAM GISHEIAHGS IRLTVGERTT
EDDVKYVITA LTETVAKLRA MSPLTPKELR NVQ