Gene Mlab_0143 details

Gene Information

Locus tag: Mlab_0143
Symbol:
ID: 4795435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 130448
End bp: 131443
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 54%
IMG OID: 640098790
Product: phosphoribosylformylglycinamidine cyclo-ligase
Protein accession: YP_001029587
Protein GI: 124484971
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase
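
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 130448
end_bp = 131443

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 996 bp
print(protein_length, "aa")   # 331 aa
```

Both results match the Gene Length and Protein Length fields listed in the record.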


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.225095
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAAAA AATATTCGTA CAAAGAAGCC GGGGTCGACA TCAATCTTGA GGCCGACGGG 
GTCAAAGCGC TGATCCATCA GTTAAGCTAC AAACGCACCG GAGAACACGG TATGGTCGGC
GATGTCGGCC ACTTTGCCGG AATGATCGAT TTCGGCAGCA AAGTCCTTTC GCTGTGTACT
GACGGCGTCG GGACCAAGAT GCGGATCGCC GACGATCTGA AGGACTGGAC AACGGTCGGT
ATCGACTGTA TGGCGATGAA CGTGAACGAC ATGTATGTGA TGAACATCGA GCCGATCGCA
TTCGTCGATT ACATCGCGAC GGAAGGCATC AATACCGAAC AGATGATCCA GATCGGCGTA
GGCCTTAATG AAGGAGCACG CCTTTCCAAC CTTAACATCG TCGGCGGCGA GACGGCGACG
TTGAAGGGAA TGATCAACGG GCTCGATCTT GCCGGGACCT GTCTTGGCGT TCAGGACAAG
GATAAGGTCG TCACGGGCGA AAAAGTCGCA CCGGGCGATA TCATCATTGG TGTCGCAAGT
ACCGGCGTCC ACTCAAACGG CTATACGCTG GCACGTAAAG TCGCCGAAGA AAACGGCGGC
TATGCGACGG TCCTTCTGTC CGGCAGAACC GTCGGCGAGG CATTGCTCGT TCCGACACGG
ATCTACTCGG AAGTCCTCGA CGTCTGCAGC AAAGCAGTTG TACACGGGAT GTGCCATGTG
ACCGGTGGCG GCCTTTTGAA TTTCCTGAGA ATTTCCAGCT ACGGTTTTGC GATCGAAGAC
CCGCTTGAAG TCCCGGAGAT TTTAGCCTGG ATCGCCGAGA AGGGCAATCT CGAGATGAAC
GAACTTTACC GGACGTTCAA TATGGGTATG GGTTTTGCGT TCATCGTGCC AAAAGAGAGC
GTGGAGACCG TTCTTTCCAT GGTCGAAGGA TCAAAAGTCG TTGGCCGTGT GATCGAAGAA
CACAAAGTCA CGCTGAAAGG CGTGGAAGTC TACTAA
 
Protein sequence
MTKKYSYKEA GVDINLEADG VKALIHQLSY KRTGEHGMVG DVGHFAGMID FGSKVLSLCT 
DGVGTKMRIA DDLKDWTTVG IDCMAMNVND MYVMNIEPIA FVDYIATEGI NTEQMIQIGV
GLNEGARLSN LNIVGGETAT LKGMINGLDL AGTCLGVQDK DKVVTGEKVA PGDIIIGVAS
TGVHSNGYTL ARKVAEENGG YATVLLSGRT VGEALLVPTR IYSEVLDVCS KAVVHGMCHV
TGGGLLNFLR ISSYGFAIED PLEVPEILAW IAEKGNLEMN ELYRTFNMGM GFAFIVPKES
VETVLSMVEG SKVVGRVIEE HKVTLKGVEV Y
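
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the database's own method. It assumes that the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in which start codons they permit), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Whitespace in the 10-base blocks is stripped before processing.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_blocks = "ATGACAAAAA AATATTCGTA CAAAGAAGCC"
print(translate(first_blocks))  # MTKKYSYKEA
```

Passing the full gene sequence to `translate` and `gc_fraction` would let the listed protein sequence and GC content be compared against recomputed values.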