Gene Mlab_1525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1525 
Symbol 
ID4794539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1557768 
End bp1558913 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content53% 
IMG OID640100211 
Productprotein tyrosine phosphatase 
Protein accessionYP_001030956 
Protein GI124486340 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.469946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.164235 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAATC CGTTCCATGT CATGATCATT CCGACACTCG GCTGCCCCGG GCGCTGCAAA 
TACTGCTGGA GTTCGGACGA AACTTCACCG AGGATGACGC TTGACACGAT AGATGATATT
GTCACGTGGC TCAAACCGCT CGAAGACCAG CGGGTCACAT TTACCTTTCA CGGCGGAGAA
CCCCTGCTTG CCGGAGCCGA GTTTTACCGG CAGGCGTTCA AGAAGATCAC AGAAGGACTC
CCGCATCTCT CGCCCGAGTT TGCCATTCAG ACCAATCTCT GGCTGATGGA TGACGAGCTC
GCCGAAATCT TTGCAGAATA CCAGGTCCCA ATCGGATCTT CCATCGACGG ACCGCAGGAG
CTGACGAATT ATCAGCGGGG CGATGAATAC TTTGAACGCT GCCTCGCCGG CTACAAGATC
GCCGTGGACC ACGGACTTCT GGTCAGGTTC ATCTGTACGT TCACCAACTC TTCCGTTAAG
CAGAAAGAAG CGATCGTGAA CTTTTTCAAA GAACAGGGCT GGGTGATGAA ACTTCATCCG
GCTCTGCCGT CCCTGAAAGG AGAGAATCCG AATGCATGGA CCCTTGCCCC GGAGGAGTAC
GGCGAGTTGC TGGTCTTTCT TCTGGACGAG GCGATCGAAC ATGCAGACGA TCTTGAGATC
ATGAACATCA ATGATCTCTG CAGGTGCGTG TTTACCCGGG CAGGGAGCGT TTGCACCTAT
GCGGATTGTA TGGGAACCAC GTATGCCGTT GGACCGGACG GGGAAATTTA TCCCTGTTAC
CGGTTTATCG GGATGCCGGA ATGGGTAATG GGCCATGTCA GGAATGCTCC GTCAATCGAG
AGCCTGATGG AGAGCCACGC AGGAAAACGG ATGCTGGCGT TCAAGGAATT TGTGGACACG
GCATGTAAAG ATTGCGCCCA TATCACGTAC TGCAGAGGGG GATGTCCATA TAATGCAATA
GCACCGACCG GAGGGTCTCT CGAGGGGGTC GATCCCCACT GTACTGCATA CAAGAGGATC
TTCGATGAGA TCACAACACG GCTGAACGAG GAGATGAATG CAGCGCCGGT GAGCAGAGTT
TCACGAGTGA AGAGGCAGAA AAAGCCAAGC GTTACGAGAC TTATTCAGAA AATCGTTGAG
GAATAG
 
Protein sequence
MKNPFHVMII PTLGCPGRCK YCWSSDETSP RMTLDTIDDI VTWLKPLEDQ RVTFTFHGGE 
PLLAGAEFYR QAFKKITEGL PHLSPEFAIQ TNLWLMDDEL AEIFAEYQVP IGSSIDGPQE
LTNYQRGDEY FERCLAGYKI AVDHGLLVRF ICTFTNSSVK QKEAIVNFFK EQGWVMKLHP
ALPSLKGENP NAWTLAPEEY GELLVFLLDE AIEHADDLEI MNINDLCRCV FTRAGSVCTY
ADCMGTTYAV GPDGEIYPCY RFIGMPEWVM GHVRNAPSIE SLMESHAGKR MLAFKEFVDT
ACKDCAHITY CRGGCPYNAI APTGGSLEGV DPHCTAYKRI FDEITTRLNE EMNAAPVSRV
SRVKRQKKPS VTRLIQKIVE E