Gene Mlab_1356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1356 
SymbolglyA 
ID4794899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1377529 
End bp1378779 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content53% 
IMG OID640100038 
Productserine hydroxymethyltransferase 
Protein accessionYP_001030790 
Protein GI124486174 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0821973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000382903 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTTCCC TGGCACTATT TGACCCGGAA ATTTTCCAGC TGATAAACAA AGAACACAAG 
CGTCAGGTCG AGGGGCTTGA GCTTATCGCT TCGGAAAATG TCGTCGCACG CGAAGTAATG
GAGGCAATGG GAACCATTCT GACCAATAAA TATGCAGAAG GTTATCCGGG AAAGCGTTAC
TACGGCGGCT GCGAATTTCA TGATCAGATC GAAAACCTCG CACGCGACAG ACTCTGCCAA
CTTTTCGGTG CAGAACACGC AAACGTCCAG CCCCACTCGG GAAGCCAGGC GAATGAAGCA
GTATATCTCT CATGTCTGAA ACCGGGAGAC AAGATCCTCA GTCAGAGCCT GAATAACGGC
GGTCACTTGT CCCACGGCGA CCCGGCAAAC ATGTCCGGAA AATGCTTCGA CATCTCCTTT
TACGGTGTCG ACTTCGATAC CGAACGTCTC GACTACGGCG TGATCGAAGA GCTTGCACGG
AAAAACAAGC CTGACCTCAT CGTCTGCGGA GCATCCGCAT ACCCGCGTGA GATCGATTTC
AAAGCATTCG CAGAGATCGC AGAAGACGTC GGCGCCAGAT CGATGGCGGA CATCGCCCAT
ATCTCCGGAC TCTGCTGCAC CGGACTCCAC AACTCTCCGG TCGGCGTTAC CACCTACACC
ACCTCGACAA CGCATAAAAC CCTCCGCGGA CCCCGCGGCG GCGTGATCAT GTGCAATAAA
GAGTATGCAA ACTCTATCGA CAAGGCGGTT TTCCCGGGAA TGCAGGGCGG ACCCCTTATG
CACGTGATCG CCGCAAAGGC CGTTTGTTTC CGTGAAGCGC TCACCGACGA CTACAAAGAA
TACGCTAAAC AGGTCGTCAA AAACTGCAAA GTGCTTGCAG CAACTCTTGA AGACAATAAC
TTCCGTCTCG TTTCCGGCGG AACCGACAAC CACCTCTGTC TGCTTGACCT TTCCGACCAC
AACATATCCG GCCAGCAGGC GGAAGTCGCT CTTGGAAAAG CAGGCATCAC CGTTAACAAA
AACACGATCC CAAGACAGGC TCTCTCTCCC TTCGAGACCT CGGGTATCCG GATCGGGACG
CCGACCATCA CGACCCGCGG TATGAAGGAA GAACAGTGTA AACAGATCGG AGACTGGATC
GCCAAGGTCT TAAACCACAT CGATGACGAG AAGACGATTG CCGGAGTCAA AGACGAAGTT
ACGTCGCTTT GCCTGAAGTA TCCTCTGTAT CCGGAAATTC GGACCTTATA A
 
Protein sequence
MSSLALFDPE IFQLINKEHK RQVEGLELIA SENVVAREVM EAMGTILTNK YAEGYPGKRY 
YGGCEFHDQI ENLARDRLCQ LFGAEHANVQ PHSGSQANEA VYLSCLKPGD KILSQSLNNG
GHLSHGDPAN MSGKCFDISF YGVDFDTERL DYGVIEELAR KNKPDLIVCG ASAYPREIDF
KAFAEIAEDV GARSMADIAH ISGLCCTGLH NSPVGVTTYT TSTTHKTLRG PRGGVIMCNK
EYANSIDKAV FPGMQGGPLM HVIAAKAVCF REALTDDYKE YAKQVVKNCK VLAATLEDNN
FRLVSGGTDN HLCLLDLSDH NISGQQAEVA LGKAGITVNK NTIPRQALSP FETSGIRIGT
PTITTRGMKE EQCKQIGDWI AKVLNHIDDE KTIAGVKDEV TSLCLKYPLY PEIRTL