Gene Mlab_1649 details


Gene Information

Locus tag: Mlab_1649
Symbol:
ID: 4795469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 1676925
End bp: 1677998
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 56%
IMG OID: 640100334
Product: chorismate synthase
Protein accession: YP_001031077
Protein GI: 124486461
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
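
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. Neither assumption is stated in the record. The variable names are illustrative only.

```python
# Values copied from the gene information above.
start_bp = 1676925
end_bp = 1677998
reported_gene_length = 1074      # bp
reported_protein_length = 357    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length matches:", gene_length == reported_gene_length)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions, all three checks agree with the reported values.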


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0108012
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCGAA TCGGAGAATC AATAACGCTC ACGTTGTTTG GGGCGAGTCA CGACAGTCGT 
ATCGGCTGTG TTATAGACGG AATTCCTCCG GGCTATCCTG TAAACGTTGA GAGTATCACT
GCAGATCTCG AGCTTAGAAA ACCATCCGCC GGTATCGGAA CTCCGCGGGT GGAGGCGGAT
GTTCCTGAGA TCTCCGGTAT TGTGGATGGA ATCACTACGG GCTGTCCGGT CGTGATTACT
TTTTCAAACA GCAATACCCG GAGTTCGGAT TATGAACAAC TCCGCCGCAT CCCCCGTCCG
GGCCATGCCG ATTACCCTGC GGTATCAAAA TTCGGTCCGG CTCATGACAT CCGTGGCGGC
GGGATGTTTT CCGGCAGAAT GACGACTCCC CTTGTTGCGG CTGGTGCTCT CCTCCGTGAT
CTGATCGGCA GTTTGGGAAT CTCCGTCGGC TCGTATGTTA CCCGGATCGG CAGCGTCGTC
GATACAAATA CCTACGATCC TGCCGATGTG CTGACGAGAT CGCGGACAAA TCCGCTTCGT
GCCATGTCTT CGGGCATCGA GGATCGGATG AGAGCCGAGA TCCTCGTGGC GAAATCGGAT
GGAGACAGTG TCGGCGGGAT CGTTCGGTGC TTTGCGACAG GTCTTCCGGC TGGTCTTGGA
GAGCCTTTCT TCGACACGCT CGACGGCGAG ATATCTAAAG CGGTTTTCGC CATTCCCGGC
GTGAAAGCCA TCGGATTTGG CGAGGGGTTC GCCGCCGCTG GCCTTCGCGG ATCCGAAAAC
AACGATGCCT ACCGTATTCA AAATGGGTCT GTCGTCACGC TGACGAATCA TGCGGGCGGC
GTCCTTGGCG GGATGTCGAG CGGCGCTGTT CTGGATTTTT CCGTGGCATT CAAGCCGACC
CCGTCTATTG CAAAACAGCA GATGAGTGTT GATCTGCTGA CCCGCGAAGA CGCCGAACTT
TCAGTGAAAG GACGCCACGA TCCGTGCATT GCGAATCGGG GAGCGATCGT AGTCGAAGCG
ATGACCGTGT TCACGCTTGC GGATCTCGCA CTCAGAGGGG GATTTCTTGT CTGA
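
The GC content reported in the gene information (56%) can in principle be recomputed from the gene sequence above. The following is a minimal sketch. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sketch assumes the sequence is pasted as plain text in blocks of ten bases, as shown here.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage on the first line of the gene sequence; paste the full
# sequence instead to compare against the reported value.
first_line = "ATGAATCGAA TCGGAGAATC AATAACGCTC ACGTTGTTTG GGGCGAGTCA CGACAGTCGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```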
 
Protein sequence
MNRIGESITL TLFGASHDSR IGCVIDGIPP GYPVNVESIT ADLELRKPSA GIGTPRVEAD 
VPEISGIVDG ITTGCPVVIT FSNSNTRSSD YEQLRRIPRP GHADYPAVSK FGPAHDIRGG
GMFSGRMTTP LVAAGALLRD LIGSLGISVG SYVTRIGSVV DTNTYDPADV LTRSRTNPLR
AMSSGIEDRM RAEILVAKSD GDSVGGIVRC FATGLPAGLG EPFFDTLDGE ISKAVFAIPG
VKAIGFGEGF AAAGLRGSEN NDAYRIQNGS VVTLTNHAGG VLGGMSSGAV LDFSVAFKPT
PSIAKQQMSV DLLTREDAEL SVKGRHDPCI ANRGAIVVEA MTVFTLADLA LRGGFLV
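
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The sketch below builds the codon table in the usual TCAG ordering. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the set of permitted start codons, so this table applies to internal codons. The function names are hypothetical, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate a cleaned DNA string codon by codon, keeping '*' for stops."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 15 bases of the gene sequence above.
head = clean_sequence("ATGAATCGAA TCGGAGAATC AATAACGCTC")
tail = clean_sequence("GATTTCTTGT CTGA")
print(translate(head))        # MNRIGESITL
print(translate("G" + tail))  # GFLV*
```

The first ten codons give MNRIGESITL, matching the start of the protein sequence. The final codons give GFLV followed by a stop (TGA), matching its end.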