Gene Msed_1873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1873 
Symbol 
ID5104141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1816146 
End bp1817318 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content55% 
IMG OID640507759 
Productchorismate synthase 
Protein accessionYP_001191937 
Protein GI146304621 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.485609 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGGAA ATAGCTTTGG CAAGATCTTT AGGATTACCA CTTTCGGGGA AAGCCACGGG 
CCCGCGGTGG GCGTCGTGGT GGACGGTGTC CCTGCAGGGT TAAGGTTGTC CCAGGAGGAT
CTTGAATTCG AGCTCTCCTT CAGGAGACCA GGGAGACTCT ACGTGTCAGG GAGAAGGGAG
AAGGACGTTC CCGAGATCCT GAGTGGGGTC TATAATGGAA GGACCACTGG GTCTCCCATT
GCCATCGTCG TGAAAAACAC TGATGTTATC TCGTCGTTCT ATGAGGAGGT TAAGGTGAAG
CCAAGGCCTG GACACGCTGA CCTTCCCTTC ATCATGAGGT ATGGTTACGA AAACTGGGAC
TACAGGGGAG GTGGCAGATC GAGCGCCAGG GAGACCGTTG GGAGAGTAGC TGCAGGGGCG
ATAGCCAAGA AGCTGTTAAT GTTCCACGAT ACCTGGATAG CGGGGAGGCT CAAGAGTCTT
GGACCTGTGG ACTCACCGCC CGCGGACTTC CTTCAAATCT TATGTTCCAA GTACAGCCCC
GTGAGGGCTT CCGACCCAGA GACTGAGGCC AAGTTCCAGG AACTGGTGAA ACAGGCAACG
GTTGAGGGAG ATAGTTACGG TGGAGTTGCC GAGATAGTCG TGAAGAACCC ACCTGCAGGG
TTAGGTGAAC CTGTCTTCGA CAAGATCAAG GCTGACCTGG CCAAGGCGAT CCTCTCTATC
CCGGCAGTGA CGGGGTTTGA GTACGGACTG GGCTTTCAAG CTTCAAGAAT GAAGGGAAGC
GAGGCAAACG ACTCCATTGT AAGGAAGGGT GAGAGACTTG GTTGGAGGGA GAACAAGTCT
GGCGGAATCC TGGGAGGAAT AACTACGGGT GAGGATATAG TGGTGAGATG TTCCTTCAAG
CCCACAAGCT CCATAAGGAA ACCGCAGGGG ACGGTGGATC TAAGAACCGG GGAACCGGCA
GAGATCTCTG TCCTGGGAAG GCATGATCCA GCTGTCGCAA TCAGGGGAGT CTCCGTGGCT
GAGTCCATGG TGGCCCTGAC CCTGGTGGAT CACTCCCTGA GGTCTGGGGT CATCCCACCT
GTCAAGCTCG AGGAGAGGCA GGCTGAGGTC ATTGAGGACA GATGGAGGAG GTACATGGAG
GAATGCAGGC CTACGGCGGA ATCTCAGTCG TGA
 
Protein sequence
MPGNSFGKIF RITTFGESHG PAVGVVVDGV PAGLRLSQED LEFELSFRRP GRLYVSGRRE 
KDVPEILSGV YNGRTTGSPI AIVVKNTDVI SSFYEEVKVK PRPGHADLPF IMRYGYENWD
YRGGGRSSAR ETVGRVAAGA IAKKLLMFHD TWIAGRLKSL GPVDSPPADF LQILCSKYSP
VRASDPETEA KFQELVKQAT VEGDSYGGVA EIVVKNPPAG LGEPVFDKIK ADLAKAILSI
PAVTGFEYGL GFQASRMKGS EANDSIVRKG ERLGWRENKS GGILGGITTG EDIVVRCSFK
PTSSIRKPQG TVDLRTGEPA EISVLGRHDP AVAIRGVSVA ESMVALTLVD HSLRSGVIPP
VKLEERQAEV IEDRWRRYME ECRPTAESQS