Gene Mbur_1012 details


Gene Information

Locus tag: Mbur_1012
Symbol:
ID: 3998117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcoides burtonii DSM 6242
Kingdom: Archaea
Replicon accession: NC_007955
Strand:
Start bp: 1093889
End bp: 1094989
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 47%
IMG OID: 637958792
Product: chorismate synthase
Protein accession: YP_565701
Protein GI: 91773009
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
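
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the record above.
START_BP = 1093889
END_BP = 1094989


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1101
print(protein_length(length_bp))  # 366
```

Both results agree with the Gene Length and Protein Length fields of the record.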


Plasmid Coverage information

Num covering plasmid clones: 57
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCGGAA ATACCTTTGG CCATTCTTTC AGGATAACAA CCTGGGGAGA ATCACACGGA 
CGTGCATTGG GAGTCGTTAT TGACGGGGTA CCCGCAGGAC TTCCCCTTGA CACAGAGATA
GTACAGAAAG AGCTTGACAG ACGACGTCCC GGCCAAAGCG CTGTATCCAC ACCGCGTTCA
GAGACAGACA AGGTGGAGAT CATTTCAGGG ATATTTGAAG GAAAAACTAC CGGCACACCC
ATTTCCATGA TGGTATGGAA CAAGGATGCT GATTCGAGTT CTTACGACAA TATCAAAGAC
CTCCCCAGAC CGGGGCATGC CGATTACCCA TACATGGAAA AATATGGCAT CCGTGACCAT
CGGGGAGGAG GACGTTCTTC CGCACGTGAG ACCATTGGAA GAGTTGCAGC AGGAGCTGTT
GCAAAAGAGA TACTTTCAAT TTTTGGTATT GATATCATTG CACATGTCAC AGAACTTGGC
GGTATTCGTG CAAAAGAGAT GCCTTTTGAT ACAATAAAGG AACATCTTGA AAAGACACCT
GTCAGATGTG CCGATCTGGA AGCGGCACAA TTGATGCTCG AAAAGGTTGG CAAAGCACGG
GAAGAACATG AAAGCATTGG TGGTGTTGTC GAAATAATAG CTATCGGCCT GCCACCGGGA
ATAGGAGAGC CAGTTTTCGA TAAACTTGAT GCAGATATAG CAAAAGCTAT CATGAGCATC
GGTGCTGTCA AAGGTGTTGA GATAGGGATT GGAAATGAGG CAGCACAGAT GAAGGGAAGC
CAGATGAACG ATCCTTTCAT ACTGGAAGAC GGGAAGATAA TCGCACAGAC CAATAATGCA
GGCGGGATAC TCGGAGGACT TTCCACAGGA ATGCCCATAA TCTGCCGTGC AAGTGTCAAA
CCCACACCAT CCATATCAAA AGTGCAGCAC ACTGTCAATA CAAAAGAGAT GAAGAACAGC
GATATAATCA TCAAAGGCCG CCATGACCCA ACCATCCCGC CACGAATGGT TCCCGTTGCA
GAAGCCATGA TGGCATTGGT ACTTGTCGAC CACATGATAA GAAGCGGTCA TATTCATCCG
AACTCACTTT TGAAACAATG A
 
Protein sequence
MPGNTFGHSF RITTWGESHG RALGVVIDGV PAGLPLDTEI VQKELDRRRP GQSAVSTPRS 
ETDKVEIISG IFEGKTTGTP ISMMVWNKDA DSSSYDNIKD LPRPGHADYP YMEKYGIRDH
RGGGRSSARE TIGRVAAGAV AKEILSIFGI DIIAHVTELG GIRAKEMPFD TIKEHLEKTP
VRCADLEAAQ LMLEKVGKAR EEHESIGGVV EIIAIGLPPG IGEPVFDKLD ADIAKAIMSI
GAVKGVEIGI GNEAAQMKGS QMNDPFILED GKIIAQTNNA GGILGGLSTG MPIICRASVK
PTPSISKVQH TVNTKEMKNS DIIIKGRHDP TIPPRMVPVA EAMMALVLVD HMIRSGHIHP
NSLLKQ
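
The two sequences above can be checked against each other by translating the gene sequence, and the GC content field can be recomputed in the same way. The following is a minimal sketch in plain Python, not the database's own tooling. It assumes the gene sequence is given on the coding strand and uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in which start codons it permits). The helper names `clean`, `gc_content` and `translate` are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCCCGGAA ATACCTTTGG CCATTCTTTC AGGATAACAA CCTGGGGAGA ATCACACGGA"
last_line = "AACTCACTTT TGAAACAATG A"

print(translate(first_line))  # MPGNTFGHSFRITTWGESHG
print(translate(last_line))   # NSLLKQ
```

The two printed fragments match the start and end of the protein sequence. Passing the full 1101 bp gene sequence to `translate` and `gc_content` would check the whole record in the same way.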