Gene Mboo_1564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1564 
Symbol 
ID5410089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1632306 
End bp1633292 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content63% 
IMG OID640868798 
Productchorismate synthase 
Protein accessionYP_001404724 
Protein GI154151106 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCT TTGGAAGCAA TTTCCGCATA ACAACCTTTG GCGAGAGCCA CGGCCCGGCC 
GTGGGCTGCG TTATTGACGG GTGCCCGCCC CGGCTCGCAC TTTCCGCGGA TGACATCCAG
CCGCTGCTGG ACCGGCGCCG CCCGGGCACA TCCCCCCTCT CCTCGGCACG GGACGAGGCA
GACACCGTTG AGATCCTCTC GGGCGTATTT GAGAAGATGA CCACGGGGAC CCCGATTGCG
CTCCTTGTGC GGAACCAGGA TATGCACTCG CGCGATTACG ATACGATAAA AGAGAAGTTC
CGCCCGGGCC ATGCGGATTT CACGTACCAG GCCAAGTACG GTATCCGGGA CTACCGCGGC
GGGGGCAGGA GCTCGGGGCG CGAGACCGTG GGCCGGGTTG CGGCAGGAGC GGTGGCCCTG
AAGTACCTGG CCACAAAAGG GATCGCGGTC CAGGGCCGGA TCGTGGCCGT GCACGGCAAA
ACCGATCCGC AGAATATCGA AAACGAGATC CTCGGGGCAA AGTCTGCCGG TGATTCTGTG
GGCGGGATCG CGGAGATCAC GGCCACCGGC TGCTCAGCGG GCCTGGGTGA TCCCGTATTC
GGGAAACTTG ACGCAGGCAT TGCAGCGGCC ATGATGGGGA TAGGCGCGGT CAAGGGTGTT
GAGATTGGCG ACGGCTTTGC CGTGGCAGAA CGCTTCGGTA GCGAGAACAA CGACCCGATG
ACCGCAGCCG GATTTCAAAG CAACCATGCC GGGGGGATCC TTGGGGGGAT CTCCACAGGA
CAGGACCTCG TGGTGCGCAT CGCGGTAAAA CCCACGCCGT CCATTGCAAA AGTCCAGCAT
ACCCGGGACA TCCACGGGAA CGCAACAACG ATTACGATTG GCGGCCGGCA CGACCCCTGC
ATCGTGCCCC GGATCCTCCC GGTGGCAGAG GCAATGCTCG CCCTCGTTCT CATCGACGCG
GTGCTGGAGC AGGAAAAATA CCGGTAA
 
Protein sequence
MNTFGSNFRI TTFGESHGPA VGCVIDGCPP RLALSADDIQ PLLDRRRPGT SPLSSARDEA 
DTVEILSGVF EKMTTGTPIA LLVRNQDMHS RDYDTIKEKF RPGHADFTYQ AKYGIRDYRG
GGRSSGRETV GRVAAGAVAL KYLATKGIAV QGRIVAVHGK TDPQNIENEI LGAKSAGDSV
GGIAEITATG CSAGLGDPVF GKLDAGIAAA MMGIGAVKGV EIGDGFAVAE RFGSENNDPM
TAAGFQSNHA GGILGGISTG QDLVVRIAVK PTPSIAKVQH TRDIHGNATT ITIGGRHDPC
IVPRILPVAE AMLALVLIDA VLEQEKYR