Gene Cmaq_0808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0808 
Symbol 
ID5708772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp843446 
End bp844447 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content43% 
IMG OID641275311 
Productchorismate mutase 
Protein accessionYP_001540633 
Protein GI159041381 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0287] Prephenate dehydrogenase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01808] monofunctional chorismate mutase, high GC gram positive type 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0594729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.450237 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTGGC AGTTGAGGAA GAGTATTGAT GAGGTTGATG ATGAAATAAT TAAGTTACTC 
GCCAGGAGGC TAACCATAGC CGAAACCATA GGTGATGTTA AGAGGAAGCT TAATCTACCA
CCCGTGGATC ATGAGAGGGA GAGTGAAGTT ATTGATAGAT GGGTCAGTGG CTTAGTTGAG
GCTGGTTTAG ATGAATTAAC AGCCAGAAGC ATTGCTGAGC TAGTGATAAA GGCATCCACC
AAGAGGCAGA TTAGGAATTG GTTTAACGTT AAAGTCACTA TAGTGGGTTC AGGGAGATTA
GGTAAGACGC TTAAGAGGGC TTTAAGCCAA GTCACTCCAA CAACCTTAAT TAGCATGAGG
GATGAATTAC CTGACTCAGA CATAGTAATA CTTGCCACAA GACCCACTGA GGACTCCATT
AACTACATTA AGAGGAATAG TGAGAGCATA AGGGGTAGGG TGCTCATGGA TTCCTTCTCG
GTTAAGTCAA GGTTATTCAA CATCATTGAG GATGAGTCAA GGGAAGTAGG CTTCAAGTAC
CTGAGCATAC ACCCATTGTT CGGTAGCCTA ACGGATACTT GGGGTGAAGT AGTAGTCCTA
ATACCATCAT TAACAAGTAG GGATTCACTA CCAATGGCTA CTCAGATATT TGAGGCAGCA
GGCTTAAGAA CAGTGGTGTT AAGTGATCCT GATACTCATG ATAAGGTAAT GGCTTACATA
CAGGTTGCCC ACCATTTAAT GCTACTAGCC CTCTATACCA TGCTTAAGGA TGCTGGTAAA
GTAGGTGGGA TTGATGCAAA CCTACTTATG ACCCACAGCT TGAGGTTAAC CATGAAGGCT
ATTGAAAGAA CCCTGGAGCA GCTTGATGTT GTTGAGGAGA TTCAGGAAAT GAATCCATAC
GCCAGTGAAG TTAGGGATAA GATTACCAAG TACATTAACA TTGTTAATTC AGCAGCAGCT
GAAGGTAAAT TAAGTGAGTT AATTGGAGGT GACTTAAAGT GA
 
Protein sequence
MLWQLRKSID EVDDEIIKLL ARRLTIAETI GDVKRKLNLP PVDHERESEV IDRWVSGLVE 
AGLDELTARS IAELVIKAST KRQIRNWFNV KVTIVGSGRL GKTLKRALSQ VTPTTLISMR
DELPDSDIVI LATRPTEDSI NYIKRNSESI RGRVLMDSFS VKSRLFNIIE DESREVGFKY
LSIHPLFGSL TDTWGEVVVL IPSLTSRDSL PMATQIFEAA GLRTVVLSDP DTHDKVMAYI
QVAHHLMLLA LYTMLKDAGK VGGIDANLLM THSLRLTMKA IERTLEQLDV VEEIQEMNPY
ASEVRDKITK YINIVNSAAA EGKLSELIGG DLK