Gene Mmar10_0855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_0855 
Symbol 
ID4285856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp946429 
End bp947520 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content67% 
IMG OID638140321 
Productchorismate synthase 
Protein accessionYP_756086 
Protein GI114569406 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACACA ACACGTTCGG ACATCTTTTC CGGGTCACCA CCTGGGGCGA AAGCCATGGC 
CCCTCCATCG GCGCGGTCGT CGATGGTTGC CCGGCCGGCA TTCCGCTGAC CGAAACCGAT
CTTCAGCCCT TCCTCGACCT GCGCCGTCCG GGCACTTCGC GCCATGTCAC GCCGCGCCAG
GAACCCGACC AGGTCCGCAT CCTGTCCGGC ACGTTCGAGG ATGACCGCAC GGACGGGCCG
GTGACGACGG GCGCGCCGAT CAGCCTGATG ATCGAGAATA CCGACCAGCG CTCGAAGGAT
TACAGCGCCA TCCGCGACAA ATGGCGTCCC GGCCATGCCG ACTACACCTA TGACATGAAA
TACGGCATCC GCGATTATCG CGGCGGTGGC CGTTCCTCGG CCCGTGAGAC GGCCATGCGG
GTCGCCGCCG GCGGCATTGC CCGCAAGGTG CTGGGCGACG GCATCTCGAT CCGCGCCGCG
CTGGTGCAAG TGGGTGATCG CGCCATCGAC CGTAGCCGCT GGGACTGGGA CGAGGTGAGC
AACAACCCTT TCTTCTGCCC CGACGCCACA ACCGCCGCCC TGTGGGAAGC GGACATGGAC
GCGCTGCGCC GGGCTGGTTC ATCGACCGGC GCCATCGTCG AAGTCGTGGT CTCCGGCGTC
CCGGTCGGCT GGGGTGCCCC CGTCTATGCC AAGCTCGACA GTGAGCTGGC CGCCGCCATG
ATGACCATCA ATGCGGTCAA GGGCGTCGAG ATCGGGGCCG GGTTCGGCTC GGCCGCGATG
CGCGGTGAAG ACGCCGCGGA CGAGATGCGC ATGGGCGAGG ACGGGCCGGT CTTTTTGTCC
AACCATAATG GCGGCGTGCT GGGCGGCATT TCGACCGGGC AGGACCTGGT GGTCCGCTTT
GCCGTCAAAC CGACCTCCTC GATCACGGTC GAGCGCAACA CGCTGGACCG CAATTTCGAG
GAGACCGTGA TCGAGACCCG CGGCCGCCAT GACCCCTGCG TCGGCATCCG TGCCGTCCCG
GTCGGCGAGG CCATGGCAGC GCTGGTCCTG GCCGACCAGA AGCTGCGCCA TGCGGGCCAA
TCGGCGTACT GA
 
Protein sequence
MSHNTFGHLF RVTTWGESHG PSIGAVVDGC PAGIPLTETD LQPFLDLRRP GTSRHVTPRQ 
EPDQVRILSG TFEDDRTDGP VTTGAPISLM IENTDQRSKD YSAIRDKWRP GHADYTYDMK
YGIRDYRGGG RSSARETAMR VAAGGIARKV LGDGISIRAA LVQVGDRAID RSRWDWDEVS
NNPFFCPDAT TAALWEADMD ALRRAGSSTG AIVEVVVSGV PVGWGAPVYA KLDSELAAAM
MTINAVKGVE IGAGFGSAAM RGEDAADEMR MGEDGPVFLS NHNGGVLGGI STGQDLVVRF
AVKPTSSITV ERNTLDRNFE ETVIETRGRH DPCVGIRAVP VGEAMAALVL ADQKLRHAGQ
SAY