Gene Mchl_3216 details

Gene Information

Locus tag: Mchl_3216
Symbol:
ID: 7117555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 3399286
End bp: 3400386
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 70%
IMG OID: 643525967
Product: chorismate synthase
Protein accession: YP_002421982
Protein GI: 218531166
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
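
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(3399286, 3400386)
print(length)                     # 1101
print(protein_length_aa(length))  # 366
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.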


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.279354
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.10403
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCACA ACACCTTCGG CCACCTGTTC CGCGTCACCA CCTTCGGCGA GAGCCACGGG 
GTGGCGCTCG GCTGCGTGGT GGACGGATGC CCGCCCGGCC TCGCGCTGGA AGCCGAGGAC
ATCCAGGCGG AGCTCGACCG GCGCAAGCCC GGCCAGTCGC GCTTCACCAC GCAGCGGCGC
GAGCCGGATC AGGTGAAGAT CCTGTCCGGC GTGTTCAGCG ACGACCGCAC CGGCGGGCGC
CAGCTCACCA CCGGCACGCC GATCGCGCTG ATGATCGAGA ACACCGATCA GCGCTCGAAA
GACTATTCCG AGATCCGCGA CAGCTACCGC CCCGGCCACG CCGACTTCAC CTACGATGCC
AAGTACGGCA TCCGCGACTA TCGCGGCGGC GGACGCTCCT CCGCCCGCGA GACCGCCGCG
CGGGTCGCGG CCGGCGCGGT GGCGCGCAAG GTCATCCCCG GCATCACCAT CCGCGCTGCC
CTGGTGCAGA TGGGGCCGCA CGCCATCGAC CGCGCGAACT GGGATTGGGA GCAGGTCGGC
CAAAATCCGT TCTTCTGCCC CGACGCGAAG GCGGCGGCGC TCTACGAGAC CTATCTCGAC
GCAATCCGAA AAGACGGCTC CTCGGTCGGC GCGGTGATCG AGGTGGTGGC CGAAGGCGTG
CCGCCCGGGC TCGGCGCACC GATCTACGGC AAGCTCGACG CGGATCTCGC CGCAGCGATG
ATGTCGATCA ATGCGGTCAA GGGCGTGGAG ATCGGCGACG GCTTCGCCGC CGCAGCCCTC
CGCGGCGAGG ACAATGCCGA CGAGATGCGC GCCGGCAATG ACGGCCGCCC GCGCTTCCTC
GCCAACCATG CCGGCGGCAT CCTGGGCGGC ATTTCGTCGG GCGAGCCGGT GGTTGTCCGG
TTTGCCGTGA AGCCGACCTC CTCGATCCTG ACCCCGCGCC AGAGCGTGAA CCGCGACGGG
GCTGAGATCG ACCTCATCAC CAAGGGCCGC CACGACCCCT GCGTCGGCAT CCGCGCCGTC
CCCGTCGCCG AGGCGATGAT GGCCTGCGTG CTGGCCGATC ACACTCTCCG CCATCGCGGG
CAGAACGGCG AGCGCCCGTG A
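
A GC content figure like the one reported for this gene can be recomputed from a sequence in the space-separated layout shown above. This is a minimal sketch, assuming the percentage is simply the share of G and C bases among all bases; how the database rounds the value is not stated here, and `gc_content` is a hypothetical helper name.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example on the last 21 bases of the gene sequence (15 of 21 are G or C).
print(round(gc_content("CAGAACGGCG AGCGCCCGTG A"), 2))  # 71.43
```

Pasting the full gene sequence into `gc_content` gives the value for the whole gene.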
 
Protein sequence
MSHNTFGHLF RVTTFGESHG VALGCVVDGC PPGLALEAED IQAELDRRKP GQSRFTTQRR 
EPDQVKILSG VFSDDRTGGR QLTTGTPIAL MIENTDQRSK DYSEIRDSYR PGHADFTYDA
KYGIRDYRGG GRSSARETAA RVAAGAVARK VIPGITIRAA LVQMGPHAID RANWDWEQVG
QNPFFCPDAK AAALYETYLD AIRKDGSSVG AVIEVVAEGV PPGLGAPIYG KLDADLAAAM
MSINAVKGVE IGDGFAAAAL RGEDNADEMR AGNDGRPRFL ANHAGGILGG ISSGEPVVVR
FAVKPTSSIL TPRQSVNRDG AEIDLITKGR HDPCVGIRAV PVAEAMMACV LADHTLRHRG
QNGERP
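
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do this under stated assumptions: it uses the codon assignments of NCBI translation table 11 (which are the same as the standard code for elongation codons), reads from the first base in frame, and stops at the first stop codon. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks of the listing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence give the first 20 residues of the protein.
first_line = "ATGTCGCACA ACACCTTCGG CCACCTGTTC CGCGTCACCA CCTTCGGCGA GAGCCACGGG"
print(translate(first_line))  # MSHNTFGHLFRVTTFGESHG
```

Translating the final 21 bases (`CAGAACGGCG AGCGCCCGTG A`) the same way yields `QNGERP`, the end of the protein sequence, with the trailing TGA read as the stop codon.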