Gene EcSMS35_2486 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2486 
SymbolaroC 
ID6144165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2533155 
End bp2534240 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content56% 
IMG OID641617358 
Productchorismate synthase 
Protein accessionYP_001744530 
Protein GI170680943 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.519758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGAA ACACAATTGG ACAACTCTTT CGCGTAACCA CCTTCGGCGA ATCGCACGGG 
CTGGCGCTCG GCTGCATCGT CGATGGTGTT CCGCCAGGCA TTCCGCTGAC GGAAGCGGAC
CTGCAACATG ACCTCGACCG TCGTCGCCCT GGGACATCGC GCTATACCAC CCAGCGTCGT
GAGCCGGATC AGGTCAAAAT TCTCTCCGGT GTTTTTGAAG GCGTTACTAC CGGCACCAGC
ATCGGATTGT TGATCGAAAA TACCGACCAG CGTTCTCAGG ATTATAGTGC GATTAAGGAC
GTTTTCCGTC CAGGTCATGC CGATTACACT TACGAACAAA AATACGGCCT GCGCGATTAT
CGCGGCGGCG GGCGCTCTTC CGCCCGAGAA ACCGCCATGC GCGTGGCGGC AGGGGCGATT
GCCAAAAAAT ATCTCGCCGA GAAATTTGGT ATTGAAATCC GCGGCTGCCT GACCCAGATG
GGTGACATTC CGCTGGAAAT CAAAGACTGG TCGCAGGTCG AGCAAAATCC GTTTTTCTGC
CCGGACCCGG ACAAAATCGA CGCGTTAGAT GAACTGATGC GCGCGCTGAA AAAAGAGGGC
GACTCCATCG GCGCGAAAGT CACCGTTGTT GCCAGTGGCG TCCCCGCCGG ACTTGGCGAG
CCGGTCTTTG ATCGCCTGGA TGCCGACATC GCCCATGCGC TGATGAGCAT CAACGCGGTG
AAAGGCGTAG AAATTGGTGA TGGTTTTGAC GTGGTAGCGC TGCGTGGCAG CCAGAACCGC
GACGAAATCA CCAAAGACGG ATTCCAGAGC AACCATGCGG GCGGCATTCT TGGCGGTATC
AGCAGCGGGC AGCAAATCAT TGCCCATATG GCGCTGAAGC CAACCTCCAG TATTACCGTG
CCGGGGCGCA CCATTAACCG CTTTGGCGAA GAAGTTGAGA TGATCACCAA AGGTCGTCAC
GATCCTTGTG TTGGGATCCG CGCGGTGCCG ATCGCGGAAG CGATGCTAGC GATCGTTTTA
ATGGATCACC TGTTACGGCA ACGGGCGCAA AATGCCGATG TGAAGACTGA TATTCCACGC
TGGTAA
 
Protein sequence
MAGNTIGQLF RVTTFGESHG LALGCIVDGV PPGIPLTEAD LQHDLDRRRP GTSRYTTQRR 
EPDQVKILSG VFEGVTTGTS IGLLIENTDQ RSQDYSAIKD VFRPGHADYT YEQKYGLRDY
RGGGRSSARE TAMRVAAGAI AKKYLAEKFG IEIRGCLTQM GDIPLEIKDW SQVEQNPFFC
PDPDKIDALD ELMRALKKEG DSIGAKVTVV ASGVPAGLGE PVFDRLDADI AHALMSINAV
KGVEIGDGFD VVALRGSQNR DEITKDGFQS NHAGGILGGI SSGQQIIAHM ALKPTSSITV
PGRTINRFGE EVEMITKGRH DPCVGIRAVP IAEAMLAIVL MDHLLRQRAQ NADVKTDIPR
W