Gene SNSL254_A2573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2573 
SymbolaroC 
ID6485537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2493511 
End bp2494596 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content59% 
IMG OID642737908 
Productchorismate synthase 
Protein accessionYP_002041649 
Protein GI194444824 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGAA ACACAATTGG ACAACTCTTT CGCGTAACCA CTTTCGGCGA ATCACACGGG 
CTGGCGCTTG GGTGTATCGT CGATGGCGTG CCGCCCGGCA TCCCGTTGAC GGAGGCCGAT
CTGCAACACG ATCTCGACAG ACGCCGCCCC GGCACCTCGC GCTATACTAC CCAGCGCCGC
GAACCGGACC AGGTAAAAAT TCTCTCCGGC GTGTTTGATG GCGTGACGAC CGGCACCAGC
ATTGGCCTAC TGATTGAAAA CACCGATCAG CGCTCGCAGG ACTACAGCGC GATTAAAGAT
GTTTTTCGTC CGGGACACGC GGATTACACC TATGAGCAGA AATACGGCCT GCGCGATTAC
CGTGGCGGTG GACGTTCTTC CGCGCGTGAA ACCGCGATGC GCGTAGCGGC AGGGGCGATC
GCCAAGAAAT ACCTGGCGGA AAAGTTCGGC ATCGAAATCC GCGGCTGCCT GACCCAGATG
GGCGACATTC CGCTGGAGAT TAAAGACTGG CGTCAGGTTG AGCTTAATCC GTTCTTTTGT
CCCGATGCGG ACAAACTTGA CGCGCTGGAC GAACTGATGC GCGCGCTGAA AAAAGAGGGT
GACTCCATCG GCGCGAAAGT GACGGTGATG GCGAGCGGCG TGCCGGCAGG GCTTGGCGAA
CCGGTATTTG ACCGACTGGA TGCGGACATC GCCCATGCGC TGATGAGCAT TAATGCGGTG
AAAGGCGTGG AGATCGGCGA AGGATTTAAC GTGGTGGCGC TGCGCGGCAG CCAGAATCGC
GATGAAATCA CGGCGCAGGG TTTTCAGAGC AACCACGCTG GCGGCATCCT CGGTGGCATC
AGTAGCGGGC AACACATTGT GGCGCATATG GCGCTGAAAC CTACCTCCAG CATTACCGTG
CCGGGACGTA CGATCAACCG GGCAGGTGAA GAAGTCGAAA TGATCACCAA AGGGCGCCAC
GATCCGTGTG TGGGGATTCG CGCAGTGCCG ATCGCAGAAG CCATGCTGGC GATCGTGCTG
ATGGATCACC TGCTGCGCCA TCGGGCACAG AATGCGGATG TAAAGACAGA GATTCCACGC
TGGTAA
 
Protein sequence
MAGNTIGQLF RVTTFGESHG LALGCIVDGV PPGIPLTEAD LQHDLDRRRP GTSRYTTQRR 
EPDQVKILSG VFDGVTTGTS IGLLIENTDQ RSQDYSAIKD VFRPGHADYT YEQKYGLRDY
RGGGRSSARE TAMRVAAGAI AKKYLAEKFG IEIRGCLTQM GDIPLEIKDW RQVELNPFFC
PDADKLDALD ELMRALKKEG DSIGAKVTVM ASGVPAGLGE PVFDRLDADI AHALMSINAV
KGVEIGEGFN VVALRGSQNR DEITAQGFQS NHAGGILGGI SSGQHIVAHM ALKPTSSITV
PGRTINRAGE EVEMITKGRH DPCVGIRAVP IAEAMLAIVL MDHLLRHRAQ NADVKTEIPR
W