Gene SNSL254_A2597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2597 
Symbol 
ID6483044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2519321 
End bp2520973 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content59% 
IMG OID642737930 
Productindole-3-pyruvate decarboxylase 
Protein accessionYP_002041670 
Protein GI194445213 
COG category[G] Carbohydrate transport and metabolism
[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes 
TIGRFAM ID[TIGR03393] indolepyruvate decarboxylase, Erwinia family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.380997 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAACC CCTATACCGT GGCCGACTAT TTGCTGGACA GACTGGCAGG ATGCGGCATT 
GGCCATCTTT TTGGCGTACC GGGCGATTAT AACTTGCAGT TTCTTGACCA TGTGATTGAC
CACCCGACCC TGCGTTGGGT GGGATGCGCC AATGAGCTGA ACGCCGCTTA TGCCGCGGAC
GGCTATGCGC GCATGTCGGG CGCTGGAGCG CTACTCACTA CCTTTGGCGT GGGAGAACTT
AGCGCTATTA ACGGTATCGC GGGCAGTTAC GCGGAATATG TCCCGGTCTT GCATATCGTC
GGCGCGCCCT GTAGCGCTGC GCAGCAGCGA GGCGAATTGA TGCACCACAC CCTCGGTGAC
GGCGATTTTC GTCATTTTTA TCGCATGAGC CAGGCGATAT CCGCTGCCAG CGCAATATTA
GATGAACAGA ACGCCTGTTT CGAGATTGAC CGCGTGTTGG GTGAAATGCT TGCCGCACGC
AGGCCAGGAT ACATCATGTT GCCCGCCGAT GTGGCGAAAA AAACGGCCAT CCCGCCTACG
CAGGCGCTGG CGTTGCCCGT GCATGAAGCG CAAAGCGGCG TGGAGACGGC TTTTCGTTAC
CACGCCCGTC AGTGCCTGAT GAACAGTCGG CGCATTGCGC TATTGGCCGA CTTTCTTGCC
GGGCGTTTTG GTTTACGACC ACTGTTGCAA CGCTGGATGG CGGAAACGCC CATCGCTCAT
GCCACACTAC TGATGGGGAA GGGGCTTTTT GATGAACAGC ACCCGAACTT CGTTGGCACC
TATAGCGCAG GCGCCAGCAG CAAAGAAGTG CGTCAGGCCA TAGAGGACGC CGATAGGGTT
ATCTGCGTCG GCACCCGTTT TGTCGATACC CTTACGGCCG GATTCACCCA ACAATTACCG
GCGGAACGCA CGCTGGAGAT TCAGCCTTAC GCGTCGCGCA TCGGCGAAAC CTGGTTCAAC
CTCCCGATGG CGCAGGCGGT GTCTACGCTG CGCGAACTGT GCCTGGAATG CGCTTTTGCG
CCGCCGCCGA CGCGTTCCGC CGGACAGCCA GTGCGGATTG ATAAGGGAGA ACTGACCCAG
GAAAGCTTCT GGCAAACCTT ACAGCAGTAT CTCAAACCCG GAGATATTAT CCTTGTCGAC
CAGGGGACTG CAGCTTTTGG CGCTGCCGCG CTGTCGCTTC CTGACGGCGC GGAAGTTGTG
GTACAGCCGC TGTGGGGATC TATCGGCTAT TCCTTGCCCG CCGCGTTTGG CGCGCAAACC
GCCTGCCCCG ATCGGCGGGT GATTCTGATT ATTGGCGATG GCGCGGCGCA GCTCACGATT
CAGGAGATGG GCTCGATGTT ACGCGACGGG CAGGCGCCGG TCATCCTGCT GCTCAACAAT
GACGGCTATA CCGTAGAGCG CGCCATTCAC GGCGCGGCCC AGCGGTACAA CGACATCGCG
AGCTGGAACT GGACGCAGAT ACCACCGGCG CTAAACGCGG CGCAACAGGC GGAGTGCTGG
CGGGTGACGC AGGCTATCCA ACTGGCAGAG GTCCTCGAAC GTCTGGCGCG CCCACAACGT
CTGTCATTTA TTGAAGTGAT GTTGCCAAAA GCCGATCTGC CGGAATTACT GCGTACCGTG
ACCCGGGCGC TGGAAGCCCG CAACGGGGGA TAA
 
Protein sequence
MQNPYTVADY LLDRLAGCGI GHLFGVPGDY NLQFLDHVID HPTLRWVGCA NELNAAYAAD 
GYARMSGAGA LLTTFGVGEL SAINGIAGSY AEYVPVLHIV GAPCSAAQQR GELMHHTLGD
GDFRHFYRMS QAISAASAIL DEQNACFEID RVLGEMLAAR RPGYIMLPAD VAKKTAIPPT
QALALPVHEA QSGVETAFRY HARQCLMNSR RIALLADFLA GRFGLRPLLQ RWMAETPIAH
ATLLMGKGLF DEQHPNFVGT YSAGASSKEV RQAIEDADRV ICVGTRFVDT LTAGFTQQLP
AERTLEIQPY ASRIGETWFN LPMAQAVSTL RELCLECAFA PPPTRSAGQP VRIDKGELTQ
ESFWQTLQQY LKPGDIILVD QGTAAFGAAA LSLPDGAEVV VQPLWGSIGY SLPAAFGAQT
ACPDRRVILI IGDGAAQLTI QEMGSMLRDG QAPVILLLNN DGYTVERAIH GAAQRYNDIA
SWNWTQIPPA LNAAQQAECW RVTQAIQLAE VLERLARPQR LSFIEVMLPK ADLPELLRTV
TRALEARNGG