Gene SeSA_A2639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A2639 
Symbol 
ID6517801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp2523069 
End bp2524721 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content59% 
IMG OID642747684 
Productindole-3-pyruvate decarboxylase 
Protein accessionYP_002115475 
Protein GI194734253 
COG category[G] Carbohydrate transport and metabolism
[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes 
TIGRFAM ID[TIGR03393] indolepyruvate decarboxylase, Erwinia family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAACC CCTATACCGT GGCCGACTAT TTGCTGGACA GACTGGCAGG ATGCGGCATT 
GGCCATCTTT TTGGCGTACC GGGCGATTAT AACTTGCAGT TTCTTGACCA TGTGATTGAC
CACCCGACCT TGCGTTGGGT GGGATGCGCT AATGAGCTGA ACGCCGCTTA TGCCGCGGAC
GGTTATGCGC GCATGTCGGG CACTGGAGCG CTACTCACCA CCTTTGGCGT GGGAGAACTT
AGCGCTATTA ACGGTATCGC GGGCAGTTAC GCGGAATATG TCCCGGTCTT GCATATCGTC
GGCGCGCCCT GTAGCGCTGC GCAGCAGCGT GGCGAATTGA TGCACCATAC CCTCGGTGAC
GGCGATTTTC GTCATTTTTA TCGCATGAGT CAGGCGATAT CTGTTGCCAG CTCAATATTG
GATGAACAGA ACGCCTGTTT CGAGATTGAC CGTGTATTGG GTGAAATGCT TGCCGCACGC
AGGCCAGGAT ACATCATGTT GCCCGCCGAC GTGGCGAAAA AAACGGCCAT CCCGCCTACG
GAGGCGCTGG CGTTGCCCGT GCATGAAGCG CAAAGCGGTG TGGAGACGGC TTTTCGTTAC
CACGCCCGTC AGTGCCTGAT GAACAGTCGG CGCATTGCGC TATTGGCCGA CTTTCTTGCC
GGGCGTTTTG GTTTACGACC ACTGTTGCAG CGCTGGATGG CGGAAACGCC CATCGCTCAT
GCGACACTAC TGATGGGGAA GGGGCTTTTT GATGAGCAGC ACCCGAACTT CGTTGGCACC
TATAGCGCAG GCGCCAGCAG CAAAGAAGTG CGTCAGGCCA TAGAGGACGC CGATAGGGTT
ATCTGCGTCG GCACCCGTTT TGTCGATACC CTTACGGCCG GATTTACCCA ACAATTGCCG
GCGGAACGCA CGCTGGAGAT TCAGCCTTAC GCGTCGCGCA TCGGCGAAAC CTGGTTCAAC
CTCCCGATGG CGCAGGCGGT GTCTACGCTG CGCGAACTGT GCCTTGAATG CGCTTTTGCG
CCGCCGCCGA CGCGTTCCGC CGGACAGCCA GTGCGGATTG ATAAGGGAGA ACTGACCCAG
GAAAGTTTCT GGCAAACTTT ACAGCAGTAT CTCAAACCCG GCGATATTAT CCTTGTCGAC
CAGGGGACCG CCGCTTTTGG CGCTGCCGCG TTGTCGCTTC CTGACGGTGC GGAAGTTGTG
GTTCAGCCGC TGTGGGGGTC TATCGGCTAT TCCTTGCCCG CCGCGTTTGG CGCGCAAACC
GCCTGCCCCG ATCGGCGGGT GATTCTGGTT ATCGGCGATG GCGCGGCGCA GCTCACGATT
CAGGAGATGG GCTCGATGTT ACGCGACGGG CAGGCGCCGG TCATCCTGCT GCTCAACAAT
GACGGCTATA CCGTAGAGCG CGCCATTCAC GGCGCGGCCC AGCGGTATAA CGACATCGCG
AGCTGGAACT GGACGCAGAT ACCACCGGCG CTAAACGCGG CGCAACAGGC GGAGTGCTGG
CGGGTGACGC AGGCTATCCA ACTGGCGGAG GTCCTCGAAC GGCTGGCGCG CCCACAACGT
CTGTCATTTA TTGAAGTGAT GTTGCCAAAA GCCGATCTGC CGGAATTACT GCGTACCGTG
ACCCGGGCGC TGGAAGCCCG CAACGGGGGA TAA
 
Protein sequence
MQNPYTVADY LLDRLAGCGI GHLFGVPGDY NLQFLDHVID HPTLRWVGCA NELNAAYAAD 
GYARMSGTGA LLTTFGVGEL SAINGIAGSY AEYVPVLHIV GAPCSAAQQR GELMHHTLGD
GDFRHFYRMS QAISVASSIL DEQNACFEID RVLGEMLAAR RPGYIMLPAD VAKKTAIPPT
EALALPVHEA QSGVETAFRY HARQCLMNSR RIALLADFLA GRFGLRPLLQ RWMAETPIAH
ATLLMGKGLF DEQHPNFVGT YSAGASSKEV RQAIEDADRV ICVGTRFVDT LTAGFTQQLP
AERTLEIQPY ASRIGETWFN LPMAQAVSTL RELCLECAFA PPPTRSAGQP VRIDKGELTQ
ESFWQTLQQY LKPGDIILVD QGTAAFGAAA LSLPDGAEVV VQPLWGSIGY SLPAAFGAQT
ACPDRRVILV IGDGAAQLTI QEMGSMLRDG QAPVILLLNN DGYTVERAIH GAAQRYNDIA
SWNWTQIPPA LNAAQQAECW RVTQAIQLAE VLERLARPQR LSFIEVMLPK ADLPELLRTV
TRALEARNGG