Gene SeHA_C2663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2663 
Symbol 
ID6490464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2578191 
End bp2579843 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content59% 
IMG OID642742841 
Productindole-3-pyruvate decarboxylase 
Protein accessionYP_002046474 
Protein GI194448111 
COG category[G] Carbohydrate transport and metabolism
[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes 
TIGRFAM ID[TIGR03393] indolepyruvate decarboxylase, Erwinia family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value0.603651 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAACC CCTATACCGT GGCCGACTAT TTGCTGGACA GACTGGCAGG ATGCGGCATT 
GGCCATCTTT TTGGCGTACC GGGCGATTAT AACTTGCAGT TTCTTGACCA TGTGATTGAC
CACCCGACCC TACGTTGGGT GGGATGCGCC AATGAGCTGA ACGCCGCTTA TGCCGCGGAC
GGCTATGCGC GCATGTCGGG CGCTGGAGCG CTACTCACTA CCTTTGGCGT GGGAGAACTT
AGCGCTATTA ACGGTATCGC GGGCAGTTAC GCGGAATATG TCCCGGTCTT GCATATCGTC
GGCGCGCCCT GTAGCGCTGC GCAGCAGCGA GGCGAATTGA TGCACCACAC CCTCGGCGAC
GGCGATTTTC GTCATTTTTA TCGCATGAGT CAGGCGATAT CCGCTGCCAG CGCAATATTG
GATGAACAGA ACGCCTGTTT CGAAATTGAC CGCGTATTGG GTGAAATGCT TGCCGCACGC
AGGCCAGGAT ACATCATGTT GCCCGCCGAT GTGGCGAAAA AAACGGCCAT CCCGCCTACG
CAGGCGCTGG CGTTGCCCGT GCATGAAGCG CAAAGCGGCG TGGAGACGGC TTTTCGTTAC
CACGCCCGTC AGTGCCTGAT GAACAGTCGG CGCATTGCGC TATTGGCCGA CTTTCTTGCC
GGGCGTTTTG GTTTACGACC ACTGTTGCAA CGCTGGATGG CGGAAACGCC CATCGCTCAT
GCGACACTAC TGATGGGGAA GGGGCTTTTT GATGAACAGC ACCCGAACTT CGTTGGCACC
TATAGCGCTG GCGCCAGCAG CAAAGAAGTG CGTCAGGCCA TAGAGGACGC CGATAGGGTT
ATCTGCGTCG GCACCCGTTT TGTCGATACC CTTACGGCCG GATTCACCCA ACAATTACCG
ACGGAACGCA CGCTGGAGAT TCAGCCTTAC GCGTCGCGCA TCGGCGAAAC CTGGTTCAAC
CTCCCGATGG CGCAGGCGGT GTCTACGCTG CGCGAACTGT GCCTGGAATG CGCTTTTGCG
CCGCCGCCGA CGCGTTCCGC CGGACAGCCA GTGCGGATTG ATAAAGGAGA ACTGACCCAG
GAAAGCTTCT GGCAAACCTT ACAGCAGTAT CTCAAACCCG GCGATATTAT CCTTGTCGAC
CAGGGGACTG CAGCTTTTGG CGCTGCCGCG CTGTCGCTTC CTGACGGCGC GGAAGTTGTG
GTACAGCCGC TGTGGGGGTC TATCGGCTAT TCCTTGCCCG CCGCGTTTGG CGCGCAAACC
GCCTGTCCCG ATCGGCGGGT GATTCTGATT ATCGGCGATG GCGCGGCGCA GCTCACGATT
CAGGAGATGG GCTCGATGTT ACGCGACGGG CAGGCGCCGG TCATCCTGCT GCTCAACAAT
GACGGCTATA CCGTAGAGCG CGCCATTCAC GGCGCGGCCC AGCGGTATAA CGACATCGCG
AGCTGGAACT GGACGCAGAT ACCACCGGCG CTAAACGCGG CGCAACAGGC GGAGTGCTGG
CGGGTGACGC AGGCTATCCA ACTGGCAGAG GTCCTCGAAC GTCTGGCGCG CCCACAACGT
CTGTCATTTA TTGAAGTGAT GTTGCCAAAA GCCGATCTGC CGGAATTACT GCGTACCGTG
ACCCGGGCGC TGGAAGCCCG CAACGGGGGA TAA
 
Protein sequence
MQNPYTVADY LLDRLAGCGI GHLFGVPGDY NLQFLDHVID HPTLRWVGCA NELNAAYAAD 
GYARMSGAGA LLTTFGVGEL SAINGIAGSY AEYVPVLHIV GAPCSAAQQR GELMHHTLGD
GDFRHFYRMS QAISAASAIL DEQNACFEID RVLGEMLAAR RPGYIMLPAD VAKKTAIPPT
QALALPVHEA QSGVETAFRY HARQCLMNSR RIALLADFLA GRFGLRPLLQ RWMAETPIAH
ATLLMGKGLF DEQHPNFVGT YSAGASSKEV RQAIEDADRV ICVGTRFVDT LTAGFTQQLP
TERTLEIQPY ASRIGETWFN LPMAQAVSTL RELCLECAFA PPPTRSAGQP VRIDKGELTQ
ESFWQTLQQY LKPGDIILVD QGTAAFGAAA LSLPDGAEVV VQPLWGSIGY SLPAAFGAQT
ACPDRRVILI IGDGAAQLTI QEMGSMLRDG QAPVILLLNN DGYTVERAIH GAAQRYNDIA
SWNWTQIPPA LNAAQQAECW RVTQAIQLAE VLERLARPQR LSFIEVMLPK ADLPELLRTV
TRALEARNGG