Gene Information |
Locus tag | SeHA_C2663 |
Symbol | |
ID | 6490464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 2578191 |
End bp | 2579843 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642742841 |
Product | indole-3-pyruvate decarboxylase |
Protein accession | YP_002046474 |
Protein GI | 194448111 |
COG category | [G] Carbohydrate transport and metabolism; [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes |
TIGRFAM ID | [TIGR03393] indolepyruvate decarboxylase, Erwinia family |
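The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names `gene_length` and `protein_length` are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 2578191, End bp 2579843.
length = gene_length(2578191, 2579843)
print(length)                  # 1653
print(protein_length(length))  # 550
```

The strand ("-") does not affect these lengths; it only means the gene sequence is read from the reverse complement of the replicon.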
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 0.603651 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAACC CCTATACCGT GGCCGACTAT TTGCTGGACA GACTGGCAGG ATGCGGCATT GGCCATCTTT TTGGCGTACC GGGCGATTAT AACTTGCAGT TTCTTGACCA TGTGATTGAC CACCCGACCC TACGTTGGGT GGGATGCGCC AATGAGCTGA ACGCCGCTTA TGCCGCGGAC GGCTATGCGC GCATGTCGGG CGCTGGAGCG CTACTCACTA CCTTTGGCGT GGGAGAACTT AGCGCTATTA ACGGTATCGC GGGCAGTTAC GCGGAATATG TCCCGGTCTT GCATATCGTC GGCGCGCCCT GTAGCGCTGC GCAGCAGCGA GGCGAATTGA TGCACCACAC CCTCGGCGAC GGCGATTTTC GTCATTTTTA TCGCATGAGT CAGGCGATAT CCGCTGCCAG CGCAATATTG GATGAACAGA ACGCCTGTTT CGAAATTGAC CGCGTATTGG GTGAAATGCT TGCCGCACGC AGGCCAGGAT ACATCATGTT GCCCGCCGAT GTGGCGAAAA AAACGGCCAT CCCGCCTACG CAGGCGCTGG CGTTGCCCGT GCATGAAGCG CAAAGCGGCG TGGAGACGGC TTTTCGTTAC CACGCCCGTC AGTGCCTGAT GAACAGTCGG CGCATTGCGC TATTGGCCGA CTTTCTTGCC GGGCGTTTTG GTTTACGACC ACTGTTGCAA CGCTGGATGG CGGAAACGCC CATCGCTCAT GCGACACTAC TGATGGGGAA GGGGCTTTTT GATGAACAGC ACCCGAACTT CGTTGGCACC TATAGCGCTG GCGCCAGCAG CAAAGAAGTG CGTCAGGCCA TAGAGGACGC CGATAGGGTT ATCTGCGTCG GCACCCGTTT TGTCGATACC CTTACGGCCG GATTCACCCA ACAATTACCG ACGGAACGCA CGCTGGAGAT TCAGCCTTAC GCGTCGCGCA TCGGCGAAAC CTGGTTCAAC CTCCCGATGG CGCAGGCGGT GTCTACGCTG CGCGAACTGT GCCTGGAATG CGCTTTTGCG CCGCCGCCGA CGCGTTCCGC CGGACAGCCA GTGCGGATTG ATAAAGGAGA ACTGACCCAG GAAAGCTTCT GGCAAACCTT ACAGCAGTAT CTCAAACCCG GCGATATTAT CCTTGTCGAC CAGGGGACTG CAGCTTTTGG CGCTGCCGCG CTGTCGCTTC CTGACGGCGC GGAAGTTGTG GTACAGCCGC TGTGGGGGTC TATCGGCTAT TCCTTGCCCG CCGCGTTTGG CGCGCAAACC GCCTGTCCCG ATCGGCGGGT GATTCTGATT ATCGGCGATG GCGCGGCGCA GCTCACGATT CAGGAGATGG GCTCGATGTT ACGCGACGGG CAGGCGCCGG TCATCCTGCT GCTCAACAAT GACGGCTATA CCGTAGAGCG CGCCATTCAC GGCGCGGCCC AGCGGTATAA CGACATCGCG AGCTGGAACT GGACGCAGAT ACCACCGGCG CTAAACGCGG CGCAACAGGC GGAGTGCTGG CGGGTGACGC AGGCTATCCA ACTGGCAGAG GTCCTCGAAC GTCTGGCGCG CCCACAACGT CTGTCATTTA TTGAAGTGAT GTTGCCAAAA GCCGATCTGC CGGAATTACT GCGTACCGTG ACCCGGGCGC TGGAAGCCCG CAACGGGGGA TAA
|
Protein sequence | MQNPYTVADY LLDRLAGCGI GHLFGVPGDY NLQFLDHVID HPTLRWVGCA NELNAAYAAD GYARMSGAGA LLTTFGVGEL SAINGIAGSY AEYVPVLHIV GAPCSAAQQR GELMHHTLGD GDFRHFYRMS QAISAASAIL DEQNACFEID RVLGEMLAAR RPGYIMLPAD VAKKTAIPPT QALALPVHEA QSGVETAFRY HARQCLMNSR RIALLADFLA GRFGLRPLLQ RWMAETPIAH ATLLMGKGLF DEQHPNFVGT YSAGASSKEV RQAIEDADRV ICVGTRFVDT LTAGFTQQLP TERTLEIQPY ASRIGETWFN LPMAQAVSTL RELCLECAFA PPPTRSAGQP VRIDKGELTQ ESFWQTLQQY LKPGDIILVD QGTAAFGAAA LSLPDGAEVV VQPLWGSIGY SLPAAFGAQT ACPDRRVILI IGDGAAQLTI QEMGSMLRDG QAPVILLLNN DGYTVERAIH GAAQRYNDIA SWNWTQIPPA LNAAQQAECW RVTQAIQLAE VLERLARPQR LSFIEVMLPK ADLPELLRTV TRALEARNGG
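The protein sequence follows from the gene sequence under translation table 11. A minimal sketch of that translation, in plain Python, is given below. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch assumes an ATG start codon (as in this gene), so it does not handle the alternative start codons that table 11 permits.

```python
from itertools import product

# Amino-acid assignments of NCBI translation table 11, listed in NCBI order
# (codons enumerated with bases T, C, A, G at each position). These
# assignments are the same as in the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGCAAAACC CCTATACCGT GGCCGACTAT"))  # MQNPYTVADY
```

Passing the full gene sequence to `translate` should reproduce the protein sequence of the record, which can be used as a consistency check.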