Gene Information |
Locus tag | SNSL254_A2597 |
Symbol | |
ID | 6483044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2519321 |
End bp | 2520973 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642737930 |
Product | indole-3-pyruvate decarboxylase |
Protein accession | YP_002041670 |
Protein GI | 194445213 |
COG category | [G] Carbohydrate transport and metabolism; [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes |
TIGRFAM ID | [TIGR03393] indolepyruvate decarboxylase, Erwinia family |
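
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions rather than statements from the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2519321
END_BP = 2520973
GENE_LENGTH_BP = 1653
PROTEIN_LENGTH_AA = 550


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # compare with Gene Length
    print(coding_codons(GENE_LENGTH_BP))   # compare with Protein Length
```

Under these assumptions, 2520973 - 2519321 + 1 gives 1653 bp, and 1653 / 3 - 1 gives 550 aa, which agrees with the table.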
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.380997 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAACC CCTATACCGT GGCCGACTAT TTGCTGGACA GACTGGCAGG ATGCGGCATT GGCCATCTTT TTGGCGTACC GGGCGATTAT AACTTGCAGT TTCTTGACCA TGTGATTGAC CACCCGACCC TGCGTTGGGT GGGATGCGCC AATGAGCTGA ACGCCGCTTA TGCCGCGGAC GGCTATGCGC GCATGTCGGG CGCTGGAGCG CTACTCACTA CCTTTGGCGT GGGAGAACTT AGCGCTATTA ACGGTATCGC GGGCAGTTAC GCGGAATATG TCCCGGTCTT GCATATCGTC GGCGCGCCCT GTAGCGCTGC GCAGCAGCGA GGCGAATTGA TGCACCACAC CCTCGGTGAC GGCGATTTTC GTCATTTTTA TCGCATGAGC CAGGCGATAT CCGCTGCCAG CGCAATATTA GATGAACAGA ACGCCTGTTT CGAGATTGAC CGCGTGTTGG GTGAAATGCT TGCCGCACGC AGGCCAGGAT ACATCATGTT GCCCGCCGAT GTGGCGAAAA AAACGGCCAT CCCGCCTACG CAGGCGCTGG CGTTGCCCGT GCATGAAGCG CAAAGCGGCG TGGAGACGGC TTTTCGTTAC CACGCCCGTC AGTGCCTGAT GAACAGTCGG CGCATTGCGC TATTGGCCGA CTTTCTTGCC GGGCGTTTTG GTTTACGACC ACTGTTGCAA CGCTGGATGG CGGAAACGCC CATCGCTCAT GCCACACTAC TGATGGGGAA GGGGCTTTTT GATGAACAGC ACCCGAACTT CGTTGGCACC TATAGCGCAG GCGCCAGCAG CAAAGAAGTG CGTCAGGCCA TAGAGGACGC CGATAGGGTT ATCTGCGTCG GCACCCGTTT TGTCGATACC CTTACGGCCG GATTCACCCA ACAATTACCG GCGGAACGCA CGCTGGAGAT TCAGCCTTAC GCGTCGCGCA TCGGCGAAAC CTGGTTCAAC CTCCCGATGG CGCAGGCGGT GTCTACGCTG CGCGAACTGT GCCTGGAATG CGCTTTTGCG CCGCCGCCGA CGCGTTCCGC CGGACAGCCA GTGCGGATTG ATAAGGGAGA ACTGACCCAG GAAAGCTTCT GGCAAACCTT ACAGCAGTAT CTCAAACCCG GAGATATTAT CCTTGTCGAC CAGGGGACTG CAGCTTTTGG CGCTGCCGCG CTGTCGCTTC CTGACGGCGC GGAAGTTGTG GTACAGCCGC TGTGGGGATC TATCGGCTAT TCCTTGCCCG CCGCGTTTGG CGCGCAAACC GCCTGCCCCG ATCGGCGGGT GATTCTGATT ATTGGCGATG GCGCGGCGCA GCTCACGATT CAGGAGATGG GCTCGATGTT ACGCGACGGG CAGGCGCCGG TCATCCTGCT GCTCAACAAT GACGGCTATA CCGTAGAGCG CGCCATTCAC GGCGCGGCCC AGCGGTACAA CGACATCGCG AGCTGGAACT GGACGCAGAT ACCACCGGCG CTAAACGCGG CGCAACAGGC GGAGTGCTGG CGGGTGACGC AGGCTATCCA ACTGGCAGAG GTCCTCGAAC GTCTGGCGCG CCCACAACGT CTGTCATTTA TTGAAGTGAT GTTGCCAAAA GCCGATCTGC CGGAATTACT GCGTACCGTG ACCCGGGCGC TGGAAGCCCG CAACGGGGGA TAA
|
Protein sequence | MQNPYTVADY LLDRLAGCGI GHLFGVPGDY NLQFLDHVID HPTLRWVGCA NELNAAYAAD GYARMSGAGA LLTTFGVGEL SAINGIAGSY AEYVPVLHIV GAPCSAAQQR GELMHHTLGD GDFRHFYRMS QAISAASAIL DEQNACFEID RVLGEMLAAR RPGYIMLPAD VAKKTAIPPT QALALPVHEA QSGVETAFRY HARQCLMNSR RIALLADFLA GRFGLRPLLQ RWMAETPIAH ATLLMGKGLF DEQHPNFVGT YSAGASSKEV RQAIEDADRV ICVGTRFVDT LTAGFTQQLP AERTLEIQPY ASRIGETWFN LPMAQAVSTL RELCLECAFA PPPTRSAGQP VRIDKGELTQ ESFWQTLQQY LKPGDIILVD QGTAAFGAAA LSLPDGAEVV VQPLWGSIGY SLPAAFGAQT ACPDRRVILI IGDGAAQLTI QEMGSMLRDG QAPVILLLNN DGYTVERAIH GAAQRYNDIA SWNWTQIPPA LNAAQQAECW RVTQAIQLAE VLERLARPQR LSFIEVMLPK ADLPELLRTV TRALEARNGG
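
As a minimal sketch of how the gene and protein sequences relate, the code below translates a nucleotide string codon by codon. It assumes the gene sequence is shown in coding orientation (it starts with ATG and ends with TAA, although the gene lies on the minus strand), so no reverse complement is taken. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons, so the standard assignments are used here. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    first_blocks = "ATGCAAAACC CCTATACCGT GGCCGACTAT"
    print(translate(first_blocks))  # MQNPYTVADY
```

The output matches the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.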