Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A2394 |
Symbol | |
ID | 6872439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 2263171 |
End bp | 2264037 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642785485 |
Product | propanediol utilization |
Protein accession | YP_002216143 |
Protein GI | 198243668 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4542] Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTGTTG CGCAATGCCC CGCCTCATGC GGGGAACTTA TCCAGGGATG GATTCTGGGC AGTGAGAAAC TGGTCTCCTG CCCCGTTGAC TGGTACAGCA CCGTAGCAGT CACGGCTGCG CCGCCGTTGG TAAACGAGCG TCCATTGTCG CGGGCGATGG TGGAGCGCGT TCTGGCGCAC TGGCAGTATC CTGCGCACTG GAGTAACGAG ATTCGCGTCG ATGTGCGTTC GTCAATTCCC GTTGCCAAAG GCATGGCCAG CAGCACCGCA GATATCGCCG CTACGGCGGT GGCAACGGCG CATCATCTTG GCCATTCGCT GGATGAAACC ACCCTTGCAC AGCTTTGCGT CTCAATCGAA CCCACTGATA GCACCGTTTT TCATCAGTTA ACGCTGTTTG ATCATAATAA TGCGGCCACG CAAATCGCCT GCGAGCCACC GCCGCCAATC GATTTGCTGG TACTGGAAAG CCCGGTCACA CTGCGCACGC AAGATTACCA CCGTCTCCCT CGCCAGCAGA AATTAATAGC AAGTTCACCA ACCTTGCAGC AGGCCTGGAA TCTGGTGCAG GAAGCCTGTA TAACGCAAAA TCCGCTCCGA CTGGGTGAGG CGGCTACGCT TAGCGCTATC GCCAGCCAGA CGCTGTTACC TAAGCCAGGA TTTACCGCCC TGCTGTCGCT GGTCGAAGAG TGTGATTTAT ACGGATTGAA CGTGGCACAT AGCGGTAGCG TGGTGGGTCT GATGCTGGAC CGGAAACGTC ATGACATTGC GCGCCTGAAA GGTAAGCTGG CAGAGAAAAA ACTTACCCGA CACTGGCCAA AACAACATTT ACTCAAGATG GTCACAGGCG GGGTCAAACT GCAGTGA
|
Protein sequence | MAVAQCPASC GELIQGWILG SEKLVSCPVD WYSTVAVTAA PPLVNERPLS RAMVERVLAH WQYPAHWSNE IRVDVRSSIP VAKGMASSTA DIAATAVATA HHLGHSLDET TLAQLCVSIE PTDSTVFHQL TLFDHNNAAT QIACEPPPPI DLLVLESPVT LRTQDYHRLP RQQKLIASSP TLQQAWNLVQ EACITQNPLR LGEAATLSAI ASQTLLPKPG FTALLSLVEE CDLYGLNVAH SGSVVGLMLD RKRHDIARLK GKLAEKKLTR HWPKQHLLKM VTGGVKLQ
|
| |