Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2235 |
Symbol | |
ID | 6483922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2141429 |
End bp | 2142295 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642737583 |
Product | propanediol utilization |
Protein accession | YP_002041325 |
Protein GI | 194445064 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4542] Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTGTTG CGCAATGCCC CGCCTCATGC GGGGAACTTA TCCAGGGATG GATTCTGGGC AGTGAGAAAC TGGTCTCCTG CCCCGTTGAC TGGTACAGCA CCGTAGCAGT CACGGCTGCG CCGCCGTTGG TAAACGAACG CCCATTGTCG CGGGCGATGG TGGAGCGCGT TCTGGCGCAC TGGCAGTATC CTGCGCACTG GAGTAATGAG ATTCGCGTCG ATGTGCGTTC GTCAATTCCC GTTGCCAAAG GCATGGCCAG CAGCACCGCA GATATTGCCG CTACGGCAGT GGCAACGGCG CATCATCTTG GCCATTCGCT GGATGAAACT ACCCTTGCAC AGCTTTGCGT CTCAATCGAA CCCACTGATA GCACCGTTTT TCATCAGTTA ACGCTGTTTG ATCATAATAA TGCGGCCACG CAAATCGCCT GCGAGCCACC GCCGCCAATC GATTTGCTGG TACTGGAAAG TCCGGTCACA CTGCGCACGC AAGATTACCA CCGTCTCCCT CGCCAGCAGA AATTAATAGC AAGTTCACCA ACCTTGCAGC AGGCCTGGAA TCTGGTGCAG GAAGCCTGTA TAACGCAAAA TCCGCTCCAA CTGGGTGAGG CGGCTACGCT TAGCGCTATC GCCAGCCAGA CGCTGTTACC TAAGCCAGGA TTTACCGCCC TGCTGTCGCT GGTCGAAGAG TGTGATTTAT ACGGATTGAA CGTGGCACAT AGCGGTAGCG TGGTGGGTCT GATGCTGGAC CGGAAACGTC ATGACATTGC GCGCCTGAAA GGTAAGCTGG CAGAGAAAAA ACTTACCCGA CACTGGCCAA AACAACATTT ACTCAAGATG GTCACAGGCG GAGTCAAACT GCAGTGA
|
Protein sequence | MAVAQCPASC GELIQGWILG SEKLVSCPVD WYSTVAVTAA PPLVNERPLS RAMVERVLAH WQYPAHWSNE IRVDVRSSIP VAKGMASSTA DIAATAVATA HHLGHSLDET TLAQLCVSIE PTDSTVFHQL TLFDHNNAAT QIACEPPPPI DLLVLESPVT LRTQDYHRLP RQQKLIASSP TLQQAWNLVQ EACITQNPLQ LGEAATLSAI ASQTLLPKPG FTALLSLVEE CDLYGLNVAH SGSVVGLMLD RKRHDIARLK GKLAEKKLTR HWPKQHLLKM VTGGVKLQ
|
| |