Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C2281 |
Symbol | |
ID | 6491136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 2180369 |
End bp | 2181235 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642742471 |
Product | propanediol utilization protein |
Protein accession | YP_002046106 |
Protein GI | 194447350 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4542] Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.0102974 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTGTTG CGCAATGCCC CGCCTCATGC GGGGAACTTA TCCAGGGATG GATTCTGGGC AGTGAGAAAC TGGTCTCCTG CCCCGTTGAC TGGTACAGCA CCGTAGCAGT CACGGCTGCG CCGCCGTTGA TAAACGAGCG CCCATTGTCG CGGGCGATGG TGGAGCGCGT TCTGGCGCAC TGGCAGTATC CTGCGCACTG GAGTAACGAG ATTCGCGTCG ATGTGCGTTC GTCAATTCCC GTTGCCAAAG GCATGGCCAG CAGCACCGCA GATATTGCCG CTACGGCGGT GGCAACGGCG CATCATCTTG GCCATTCGCT GGATGAAACC ACCCTTGCAC AGCTTTGCGT CTCAATCGAA CCCACTGATA GCACCGTTTT TCATCAGTTA ACGCTGTTTG ATCATAATAA TGCGGCCACG CAAATCGCCT GCGAGCCACC GCCGCCAATC GATTTGCTGG TACTGGAAAG CCCGGTCACA CTGCGCACGC AAGATTACCA CCGTCTCCCT CGCCAGCAGA AATTAATAGC AAGTTCAGCA ACCTTGCAGC AGGCCTGGAA TCTGGTGCAG GAAGCCTGTA TAACGCAAAA TCCGCTCCGG CTGGGTGAGG CGGCTACGCT TAGCGCTATC GCCAGCCAGA CGCTGTTACC TAAGCCGGGC TTTACCGCCC TGCTGTCGCT GGTCGAAGAG TGTGATTTAT ACGGATTGAA CGTGGCGCAT AGCGGTAGCG TGGTGGGTCT GATGCTGGAC CGGAAACGTC ATGATATTGC GCGTCTGAAA GGTAAGCTGG CAGAGAAAAA ACTTACCCGA CACTGGCCAA AACAACATTT ACTCAAGATG GTCACTGGCG GGGTCAAACT GCAGTGA
|
Protein sequence | MAVAQCPASC GELIQGWILG SEKLVSCPVD WYSTVAVTAA PPLINERPLS RAMVERVLAH WQYPAHWSNE IRVDVRSSIP VAKGMASSTA DIAATAVATA HHLGHSLDET TLAQLCVSIE PTDSTVFHQL TLFDHNNAAT QIACEPPPPI DLLVLESPVT LRTQDYHRLP RQQKLIASSA TLQQAWNLVQ EACITQNPLR LGEAATLSAI ASQTLLPKPG FTALLSLVEE CDLYGLNVAH SGSVVGLMLD RKRHDIARLK GKLAEKKLTR HWPKQHLLKM VTGGVKLQ
|
| |