Gene Information |
Locus tag | SeSA_A2228 |
Symbol | |
ID | 6516593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 2112439 |
End bp | 2113341 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642747298 |
Product | propanediol utilization |
Protein accession | YP_002115091 |
Protein GI | 194734860 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4542] Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase |
TIGRFAM ID | |
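
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is only an illustration. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2112439
END_BP = 2113341
GENE_LENGTH_BP = 903
PROTEIN_LENGTH_AA = 300


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 903
    print(expected_protein_length(GENE_LENGTH_BP))  # 300
```

Under these assumptions both results agree with the listed Gene Length (903 bp) and Protein Length (300 aa).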
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.484991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.20213 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCAC ACTATTTGTA CCTGAAAGGT GATAACGTGG CTGTTGCGCA ATGCCCCGCC TCATGCGGGG AACTTATCCA GGGATGGATT CTGGGCAGTG AGAAACTGGT CTCCTGCCCT GTTGACTGGT ACAGCACCGT AGCAGTCACG GCTGCGCCGC CGCTGGTAAA CGAGCGTCCA TTGTCGCGGG CGATGGTGGA GCGCGTTTTG GCGCACTGGC AGTATCCTGC GCACTGGAGT AACGAGATTC GCGTCGATGT GCGTTCGTCA ATTCCCGTTG CCAAAGGCAT GGCCAGCAGT ACCGCAGATA TCGCCGCTAC GGCGGTGGCA ACGGCGCATC ATCTTGGCCA TTCGCTGGAT GAAACCACCC TTGCACAGCT TTGCGTCTCA ATCGAACCCA CTGATAGCAC CGTTTTTCAT CAGTTAACGC TGTTTGATCA TAATAATGCA GCCACGCAAA TCGCCTGCGA GCCACCGCCG CCAATCGATT TGCTGGTACT GGAAAGCCCG GTCACACTGC GCACGCAAGA TTACCACCGT CTCCCTCGCC AGCAGAAATT AATAGCAAGT TCAGCAACCT TGCAGCAGGC CTGGAATCTG GTGCAGGAAG CCTGTATAAC GCAAAATCCG CTCCGACTGG GTGAGGCGGC TACGCTTAGC GCAATCGCCA GCCAGACGCT GTTACCTAAG CCGGGCTTTA CCGCCCTGCT GTCGCTGGTC GAAGAGTGTG ATTTATACGG ATTGAACGTG GCGCATAGCG GTAGCGTGGT GGGTCTGATG CTGGACCGGA AACGTCATGA TATTGCGCGT CTGAAAGGTA AGCTGGCAGA GAAAAAACTT ACCCGACACT GGCCAAAACA ACATTTACTC AAGATGGTCA CAGGCGGGGT CAAACTGCAG TGA
|
Protein sequence | MRAHYLYLKG DNVAVAQCPA SCGELIQGWI LGSEKLVSCP VDWYSTVAVT AAPPLVNERP LSRAMVERVL AHWQYPAHWS NEIRVDVRSS IPVAKGMASS TADIAATAVA TAHHLGHSLD ETTLAQLCVS IEPTDSTVFH QLTLFDHNNA ATQIACEPPP PIDLLVLESP VTLRTQDYHR LPRQQKLIAS SATLQQAWNL VQEACITQNP LRLGEAATLS AIASQTLLPK PGFTALLSLV EECDLYGLNV AHSGSVVGLM LDRKRHDIAR LKGKLAEKKL TRHWPKQHLL KMVTGGVKLQ
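
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the Gene sequence relates to the Protein sequence and to the GC content field, the code below strips the block spacing, computes the G+C fraction, and translates the CDS codon by codon. It uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The sketch does not handle alternative start codons or ambiguous bases, which this record does not need because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, in the usual
# TCAG order (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the Gene sequence field.
    first_blocks = "ATGCGCGCAC ACTATTTGTA CCTGAAAGGT"
    print(translate(first_blocks))  # MRAHYLYLKG
```

The ten residues obtained from the first three nucleotide blocks match the first block of the Protein sequence field. Passing the full Gene sequence string to `translate` and `gc_fraction` gives a way to compare against the listed protein and the GC content value.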