Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B2181 |
Symbol | |
ID | 6794743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 2090559 |
End bp | 2091461 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642776390 |
Product | propanediol utilization |
Protein accession | YP_002147015 |
Protein GI | 197248419 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4542] Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCAC ACTATTCGTA CCTGAAAGGT GATAACGTGG CTGTTGCGCA ATGCCCCGCC TCATGCGGAG AACTTATCCA GGGATGGATT CTGGGCAGTG AGAAACTGGT CTCCTGCCCC GTTGACTGGT ACAGCACCGT AGCAGTCACG GCTGCGCCGC CGTTGGTAAA CGAGCGCCCA TTGTCGCGGG CGATGGTGGA GCGCGTTCTG GCGCACTGGC AGTATCCTGC GCACTGGAGT AACGAGATTC GCGTCGATGT GCGTTCGTCA ATTCCCGTTG CCAAAGGCAT GGCCAGCAGC ACCGCAGATA TCGCCGCTAC GGCGGTGGCA ACGGCGCATC ATCTTGGCCA TTCGCTGGAT GAAACCACCC TTGCACAGCT TTGCGTCTCA ATCGAACCCA CCGATAGCAC CGTTTTTCAT CAGTTAACGC TGTTTGATCA TAATAATGCG GTCACGCAAA TCGCCTGCGA GCCACCGCCG CCAATCGATT TGCTGGTACT GGAAAGTCCG GTCACACTGC GCACGCAAGA TTATCACCGT CTCCCTCGCC AGCAGAAATT AATAGCAAGT TCAGCAACCT TGCAGCAGGC CTGGAATCTG GTGCAGGAAG CCTGTATAAC GCAAAATCCG CTCCGACTGG GTGAGGCGGC TACGCTTAGC GCTATCGCCA GCCAGACGCT GTTACCTAAG CCAGGATTTA CCGCCCTGCT GTCGCTGGTC GAAGAGTGTG ATTTATACGG ATTGAACGTG GCGCATAGCG GTAGCGTGGT GGGTCTGATG CTGGACCGGA AACGTCATGA TATTGCGCGT CTGAAAGGTA AGCTGGCAGA GAAAAAACTT ACCCGACACT GGCCAAAACA ACATTTACTC AAGATGGTCA CAGGCGGGGT CAAACTGCAG TGA
|
Protein sequence | MRAHYSYLKG DNVAVAQCPA SCGELIQGWI LGSEKLVSCP VDWYSTVAVT AAPPLVNERP LSRAMVERVL AHWQYPAHWS NEIRVDVRSS IPVAKGMASS TADIAATAVA TAHHLGHSLD ETTLAQLCVS IEPTDSTVFH QLTLFDHNNA VTQIACEPPP PIDLLVLESP VTLRTQDYHR LPRQQKLIAS SATLQQAWNL VQEACITQNP LRLGEAATLS AIASQTLLPK PGFTALLSLV EECDLYGLNV AHSGSVVGLM LDRKRHDIAR LKGKLAEKKL TRHWPKQHLL KMVTGGVKLQ
|
| |