Gene Information |
Locus tag | SeAg_B1957 |
Symbol | pepT |
ID | 6792921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 1899496 |
End bp | 1900725 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642776183 |
Product | peptidase T |
Protein accession | YP_002146814 |
Protein GI | 197250100 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01882] peptidase T |
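
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1899496, 1900725)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (1230 bp) and Protein Length (409 aa) fields listed in the table.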
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.355866 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAAC TACTTGAGCG TTTTTTACAC TACGTATCGC TGGATACCCA ATCAAAGTCG GGTGTTCGGC AGGTTCCCAG CACTGAGGGG CAGTGGAAGT TACTACGTTT GCTCAAACAG CAGCTCGAAG AGATGGGGCT GGTTAACATT ACATTAAGTG AAAAAGGGAC GTTGATGGCG ACGCTCCCGG CCAATGTTGA GGGGGATATT CCCGCCATTG GTTTTATCTC CCATGTGGAT ACCTCTCCGG ATTTCAGCGG TAAAAACGTT AACCCGCAGA TTGTCGAGAA TTATCGCGGC GGCGATATAG CATTAGGGAT TGGCGATGAG GTGTTGTCAC CCGTGATGTT CCCGGTACTG CATCAATTAC TGGGACAGAC GCTGATTACT ACCGATGGTA AGACATTGCT GGGCGCGGAC GATAAAGCCG GTGTTGCGGA GATCATGACC GCGCTGGCGG TGCTGAAAGG TAATCCTATT CCCCACGGCG ACATTAAAGT GGCGTTTACG CCTGACGAAG AGGTAGGGAA AGGCGCGAAG CACTTCGATG TCGAGGAATT TGGCGCGCAG TGGGCCTATA CGGTCGACGG CGGCGGCGTG GGCGAACTGG AGTTTGAAAA CTTCAATGCC GCCTCGGTGA ATATCAAAAT CGTCGGCAAT AACGTGCATC CTGGTACGGC GAAAGGTGTG ATGGTCAATG CGCTGTCGTT GGCGGCGAGG ATTCACGCGG AAGTGCCGGC GGATGAAGCG CCTGAAACCA CTGAAGGTTA CGAAGGGTTT TATCATCTGG CCAGCATGAA AGGCACCGTT GACCGGGCCG AAATGCACTA CATCATTCGC GATTTCGACC GTAAGCAGTT TGAAGCGCGT AAACGCAAAA TGATGGAGAT TGCCAAAAAA GTCGGTAAGG GGCTGCATCC GGACTGCTAT ATCGAACTGG TGATTGAAGA CAGTTATTAC AATATGCGCG AAAAAGTGGT TGAGCATCCG CATATTCTCG ATATCGCCCA GCAGGCCATG CGCGACTGTC ATATTACGCC GGAGATGAAA CCGATTCGCG GCGGTACAGA CGGGGCGCAA CTGTCGTTTA TGGGCCTGCC GTGTCCTAAT CTCTTTACCG GCGGATATAA CTATCATGGT AAACATGAGT TTGTGACGCT GGAGGGGATG GAAAAAGCGG TACAGGTGAT TGTACGTATC GCGGAGCTGA CGGCGAAGCG CGGTCAGTAG
|
Protein sequence | MDKLLERFLH YVSLDTQSKS GVRQVPSTEG QWKLLRLLKQ QLEEMGLVNI TLSEKGTLMA TLPANVEGDI PAIGFISHVD TSPDFSGKNV NPQIVENYRG GDIALGIGDE VLSPVMFPVL HQLLGQTLIT TDGKTLLGAD DKAGVAEIMT ALAVLKGNPI PHGDIKVAFT PDEEVGKGAK HFDVEEFGAQ WAYTVDGGGV GELEFENFNA ASVNIKIVGN NVHPGTAKGV MVNALSLAAR IHAEVPADEA PETTEGYEGF YHLASMKGTV DRAEMHYIIR DFDRKQFEAR KRKMMEIAKK VGKGLHPDCY IELVIEDSYY NMREKVVEHP HILDIAQQAM RDCHITPEMK PIRGGTDGAQ LSFMGLPCPN LFTGGYNYHG KHEFVTLEGM EKAVQVIVRI AELTAKRGQ
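
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field might be normalized and its GC fraction computed; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first three blocks of the gene sequence are used here, so the result is not expected to match the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three ten-base blocks of the Gene sequence field (illustration only).
prefix = clean_sequence("ATGGATAAAC TACTTGAGCG TTTTTTACAC")
print(len(prefix), prefix[:3], round(gc_fraction(prefix), 3))
```

Applying the same two helpers to the full Gene sequence field would give a value comparable to the GC content field, assuming that field is computed over the gene sequence alone.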