Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2447 |
Symbol | pta1 |
ID | 5593264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2456318 |
End bp | 2458462 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640921569 |
Product | phosphate acetyltransferase |
Protein accession | YP_001459103 |
Protein GI | 157161785 |
COG category | [C] Energy production and conversion |
COG ID | [COG0280] Phosphotransacetylase |
TIGRFAM ID | [TIGR00651] phosphate acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0000498187 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCCCGTA TTATTATGCT GATCCCTACC GGAACCAGCG TCGGTCTGAC CAGCGTCAGC CTTGGCGTGA TCCGTGCAAT GGAACGCAAA GGCGTTCGTC TGAGCGTTTT CAAACCTATC GCTCAGCCGC GTACCGGTGG CGATGCGCCC GATCAGACTA CGACTATCGT GCGTGCGAAC TCTTCCACCA CGACGGCCGC TGAACCGCTG AAAATGAGCT ACGTTGAAGG TCTGCTTTCC AGCAATCAGA AAGATGTGCT GATGGAAGAG ATCGTCGCAA ACTACCACGC TAACACCAAA GACGCTGAAG TCGTTCTGGT TGAAGGTCTG GTCCCGACAC GTAAGCACCA GTTTGCCCAG TCTCTGAACT ACGAAATCGC TAAAACGCTG AATGCGGAAA TCGTCTTCGT TATGTCTCAG GGCACTGACA CCCCGGAACA GCTGAAAGAG CGTATCGAAC TGACCCGCAA CAGCTTCGGC GGTGCCAAAA ACACCAACAT CACCGGCGTT ATCGTTAACA AACTGAACGC ACCGGTTGAT GAACAGGGTC GTACTCGCCC GGATCTGTCC GAGATTTTCG ACGACTCTTC CAAAGCTAAA GTAAACAATG TTGATCCGGC GAAGCTGCAA GAATCCAGCC CGCTGCCGGT TCTCGGCGCT GTGCCGTGGA GCTTTGACCT GATCGCGACT CGTGCGATCG ATATGGCTCG CCACCTGAAT GCGACCATCA TCAACGAAGG CGACATCAAT ACTCGCCGCG TTAAATCCGT CACTTTCTGC GCACGCAGCA TTCCGCACAT GCTGGAGCAC TTCCGTGCCG GTTCTCTGCT GGTGACTTCC GCAGACCGTC CTGACGTGCT GGTGGCCGCT TGCCTGGCAG CCATGAACGG CGTAGAAATC GGTGCCCTGC TGCTGACTGG CGGTTACGAA ATGGACGCGC GCATTTCTAA ACTGTGCGAA CGTGCTTTCG CTACCGGCCT GCCGGTATTT ATGGTGAACA CCAACACCTG GCAGACCTCT CTGAGCCTGC AGAGCTTCAA CCTGGAAGTT CCGGTTGACG ATCACGAACG TATCGAGAAA GTTCAGGAAT ACGTTGCTAA CTACATCAAC GCTGACTGGA TCGAATCTCT GACTGCCACT TCTGAGCGCA GCCGTCGTCT GTCTCCGCCT GCGTTCCGTT ATCAGCTGAC TGAACTTGCG CGCAAAGCGG GCAAACGTAT CGTACTGCCG GAAGGTGACG AACCGCGTAC CGTTAAAGCA GCCGCTATCT GTGCTGAACG TGGTATCGCA ACTTGCGTAC TGCTGGGTAA TCCGGCAGAG ATCAACCGTG TTGCAGCGTC TCAGGGTGTA GAACTGGGTG CAGGGATTGA AATCGTTGAT CCAGAAGTGG TTCGCGAAAG CTATGTTGGT CGTCTGGTCG AACTGCGTAA GAACAAAGGC ATGACCGAAA CCGTTGCCCG CGAACAGCTG GAAGACAACG TGGTGCTCGG TACGCTGATG CTGGAACAGG ATGAAGTTGA TGGTCTGGTT TCCGGTGCTG TTCACACTAC CGCAAACACC ATCCGTCCGC CGCTGCAGCT GATCAAAACT GCACCGGGCA GCTCCCTGGT ATCTTCCGTG TTCTTCATGC TGCTGCCGGA ACAGGTTTAC GTTTACGGTG ACTGTGCGAT CAACCCGGAT CCGACCGCTG AACAGCTGGC AGAAATCGCG ATTCAGTCCG CTGATTCCGC TGCGGCCTTC GGTATCGAAC CGCGCGTTGC TATGCTCTCC TACTCCACCG GTACTTCTGG TGCAGGTAGC GACGTAGAAA AAGTTCGCGA AGCAACTCGT CTGGCGCAGG AAAAACGTCC TGACCTGATG ATCGACGGTC CGCTGCAGTA CGACGCTGCG GTAATGGCTG ACGTTGCGAA ATCCAAAGCG CCGAACTCTC CGGTTGCAGG TCGCGCTACC GTGTTCATCT TCCCGGATCT GAACACCGGT AACACCACCT ACAAAGCGGT ACAGCGTTCT GCCGACCTGA TCTCCATCGG GCCGATGCTG CAGGGTATGC GCAAGCCGGT TAACGACCTG TCCCGTGGCG CACTGGTTGA CGATATCGTC TACACCATCG CGCTGACTGC GATTCAGTCT GCACAGCAGC AGTAA
|
Protein sequence | MSRIIMLIPT GTSVGLTSVS LGVIRAMERK GVRLSVFKPI AQPRTGGDAP DQTTTIVRAN SSTTTAAEPL KMSYVEGLLS SNQKDVLMEE IVANYHANTK DAEVVLVEGL VPTRKHQFAQ SLNYEIAKTL NAEIVFVMSQ GTDTPEQLKE RIELTRNSFG GAKNTNITGV IVNKLNAPVD EQGRTRPDLS EIFDDSSKAK VNNVDPAKLQ ESSPLPVLGA VPWSFDLIAT RAIDMARHLN ATIINEGDIN TRRVKSVTFC ARSIPHMLEH FRAGSLLVTS ADRPDVLVAA CLAAMNGVEI GALLLTGGYE MDARISKLCE RAFATGLPVF MVNTNTWQTS LSLQSFNLEV PVDDHERIEK VQEYVANYIN ADWIESLTAT SERSRRLSPP AFRYQLTELA RKAGKRIVLP EGDEPRTVKA AAICAERGIA TCVLLGNPAE INRVAASQGV ELGAGIEIVD PEVVRESYVG RLVELRKNKG MTETVAREQL EDNVVLGTLM LEQDEVDGLV SGAVHTTANT IRPPLQLIKT APGSSLVSSV FFMLLPEQVY VYGDCAINPD PTAEQLAEIA IQSADSAAAF GIEPRVAMLS YSTGTSGAGS DVEKVREATR LAQEKRPDLM IDGPLQYDAA VMADVAKSKA PNSPVAGRAT VFIFPDLNTG NTTYKAVQRS ADLISIGPML QGMRKPVNDL SRGALVDDIV YTIALTAIQS AQQQ
|
| |