Gene ECH74115_4483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4483 
Symbolpnp 
ID6970946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4153297 
End bp4155432 
Gene Length2136 bp 
Protein Length711 aa 
Translation table11 
GC content54% 
IMG OID643388198 
Productpolynucleotide phosphorylase/polyadenylase 
Protein accessionYP_002272635 
Protein GI209397723 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1185] Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase) 
TIGRFAM ID[TIGR03591] polyribonucleotide nucleotidyltransferase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTTAATC CGATCGTTCG TAAATTCCAG TACGGCCAAC ACACCGTGAC TCTGGAAACC 
GGCATGATGG CTCGTCAGGC TACTGCCGCT GTTATGGTTA GCATGGATGA CACCGCGGTA
TTTGTTACCG TTGTTGGCCA GAAAAAAGCC AAACCAGGTC AGGACTTCTT CCCACTGACC
GTTAACTATC AGGAGCGTAC CTACGCTGCT GGTCGTATCC CGGGTAGCTT CTTCCGTCGT
GAAGGTCGTC CAAGCGAAGG CGAAACCCTG ATCGCGCGTC TGATTGACCG CCCGATTCGC
CCGCTGTTCC CGGAAGGCTT CGTCAACGAA GTTCAGGTTA TCGCCACCGT GGTTTCTGTT
AACCCGCAGG TTAACCCGGA TATCGTCGCG ATGATTGGTG CTTCCGCAGC ACTGTCTCTG
TCTGGTATTC CGTTCAATGG CCCGATTGGT GCTGCCCGCG TAGGTTACAT CAATGACCAG
TACGTACTGA ACCCGACTCA GGACGAGCTG AAAGAGAGCA AACTGGATCT GGTTGTTGCG
GGTACTGAAG CCGCTGTACT GATGGTTGAA TCTGAAGCTG AACTGCTGAG CGAAGACCAG
ATGCTGGGCG CAGTAGTGTT CGGTCATGAA CAACAGCAGG TTGTTATTCA GAACATCAAT
GAACTGGTGA AAGAAGCCGG TAAACCGCGT TGGGACTGGC AGCCGGAGCC GGTAAACGAA
GCGCTGAACG CGCGCGTTGC TGCACTGGCT GAAGCTCGTC TGAGCGATGC TTACCGCATC
ACCGACAAAC AAGAGCGTTA TGCGCAGGTT GATGTCATCA AATCTGAAAC CATCGCGACG
CTGCTTGCTG AAGACGAAAC CCTGGACGAA AACGAACTGG GTGAAATTCT GCACGCGATC
GAGAAAAACG TTGTTCGTAG CCGCGTACTG GCAGGCGAAC CGCGTATCGA TGGTCGTGAA
AAAGATATGA TCCGTGGTCT GGATGTGCGT ACTGGCGTGC TGCCGCGTAC TCACGGTTCT
GCGCTGTTCA CCCGTGGTGA AACGCAGGCG CTGGTTACCG CAACGCTGGG TACTGCTCGT
GACGCGCAGG TTCTTGATGA ACTGATGGGC GAACGTACCG ATACCTTCCT GTTCCACTAC
AACTTCCCTC CGTACTCCGT AGGCGAAACC GGCATGGTCG GTTCTCCGAA GCGTCGTGAA
ATTGGTCACG GTCGTCTGGC GAAGCGCGGC GTGCTGGCAG TAATGCCGGA TATGGACAAA
TTCCCGTACA CCGTACGTGT TGTGTCTGAA ATCACTGAAT CCAACGGTTC TTCTTCTATG
GCTTCCGTGT GCGGTGCGTC TCTGGCGCTG ATGGACGCAG GTGTGCCAAT CAAAGCTGCC
GTTGCGGGTA TCGCAATGGG TCTGGTGAAA GAAGGCGACA ACTACGTTGT ACTGTCTGAC
ATTTTGGGCG ACGAAGATCA CCTGGGCGAT ATGGACTTCA AAGTTGCGGG TTCCCGCGAC
GGTATCTCTG CACTGCAGAT GGATATCAAA ATTGAAGGTA TCACCAAAGA GATCATGCAG
GTTGCACTGA ACCAGGCTAA AGGTGCGCGT CTGCACATCC TGGGTGTAAT GGAACAGGCG
ATCAACGCGC CACGTGGCGA TATCTCTGAG TTCGCTCCGC GTATCCATAC CATCAAGATC
AACCCGGACA AGATCAAAGA CGTTATCGGT AAAGGCGGCT CTGTGATCCG TGCCCTGACC
GAAGAAACTG GCACCACCAT CGAAATCGAA GATGACGGTA CTGTGAAGAT CGCAGCGACC
GATGGCGAGA AAGCGAAACA CGCTATTCGT CGTATCGAAG AGATCACTGC AGAAATTGAA
GTGGGCCGCG TCTACACTGG TAAAGTGACC CGTATCGTTG ACTTCGGCGC ATTTGTTGCC
ATCGGCGGCG GTAAAGAAGG TCTGGTCCAC ATCTCTCAAA TCGCTGACAA ACGCGTTGAG
AAAGTGACCG ATTACCTGCA GATGGGTCAG GAAGTACCGG TGAAAGTTCT GGAAGTTGAT
CGCCAGGGCC GTATCCGTCT GAGCATTAAA GAAGCGACTG AGCAGTCTCA ACCTGCTGCA
GCACCGGAAG CTCCGGCTGC TGAACAGGGC GAGTAA
 
Protein sequence
MLNPIVRKFQ YGQHTVTLET GMMARQATAA VMVSMDDTAV FVTVVGQKKA KPGQDFFPLT 
VNYQERTYAA GRIPGSFFRR EGRPSEGETL IARLIDRPIR PLFPEGFVNE VQVIATVVSV
NPQVNPDIVA MIGASAALSL SGIPFNGPIG AARVGYINDQ YVLNPTQDEL KESKLDLVVA
GTEAAVLMVE SEAELLSEDQ MLGAVVFGHE QQQVVIQNIN ELVKEAGKPR WDWQPEPVNE
ALNARVAALA EARLSDAYRI TDKQERYAQV DVIKSETIAT LLAEDETLDE NELGEILHAI
EKNVVRSRVL AGEPRIDGRE KDMIRGLDVR TGVLPRTHGS ALFTRGETQA LVTATLGTAR
DAQVLDELMG ERTDTFLFHY NFPPYSVGET GMVGSPKRRE IGHGRLAKRG VLAVMPDMDK
FPYTVRVVSE ITESNGSSSM ASVCGASLAL MDAGVPIKAA VAGIAMGLVK EGDNYVVLSD
ILGDEDHLGD MDFKVAGSRD GISALQMDIK IEGITKEIMQ VALNQAKGAR LHILGVMEQA
INAPRGDISE FAPRIHTIKI NPDKIKDVIG KGGSVIRALT EETGTTIEIE DDGTVKIAAT
DGEKAKHAIR RIEEITAEIE VGRVYTGKVT RIVDFGAFVA IGGGKEGLVH ISQIADKRVE
KVTDYLQMGQ EVPVKVLEVD RQGRIRLSIK EATEQSQPAA APEAPAAEQG E