Gene EcSMS35_3460 details

Gene Information

Locus tag: EcSMS35_3460
Symbol: pnp
ID: 6146841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3535789
End bp: 3537924
Gene Length: 2136 bp
Protein Length: 711 aa
Translation table: 11
GC content: 54%
IMG OID: 641618289
Product: polynucleotide phosphorylase/polyadenylase
Protein accession: YP_001745438
Protein GI: 170682265
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1185] Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase)
TIGRFAM ID: [TIGR03591] polyribonucleotide nucleotidyltransferase
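
The gene length and protein length listed above follow from the start and end coordinates. The short sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein (the gene sequence in this record ends in TAA). The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues in a CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(3535789, 3537924)
print(length, "bp")                      # gene length in base pairs
print(protein_length_aa(length), "aa")   # protein length in amino acids
```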


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.234004
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTTAATC CGATCGTTCG TAAATTCCAG TACGGCCAAC ACACCGTGAC TCTGGAAACC 
GGCATGATGG CTCGTCAGGC TACTGCCGCT GTTATGGTTA GCATGGATGA CACCGCGGTA
TTCGTTACCG TTGTTGGCCA GAAAAAAGCC AAACCAGGTC AGGACTTCTT CCCACTGACC
GTTAACTATC AGGAGCGTAC CTACGCTGCT GGTCGTATCC CGGGTAGCTT CTTCCGTCGT
GAAGGCCGCC CAAGCGAAGG CGAAACCCTG ATCGCGCGTC TGATTGACCG CCCGATTCGC
CCGCTGTTCC CGGAAGGCTT CGTCAACGAA GTTCAGGTTA TCGCCACCGT GGTTTCTGTT
AACCCGCAAG TTAACCCGGA TATCGTCGCG ATGATTGGTG CTTCCGCAGC ACTGTCTCTG
TCTGGTATTC CGTTCAATGG TCCGATTGGT GCTGCCCGCG TAGGTTACAT CAATGACCAG
TACGTACTGA ACCCGACTCA GGACGAGCTG AAAGAGAGCA AACTGGATCT GGTTGTTGCG
GGTACTGAAG CCGCTGTACT GATGGTTGAA TCTGAAGCTG AACTGCTGAG CGAAGACCAG
ATGTTGGGCG CAGTAGTGTT TGGTCATGAA CAACAGCAGG TTGTCATTCA GAACATCAAT
GAACTGGTGA AAGAAGCCGG TAAACCGCGT TGGGACTGGC AGCCGGAGCC GGTAAACGAA
GCGCTGAACG CGCGCGTTGC TGCACTGGCT GAAGCTCGTC TGAGCGATGC TTACCGCATC
ACCGACAAAC AAGAGCGTTA TGCGCAGGTT GATGTCATCA AATCTGAAAC CATCGCGACG
CTGCTTGCTG AAGATGAAAC CCTGGACGAA AACGAACTGG GTGAAATTCT GCACGCTATC
GAGAAAAACG TTGTTCGTAG CCGCGTACTG GCAGGCGAAC CGCGTATCGA CGGTCGTGAA
AAAGATATGA TTCGTGGTCT GGATGTGCGT ACTGGCGTGC TGCCGCGTAC TCACGGTTCT
GCGCTGTTCA CCCGTGGTGA AACTCAGGCG CTGGTTACCG CAACGCTGGG TACTGCACGT
GACGCGCAGG TTCTTGATGA ACTGATGGGC GAACGTACTG ACACCTTCCT GTTCCACTAC
AACTTCCCTC CGTACTCCGT AGGCGAAACC GGCATGGTCG GTTCTCCGAA GCGTCGTGAA
ATTGGTCACG GTCGTCTGGC GAAGCGCGGC GTGCTGGCAG TCATGCCGGA TATGGACAAA
TTCCCGTACA CCGTACGTGT GGTGTCTGAA ATCACCGAAT CCAACGGTTC TTCTTCTATG
GCTTCCGTGT GCGGCGCGTC TCTGGCGCTG ATGGACGCAG GTGTGCCAAT CAAAGCTGCC
GTTGCGGGTA TCGCAATGGG TCTGGTGAAA GAAGGCGACA ACTACGTTGT ACTGTCTGAC
ATTTTGGGCG ACGAAGATCA CCTGGGCGAT ATGGACTTCA AAGTTGCGGG TTCCCGCGAC
GGTATCTCTG CATTGCAGAT GGATATCAAA ATTGAAGGTA TCACCAAAGA GATCATGCAG
GTTGCACTGA ACCAAGCTAA AGGTGCGCGT CTGCATATCC TGGGCGTAAT GGAACAGGCG
ATCAACGCGC CGCGCGGCGA TATCTCTGAG TTCGCTCCGC GTATCCATAC CATCAAGATC
AACCCGGACA AGATCAAAGA CGTTATCGGT AAAGGCGGTT CTGTTATCCG TGCTCTGACC
GAAGAAACTG GCACCACCAT CGAAATCGAA GATGACGGTA CTGTGAAAAT CGCAGCGACC
GACGGCGAGA AAGCGAAACA TGCTATTCGT CGTATCGAAG AGATCACAGC AGAAATCGAA
GTGGGCCGCG TCTACAATGG TAAAGTGACC CGTATCGTTG ACTTTGGCGC ATTTGTTGCC
ATCGGCGGCG GTAAAGAAGG TCTGGTACAC ATCTCTCAGA TCGCTGACAA ACGCGTTGAG
AAAGTGACCG ATTACCTGCA GATGGGTCAG GAAGTACCGG TGAAAGTTCT GGAAGTTGAT
CGCCAGGGCC GTATCCGTCT GAGCATTAAA GAAGCGACTG AGCAGTCTCA ACCTGCTGCA
GCACCGGAAG CTCCGGCTGC TGAACAGGGC GAGTAA
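
The GC content reported for this gene can be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in blocks of ten bases separated by spaces and line breaks, as it appears here. The helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Illustration on the first two blocks of the gene sequence only;
# paste the full sequence to compare against the GC content in the record.
first_blocks = "TTGCTTAATC CGATCGTTCG"
print(f"{gc_content(first_blocks):.0%}")
```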
 
Protein sequence
MLNPIVRKFQ YGQHTVTLET GMMARQATAA VMVSMDDTAV FVTVVGQKKA KPGQDFFPLT 
VNYQERTYAA GRIPGSFFRR EGRPSEGETL IARLIDRPIR PLFPEGFVNE VQVIATVVSV
NPQVNPDIVA MIGASAALSL SGIPFNGPIG AARVGYINDQ YVLNPTQDEL KESKLDLVVA
GTEAAVLMVE SEAELLSEDQ MLGAVVFGHE QQQVVIQNIN ELVKEAGKPR WDWQPEPVNE
ALNARVAALA EARLSDAYRI TDKQERYAQV DVIKSETIAT LLAEDETLDE NELGEILHAI
EKNVVRSRVL AGEPRIDGRE KDMIRGLDVR TGVLPRTHGS ALFTRGETQA LVTATLGTAR
DAQVLDELMG ERTDTFLFHY NFPPYSVGET GMVGSPKRRE IGHGRLAKRG VLAVMPDMDK
FPYTVRVVSE ITESNGSSSM ASVCGASLAL MDAGVPIKAA VAGIAMGLVK EGDNYVVLSD
ILGDEDHLGD MDFKVAGSRD GISALQMDIK IEGITKEIMQ VALNQAKGAR LHILGVMEQA
INAPRGDISE FAPRIHTIKI NPDKIKDVIG KGGSVIRALT EETGTTIEIE DDGTVKIAAT
DGEKAKHAIR RIEEITAEIE VGRVYNGKVT RIVDFGAFVA IGGGKEGLVH ISQIADKRVE
KVTDYLQMGQ EVPVKVLEVD RQGRIRLSIK EATEQSQPAA APEAPAAEQG E
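
The protein sequence above corresponds to a translation of the gene sequence with translation table 11. Note that the gene starts with TTG rather than ATG, yet the protein starts with M. The sketch below illustrates this under two assumptions: the codon assignments of table 11 match the standard genetic code, and the first codon is always read as methionine because it is the initiator. The function and variable names are hypothetical, and this is an illustration rather than the method used to produce the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; assumed to apply to table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(raw: str) -> str:
    """Translate a coding sequence, stopping at the first in-frame stop codon."""
    dna = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        # The initiator codon (TTG in this gene) is read as methionine.
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate_cds("TTGCTTAATC CGATCGTTCG TAAATTCCAG"))
```

Applied to the full gene sequence, the same function can be used to compare the result against the protein sequence listed in this record.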