Gene EcSMS35_0154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0154 
SymbolpcnB 
ID6144731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp168134 
End bp169498 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content56% 
IMG OID641615055 
Productpoly(A) polymerase I 
Protein accessionYP_001742271 
Protein GI170680324 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0617] tRNA nucleotidyltransferase/poly(A) polymerase 
TIGRFAM ID[TIGR01942] poly(A) polymerase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000110744 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTAAGCC GCGAGGAAAG CGAGGCTGAA CAGGCAGTCG CCCGTCCACA GGTGACGGTG 
ATCCCGCGTG AGCAGCATGC TATTTCCCGC AAAGATATCA GTGAAAATGC CCTGAAGGTA
ATGTACAGGC TCAATAAAGC GGGATACGAA GCCTGGCTGG TTGGCGGCGG CGTGCGCGAC
CTGTTACTTG GCAAAAAGCC GAAAGATTTT GACGTGACCA CCAACGCCAC GCCAGAGCAG
GTGCGCAAAC TGTTCCGTAA CTGCCGCCTG GTGGGTCGCC GTTTCCGTCT GGCTCATGTG
ATGTTTGGCC CGGAGATTAT CGAAGTTGCG ACCTTCCGTG GACATCACGA AGGTAACGTC
AGCGATCGCA CGACCTCCCA ACGCGGGCAA AACGGTATGT TGCTGCGCGA CAACATTTTC
GGCTCCATCG AAGAAGACGC CCAGCGCCGC GATTTCACTA TCAACAGCCT GTATTACAGC
GTGGCAGATT TTACCGTCCG TGATTACGTT GGCGGCATGA AGGATCTGAA AGACGGCGTT
ATCCGTCTGA TTGGTAACCC GGAAACGCGC TACCGTGAAG ATCCGGTACG GATGCTTCGC
GCGGTTCGTT TTGCCGCCAA ATTGGGCATG CGCATCAGCC CGGAAACTGC CGAGCCGATC
CCACGCCTGG CAACCTTACT GAACGATATC CCACCGGCAC GTCTGTTTGA AGAATCGCTG
AAACTGTTGC AAGCGGGCTA CGGTTACGAA ACTTATAAGC TGCTGTGTGA ATATCATCTG
TTCCAGCCGC TGTTTCCGAC CATTACCCGC TACTTCACGG AAAATGGCGA CAGCCCGATG
GAGCGGATCA TTGAACAGGT GCTGAAGAAT ACCGATACGC GTATCCATAA CGATATGCGC
GTGAACCCGG CGTTCCTGTT TGCCGCCATG TTCTGGTACC CATTGCTGGA GACGGCACAG
AAGATCGCCC AGGAAAGCGG CCTGACCTAT CACGACGCTT TCGCACTGGC GATGAACGAC
GTGCTGGACG AAGCCTGCCG TTCACTGGCG ATCCCGAAAC GTCTGACTAC GCTAACTCGC
GATATCTGGC AGTTGCAGTT GCGTATGTCC CGTCGTCAGG GAAAACGCGC ATGGAAACTG
CTGGAGCATC CTAAGTTCCG TGCGGCATAC GACCTGTTGG CCTTACGAGC CGAAGTTGAG
CGTAACGCTG AACTGCAACG TCTGGTGAAA TGGTGGGGTG AGTTCCAGGT TTCCGCGCCG
CCAGACCAAA AAGGGATGCT TAACGAACTG GATGAAGAAC CGTCACCGCG TCGTCGTACT
CGTCGTCCAC GCAAACGCGC ACCGCGTCGT GAGGGTACCG CATGA
 
Protein sequence
MLSREESEAE QAVARPQVTV IPREQHAISR KDISENALKV MYRLNKAGYE AWLVGGGVRD 
LLLGKKPKDF DVTTNATPEQ VRKLFRNCRL VGRRFRLAHV MFGPEIIEVA TFRGHHEGNV
SDRTTSQRGQ NGMLLRDNIF GSIEEDAQRR DFTINSLYYS VADFTVRDYV GGMKDLKDGV
IRLIGNPETR YREDPVRMLR AVRFAAKLGM RISPETAEPI PRLATLLNDI PPARLFEESL
KLLQAGYGYE TYKLLCEYHL FQPLFPTITR YFTENGDSPM ERIIEQVLKN TDTRIHNDMR
VNPAFLFAAM FWYPLLETAQ KIAQESGLTY HDAFALAMND VLDEACRSLA IPKRLTTLTR
DIWQLQLRMS RRQGKRAWKL LEHPKFRAAY DLLALRAEVE RNAELQRLVK WWGEFQVSAP
PDQKGMLNEL DEEPSPRRRT RRPRKRAPRR EGTA