Gene SNSL254_A0201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0201 
SymbolpcnB 
ID6486780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp216025 
End bp217338 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content57% 
IMG OID642735638 
Productpoly(A) polymerase I 
Protein accessionYP_002039420 
Protein GI194446356 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0617] tRNA nucleotidyltransferase/poly(A) polymerase 
TIGRFAM ID[TIGR01942] poly(A) polymerase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000171206 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATTA TCCCGCGTGA ACAGCACGCT ATCTCCCGCA AAGATATCAG TGAAAATGCC 
CTCAAGGTAC TGTACAGGCT GAACAAAGCG GGCTATGAAG CCTACCTGGT CGGCGGCGGC
GTCCGCGATC TCCTGCTCGG TAAAAAGCCG AAGGATTTCG ACGTGACCAC CAACGCAACA
CCGGATCAGG TACGGAAATT ATTCCGCAAT TGCCGTCTGG TGGGGCGTCG TTTCCGCCTG
GCTCACGTGA TGTTTGGCCC GGAAATTATC GAAGTGGCAA CGTTTCGTGG TCATCATGAA
GGCAGTGAAA GCGACCGTAC GACCTCCCAG CGTGGGCAAA ACGGTATGCT GCTGCGCGAC
AACATCTTCG GTTCTATCGA AGAAGATGCC CAGCGCCGCG ATTTCACCAT CAACAGCCTT
TACTACAGCG TGGCGGATTT TACTGTGCGC GATTACGTCG GCGGGATGCA GGATCTGCAA
GAAGGCGTGA TTCGCCTGAT CGGCAATCCG GAAACGCGCT ACCGCGAAGA TCCGGTTCGA
ATGCTGCGCG CCGTGCGTTT CGCTGCGAAG CTCAATATGC GTATCAGCCC TGAAACGGCT
GAGCCAATCC CGCGTCTGGC AACCTTGCTA AACGACATTC CTCCCGCGCG CCTGTTCGAA
GAGTCGCTGA AGCTGTTGCA GGCGGGGAAC GGTTATGAAA CCTATCAACA ACTGCGGGAA
TACCACCTCT TCCAGCCGTT GTTTCCTACC ATTACGCGTT ATTTCACCGA AAACGGCGAC
AGCGCAATGG AACGCATCAT TGCACAGGTG TTGAAGAATA CGGATAACCG CATCCGTAAC
GAGATGCGCG TTAACCCGGC GTTTTTGTTT GCCGCCATGT TCTGGTATCC GCTGCTGGAG
ATGGCGCAAA AAATCGCTCA GGAGAGCGGC CTGGCCTATT ACGATGCTTT CGCGCTGGCC
ATGAATGACG TGCTGGATGA AGCCTGCCGT TCACTGGCGA TCCCGAAACG CCTTACCACG
CTGACCCGTG ATATTTGGCA GCTTCAGTTA CGCATGTCCC GTCGTCAGGG CAAACGCGCC
TGGAAGCTGA TGGAACATCC CAAATTCCGC GCCGCGTTTG ATTTGCTGGA GCTGCGCGCT
CAGGTGGAAA ATAATACTGA ACTGCAACGT CTGGCGCAGT GGTGGGCCGA GTTTCAGGCT
TCCGCGCCGC CGGAACAAAA AGGGATGCTC AACGAGCTGG ACGACGATCC TGCTCCACGC
CGCCGTCGTT CACGTCCGCG CAAACGCGCG CCGCGCCGCG AGGGCACCGT ATGA
 
Protein sequence
MTIIPREQHA ISRKDISENA LKVLYRLNKA GYEAYLVGGG VRDLLLGKKP KDFDVTTNAT 
PDQVRKLFRN CRLVGRRFRL AHVMFGPEII EVATFRGHHE GSESDRTTSQ RGQNGMLLRD
NIFGSIEEDA QRRDFTINSL YYSVADFTVR DYVGGMQDLQ EGVIRLIGNP ETRYREDPVR
MLRAVRFAAK LNMRISPETA EPIPRLATLL NDIPPARLFE ESLKLLQAGN GYETYQQLRE
YHLFQPLFPT ITRYFTENGD SAMERIIAQV LKNTDNRIRN EMRVNPAFLF AAMFWYPLLE
MAQKIAQESG LAYYDAFALA MNDVLDEACR SLAIPKRLTT LTRDIWQLQL RMSRRQGKRA
WKLMEHPKFR AAFDLLELRA QVENNTELQR LAQWWAEFQA SAPPEQKGML NELDDDPAPR
RRRSRPRKRA PRREGTV