Gene SbBS512_E0134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0134 
SymbolpcnB 
ID6270946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp144799 
End bp146163 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content56% 
IMG OID641724387 
Productpoly(A) polymerase I 
Protein accessionYP_001878946 
Protein GI187731223 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0617] tRNA nucleotidyltransferase/poly(A) polymerase 
TIGRFAM ID[TIGR01942] poly(A) polymerase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000109672 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTAAGCC GCGAGGAAAG CGAGGCTGAA CAGGCAGTCG CCCGTCCACA GGTGACGGTG 
ATCCCGCGTG AGCAGCATGC TATTTCCCGC AAAGATATCA GTGAAAATGC CCTGAAGGTA
ATGTACAGGC TCAATAAAGC GGGATACGAA GCCTGGCTGG TTGGCGGCGG CGTGCGCGAC
CTGTTACTTG GCAAAAAGCC GAAAGATTTT GACGTAACCA CTAACGCCAC GCCTGAGCAG
GTGCGCAAAC TGTTCCGTAA CTGCCGCCTT GTGGGTCGCC GTTTCCGTCT GGCTCATGTG
ATGTTTGGCC CGGAGATTAT CGAAGTTGCA ACCTTCCGTG GACACCACGA AGGTAACGTC
AGCGACCGCA CGACCTCCCA ACGCGGGCAA AACGGCATGT TGCTGCGCGA CAACATTTTC
GGCTCCATCG AAGAAGACGC CCAGCGCCGC GATTTCACTA TCAACAGCCT GTATTACAGC
GTAGCGGATT TTACCGTCCG TGATTACGTT GGCGGCATGA AGGATCTGAA GGACGGCGTT
ATCCGTCTGA TTGGTAACCC GGAAACGCGC TACCGTGAAG ATCCGGTACG TATGCTGCGC
GCGGTACGTT TTGCCGCCAA ATTGGGTATG CGCATCAGCC CGGAAACCGC AGAACCGATC
CCTCGCCTCG CTACCCTGCT GAACGATATC CCACCGGCAC GCCTGTTTGA AGAATCGCTT
AAACTGCTAC AAGCGGGCTA CGGTTACGAA ACCTATAAGC TGTTGTGTGA ATATCATCTG
TTCCAGCCGC TGTTCCCGAC CATTACCCGC TACTTCACGG AAAATGGCGA CAGCCCGATG
GAGCGGATCA TTGAACAGGT GCTGAAGAAT ACCGATACGC GTATCCATAA CGATATGCGC
GTGAACCCGG CGTTCTTGTT TGCCGCCATG TTCTGGTACC CACTGCTGGA GACGGCACAG
AAGATCGCCC AGGAAAGCGG CCTGACCTAT CACGACGCTT TCGCGCTGGC GATGAACGAC
GTGCTGGACG AAGCCTGCCG TTCACTGGCA ATCCCGAAAC GTCTGACGAC GTTAACCCGC
GATATCTGGC AGTTGCAGTT GCGTATGTCC CGTCGTCAGG GTAAACGCGC ATGGAAACTG
CTGGAGCATC CTAAGTTCCG TGCGGCTTAT GACCTGTTGG CCTTGCGAGC TGAAGTTGAA
CGTAACGCTG AACTGCAGCG TCTGGTGAAA TGGTGGGGTG AGTTCCAGGT TTCCGCGCCA
CCAGATCAAA AAGGGATGCT TAACGAGTTG GATGAAGAGC CGTCACCGCG TCGCCGTACT
CGTCGTCCAC GCAAACGCGC ACCGCGTCGT GAGGGTACCG CATGA
 
Protein sequence
MLSREESEAE QAVARPQVTV IPREQHAISR KDISENALKV MYRLNKAGYE AWLVGGGVRD 
LLLGKKPKDF DVTTNATPEQ VRKLFRNCRL VGRRFRLAHV MFGPEIIEVA TFRGHHEGNV
SDRTTSQRGQ NGMLLRDNIF GSIEEDAQRR DFTINSLYYS VADFTVRDYV GGMKDLKDGV
IRLIGNPETR YREDPVRMLR AVRFAAKLGM RISPETAEPI PRLATLLNDI PPARLFEESL
KLLQAGYGYE TYKLLCEYHL FQPLFPTITR YFTENGDSPM ERIIEQVLKN TDTRIHNDMR
VNPAFLFAAM FWYPLLETAQ KIAQESGLTY HDAFALAMND VLDEACRSLA IPKRLTTLTR
DIWQLQLRMS RRQGKRAWKL LEHPKFRAAY DLLALRAEVE RNAELQRLVK WWGEFQVSAP
PDQKGMLNEL DEEPSPRRRT RRPRKRAPRR EGTA