Gene SbBS512_E0228 details


Gene Information

Locus tag: SbBS512_E0228
Symbol: dinP
ID: 6272955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 236157
End bp: 237221
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 54%
IMG OID: 641724476
Product: DNA polymerase IV
Protein accession: YP_001879027
Protein GI: 187733383
COG category: [L] Replication, recombination and repair
COG ID: [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair
TIGRFAM ID:
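
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive and that the annotated range includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 236157
end_bp = 237221

# Inclusive range: both endpoints count as bases of the gene.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the final codon is assumed
# to be the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, "bp;", protein_length, "aa")
```

Under these assumptions the result agrees with the listed 1065 bp and 354 aa.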


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.190428
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTATAAGA AAGTGAAGAT TATTCATGTG GATATGGACT GCTTTTTCGC GGCGGTGGAG 
ATGCGCGACA ATCCCGCCCT GCGCGATATC CCTATTGCTA TTGGCGGCAG CCGCGAACGT
CGGGGGGTGA TCAGTACCGC CAATTATCCC GCGCGTAAAT TTGGCGTACG TAGCGCTATG
CCGACAGGGA TGGCGCTCAA ATTATGCCCG CATCTCACCT TGCTTCCGGG GCGCTTTGAC
GCCTACAAAG AAGCCTCAAA TCATATCCGC GAAATCTTCT CGCGCTACAC CTCGCGTATT
GAACCGTTGT CACTGGATGA GGCTTATCTC GACGTCACCG ATAGCGTCCA TTGCCACGGT
TCTGCGACCC TCATCGCCCA GGAAATCCGC CAGACGATTT TCAACGAGCT GCAACTGACG
GCGTCTGCGG GCGTGGCACC CGTAAAGTTT CTCGCCAAAA TCGCCTCCGA CATGAATAAA
CCCAACGGCC AGTTTGTGAT TACGCCGGCA GAAGTTCCGG CATTTTTACA AACCTTACCA
CTGGCAAAAA TCCCCGGCGT CGGCAAAGTC TCGGCGGCAA AACTGGAAGC GATGGGGCTA
CGAACCTGCG GTGATGTACA AAAGTGTGAT CTGGTGATGC TGCTTAAACG CTTTGGCAAA
TTTGGCCGCA TTTTGTGGGA GCGTAGTCAG GGGATTGACG AGCGCGACGT TAACAGCGAA
CGGTTGCGAA AATCCGTCGG CGTGGAACGC ACGATGGCGG AAGATATCCA CCACTGGTCT
GAATGTGAAG CGATTATCGA GCGGCTGTAT CCGGAACTTG AACGCCGTCT GGCAAAGGTG
AAACCTGATT TACTGATTGC TCGCCAGGGG GTGAAATTAA AGTTTGATGA TTTTCAGCAA
ACCACTCAGG AGCACGTCTG GCCGCGGCTG AATAAAGCTG ACTTAATCGC CACCGCGCGT
AAAACCTGGG ATGAACGCCG CGGCGGGCGC GGTGTGCGAC TGGTGGGGCT GCATGTGACG
TTGCTTGACC CGCAAATGGA AAGACAACTG GTGCTGGGAT TATGA
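
The record reports a GC content of 54% for this gene. The sketch below shows how such a figure is conventionally computed from a sequence laid out in 10-base groups like the one above. The function name `gc_percent` is hypothetical, and the record does not say how its own value was rounded.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (spaces and line breaks between 10-base groups) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Small illustrative input, not taken from the record.
print(gc_percent("GGCC AATT"))
```

To check the listed value, pass the full gene sequence (all rows joined) to the function.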
 
Protein sequence
MYKKVKIIHV DMDCFFAAVE MRDNPALRDI PIAIGGSRER RGVISTANYP ARKFGVRSAM 
PTGMALKLCP HLTLLPGRFD AYKEASNHIR EIFSRYTSRI EPLSLDEAYL DVTDSVHCHG
SATLIAQEIR QTIFNELQLT ASAGVAPVKF LAKIASDMNK PNGQFVITPA EVPAFLQTLP
LAKIPGVGKV SAAKLEAMGL RTCGDVQKCD LVMLLKRFGK FGRILWERSQ GIDERDVNSE
RLRKSVGVER TMAEDIHHWS ECEAIIERLY PELERRLAKV KPDLLIARQG VKLKFDDFQQ
TTQEHVWPRL NKADLIATAR KTWDERRGGR GVRLVGLHVT LLDPQMERQL VLGL
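
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG can act as a start codon and is then read as methionine. The sketch below translates the first row of the gene sequence to illustrate this. It is a simplified illustration, not the tool used to produce the record: the `translate` function and `START_CODONS` set are hypothetical, and only ATG, GTG, and TTG are treated as starts here.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G order.
# Table 11 uses the same assignments as the standard code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of alternative start codons, for illustration.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from the record.
first_row = "GTGTATAAGA AAGTGAAGAT TATTCATGTG GATATGGACT GCTTTTTCGC GGCGGTGGAG"
print(translate(first_row))  # MYKKVKIIHVDMDCFFAAVE
```

The output matches the first 20 residues of the protein sequence listed above.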