Gene SbBS512_E1286 details


Gene Information

Locus tag: SbBS512_E1286
Symbol:
ID: 6273229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1175200
End bp: 1177128
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 52%
IMG OID: 641725407
Product: phage terminase large subunit (GpA)
Protein accession: YP_001879918
Protein GI: 187731821
COG category: [R] General function prediction only
COG ID: [COG5525] Bacteriophage tail assembly protein
TIGRFAM ID:
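
The gene length and protein length listed above follow from the start and end coordinates. The Python sketch below is a consistency check only, and it rests on two assumptions that the record does not state: coordinates are 1-based and inclusive, and the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1175200, 1177128)
print(length)                     # 1929
print(protein_length_aa(length))  # 642
```

Both values agree with the Gene Length and Protein Length fields.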


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATATAT CAGAGCAACA ACTGAATAAT ATGATGAGTG CTGTCACAAC AGCATTACAG 
CCCCTGATAA GGGCATTGCC GGTGACGCCA GTTGAATGGG CTGATCAAAA TTATTATCTG
CCTAAAGAAT CTTCATATGG TGAGGGAGAA TGGAAAACGC TGCCGTTCCA GGTCGCCATT
ATGAACTGTA TGGGTAACGA CCAGGTTCGC ACGGTTAACC TGATTAAATC TGCCCGTGTT
GGCTATACAA AGATGTTGCT GGGGGTGGTC GGGTATTTTA TTGAGCATAA ATCACGAAAC
AGTCTGCTTT TTCAGCCCAC GGATTCTGCC GCTGAAGATT TTATGAAGTC TCACGTGGAG
GCGACGATTC GGGATGTGCC ATGCCTGAAA GACCTTTCCC CATGGCTGGG TCGTAAACAT
CGTGATAATA CCCTCACGCT GAAACGTTTT TCATCGGGCG TGGGCTTCTG GTGCCTGGGC
GGGGCTGCCG CTAAAAACTA CCGTGAAAAA TCCGTGGACG TGGTCTGCTA TGACGAACTT
TCCTCGTTCG AACCGGATGT CGAAAAAGAG GGTTCGCCAA CCCTGCTGGG GGATAAACGT
ATTGAGGGCT CTGTATGGCC CAAATCCATT CGCGGCTCGA CGCCTAAAAT CAAAGGCACC
TGCCAGATCG AAAAAGCGGC CAACGAGTCG GCGCATTTTA TGCGTTTTTA TGTGCCCTGC
CCACACTGTG GGGAGGAGCA GTATCTGAAA TTTGGCGATG AGTCCACGCC TTTTGGGCTT
AAATGGGAGA AGGACAGTCC CGAAAGTGTT TTCTACCTCT GTGAACATCA TGGCTGCGTG
ATCCATCAGT CTGAACTGGA CCAGAGCAAC GGGCGGTGGA TCTGTGAAAA CACGGGCATG
TGGACCCGTG ACGGTCTGAC GTTTTTCAGC GCTGCGGGTA ATGAAATTCC GCCGCCGCGC
TCCATCACGT TCCATATCTG GACGGCGTAC AGTCCGTTTA CCACCTGGGT ACAGATAGTC
TATGACTGGC TGGATGCACT GAAAGATCCC AACGGCCTGA AAACCTTTGT GAACACCACG
CTGGGCGAGA CCTGGGAAGA AGCCGTGGGC GAAAAACTCG ATCACCAGGT ACTGATGGAT
AAGGTCGTGC ATTACACGGC GGCGGTACCT GCCCGGGTGG TTTATCTGAC GGCGGGCATT
GACTCGCAGC GAAACCGTTT TGAGATGTAT GTCTGGGGAT GGGCACCGGG AGAGGAAGCT
TTTCTGGTGG ATAAAATCAT CATTATGGGC CGTCCCGATG AGGAAGAGAC GCTGTTACGT
GTGGATGCGG CGATCAACAA AAAATACTGC CATGCAGACG GAACCGAAAT GACCATTTCC
CGTGTCTGCT GGGACACCGG GGGGATCGAT GGTGAAATTG TCTATCAGAG GTCAAAAAAA
CACGGTGTTT TCCGGGTGCT GCCGGTAAAA GGCGCATCTG TCTATGGCAA GCCGGTGATC
ACCATGCCGA AAACCCGCAA TCAGCGGGGC GTGTATCTGT GTGAAGTGGG GACGGACACC
GCAAAAGAAA TTCTCTATGC CCGTATGAAA GCCGATCCCA CGCCTGCGGA TGAAGCCACG
TCGTATGCCA TCCGTTTTCC TGATGATCCG GAGATTTTTT CGCAGACAGA GGCGCAGCAA
CTGGTCGCGG AAGAGCTTGT GGAGAAGTGG GAAAAAGGAA AGATGCGTCT GCTGTGGGAT
AACAAAAAGC GGCGTAACGA AGCGCTGGAC TGCCTGGTGT ATGCCTACGC GGCATTACGT
GTGTCCGTGC AACGCTGGCA GCTTGATCTG GCTGTACTGG CAAAATCCCG GGAAGAAGAG
ACGACCCGGC CAACCCTTAA AGAACTGGCA GCGAAGCTGT CCGGAGGAGT GAATGGTTAC
AGTCGCTGA
 
Protein sequence
MNISEQQLNN MMSAVTTALQ PLIRALPVTP VEWADQNYYL PKESSYGEGE WKTLPFQVAI 
MNCMGNDQVR TVNLIKSARV GYTKMLLGVV GYFIEHKSRN SLLFQPTDSA AEDFMKSHVE
ATIRDVPCLK DLSPWLGRKH RDNTLTLKRF SSGVGFWCLG GAAAKNYREK SVDVVCYDEL
SSFEPDVEKE GSPTLLGDKR IEGSVWPKSI RGSTPKIKGT CQIEKAANES AHFMRFYVPC
PHCGEEQYLK FGDESTPFGL KWEKDSPESV FYLCEHHGCV IHQSELDQSN GRWICENTGM
WTRDGLTFFS AAGNEIPPPR SITFHIWTAY SPFTTWVQIV YDWLDALKDP NGLKTFVNTT
LGETWEEAVG EKLDHQVLMD KVVHYTAAVP ARVVYLTAGI DSQRNRFEMY VWGWAPGEEA
FLVDKIIIMG RPDEEETLLR VDAAINKKYC HADGTEMTIS RVCWDTGGID GEIVYQRSKK
HGVFRVLPVK GASVYGKPVI TMPKTRNQRG VYLCEVGTDT AKEILYARMK ADPTPADEAT
SYAIRFPDDP EIFSQTEAQQ LVAEELVEKW EKGKMRLLWD NKKRRNEALD CLVYAYAALR
VSVQRWQLDL AVLAKSREEE TTRPTLKELA AKLSGGVNGY SR
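
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the method used to produce this record. It assumes the sequence is given 5' to 3' on the coding strand, and it relies on translation table 11 assigning the same amino acid to each codon as the standard code (the two differ only in permitted start codons, and this gene starts with ATG). `CODON_TABLE` and `translate` are names introduced here for illustration.

```python
from itertools import product

# Codon -> amino acid map in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAATATAT CAGAGCAACA ACTGAATAAT"))  # MNISEQQLNN
print(translate("AGTCGCTGA"))                         # SR*
```

The first ten residues and the final two residues match the protein sequence, and the `*` marks the TGA stop codon that closes the reading frame.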