Gene SbBS512_E1454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1454 
Symbol 
ID6269814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1327230 
End bp1329158 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content52% 
IMG OID641725555 
Productphage terminase large subunit (GpA) 
Protein accessionYP_001880061 
Protein GI187730353 
COG category[R] General function prediction only 
COG ID[COG5525] Bacteriophage tail assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTCAG ACGCACAGAA GGCAGCTAAT GCAGCCGGTG CGATAGCTAC AGGGCTTTTA 
TCTCTCATTA TTCCTGTTCC ACTGACGACA GTTCAGTGGG CCAATAAACA TTATTACCTT
CCTAAAGAGT CGTCTTATAC CCCGGGGCGG TGGGAAACAC TGCCGTTTCA GGTTGGCATC
ATGAACTGTA TGGGCAACGA TTTGATTCGC ACTGTTAACC TGATTAAATC TGCCCGTGTT
GGTTATACAA AGATGTTGCT GGGAGTGGAG GCTTATTTTA TTGAGCATAA ATCACGCAAC
AGCCTTCTTT TTCAGCCCAC GGACTCAGCT GCTGAAGATT TTATGAAATC TCATGTTGAG
CCAACGATAA GGGATGTTCC TGCATTGCTG GAGCTGGCTC CATGGTTCGG AAGAAAACAC
CGCGATAATA CGCTCACCCT GAAGCGTTTT TCCTCCGGTG TGGGTTTCTG GTGTCTGGGG
GGAGCGGCAG CAAAAAACTA CCGTGAAAAA TCCGTGGATG TGGTTTGTTA TGACGAGCTT
TCCTCGTTCG AACCGGATGT TGAAAAAGAG GGTTCGCCAA CCCTGCTGGG GGATAAACGT
ATTGAGGGCT CTGTATGGCC AAAATCCATT CGCGGCTCGA CGCCTAAAAT CAAAGGCTCC
TGCCAGATCG AAAAAGCCGC TAACGAGTCG GCACATTTCA TGCGTTTTTA TGTGCCCTGT
CCGCACTGTG GGGAGGAGCA GTATCTGAAA TTTGGCGATG ATGCCTCGCC TTTCGGTCTT
AAGTGGGAGA AGAATAAGCC AGAAAGTGTT TTCTACCTTT GTGAGCATCA TGGCTGTGTG
ATCCATCAGT CTGAGCTTGA CCAGAGTAAC GGGCGGTGGA TCTGTGAAAA CACGGGCATG
TGGACTCGTG ACGGTCTGAC ATTTTTCAGC GCCGCGGATA ATGAAATTCC GCCGCCGCGC
TCCATCACAT TCCATATCTG GACGGCGTAC AGTCCGTTCA CCACCTGGGT ACAGATTGTC
TATGACTGGC TGGATGCACT GAAAGATCCC AACGGCCTGA AAACCTTTGT GAACACCACG
CTGGGCGAGA CCTGGGAAGA GGCCGTGGGC GAAAAACTCG ATCACCAGGT GCTGATGGAT
AAGGTTGTGC GTTACACGGC TGCGGTGCCT TCCCGGGTGG TTTATCTGAC GGCGGGCATT
GACTCGCAGC GAAACCGTTT TGAGATGTAT GTCTGGGGAT GGGCTCCGGG AGAGGAAGCC
TTTCTGGTGG ATAAAATCAT CATTATGGGG CGTCCCGATG AGGAAGAGAC GCTGTTACGT
GTGGATGTGG CGATCAACAA AAAATACCGC CATGCAGACG GAACCGAAAT GACCATTTCC
CGTGTCTGCT GGGACACCGG GGGGATCGAT GGCGAAATTG TCTATCAGAG GTCAAAAAAA
CACGGTGTTT TCCGGGTGCT GCCGGTAAAA GGTGCATCTG TTTATGGCAA GCCGGTGATC
ACCATGCCAA AAACCCGCAA TCAGCGGGGC GTGTATCTGT GCGAAGTGGG GACGGACACC
GCAAAAGAAA TTCTCTATGC CCGTATGAAA GCCGATCCCA CGCCTGCGGA TGAAGCCACG
TCGTATGCCA TCCGTTTTCC TGATGATCCG GAGATTTTTT CGCAGACAGA GGCGCAGCAA
CTGGTGGCGG AAGAGCTTGT GGAGAAGTGG GAAAAAGGAA AGATGCGTCT GCTGTGGGAT
AACAAAAAGC GGCGTAACGA AGCGCTGGAC TGCCTGGTGT ATGCCTACGC GGCATTACGT
GTGTCCGTGC AACGCTGGCA GCTTGATCTG GCTGTACTGG CAAAATCCCG GGAAGAAGAG
ACGACCCGGC CAACCCTTAA AGAACTGGCA GCGAAGCTGT CCGGAGGAGT GAATGGTTAC
AGTCGCTGA
 
Protein sequence
MISDAQKAAN AAGAIATGLL SLIIPVPLTT VQWANKHYYL PKESSYTPGR WETLPFQVGI 
MNCMGNDLIR TVNLIKSARV GYTKMLLGVE AYFIEHKSRN SLLFQPTDSA AEDFMKSHVE
PTIRDVPALL ELAPWFGRKH RDNTLTLKRF SSGVGFWCLG GAAAKNYREK SVDVVCYDEL
SSFEPDVEKE GSPTLLGDKR IEGSVWPKSI RGSTPKIKGS CQIEKAANES AHFMRFYVPC
PHCGEEQYLK FGDDASPFGL KWEKNKPESV FYLCEHHGCV IHQSELDQSN GRWICENTGM
WTRDGLTFFS AADNEIPPPR SITFHIWTAY SPFTTWVQIV YDWLDALKDP NGLKTFVNTT
LGETWEEAVG EKLDHQVLMD KVVRYTAAVP SRVVYLTAGI DSQRNRFEMY VWGWAPGEEA
FLVDKIIIMG RPDEEETLLR VDVAINKKYR HADGTEMTIS RVCWDTGGID GEIVYQRSKK
HGVFRVLPVK GASVYGKPVI TMPKTRNQRG VYLCEVGTDT AKEILYARMK ADPTPADEAT
SYAIRFPDDP EIFSQTEAQQ LVAEELVEKW EKGKMRLLWD NKKRRNEALD CLVYAYAALR
VSVQRWQLDL AVLAKSREEE TTRPTLKELA AKLSGGVNGY SR