Gene SbBS512_E2487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2487 
Symbol 
ID6268397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2286906 
End bp2288006 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content55% 
IMG OID641726479 
Productphage late control gene D protein 
Protein accessionYP_001880959 
Protein GI187734053 
COG category[R] General function prediction only 
COG ID[COG3500] Phage protein D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATTTCA GCTCTGAACT GCTTAACAAA GGCAACAAAA CTCCGGCATT CAGCATCAGT 
ATTGAAGGCA GGGATATCAC CACTGTGCTG GACAACCGCC TGATGGGGCT GACGCTGACG
GATAACCGGG GCTTTGAAGC GGACCAGCTT GATCTGGAGC TGGACGACGC CGATGGAAAA
ATCGTGCTGC CGCGCCGTGG GGCTGTCATT ACGCTGGCGC TGGGCTGGAA GGGACAGCCG
CTTTTCCCGA AAGGGGCATT CACGGTGGAC GAGATTGAAC ACACTGGCGC ACCGGATCGC
CTGACTATCC GGGCGCGAAG TGCTGATTTT CGTGAAACGC TGAATACCCG CCGTGAAAAG
TCGTGGCACA AGACCACCGT TGGGGAAGTG GTGAAGGAAA TAGCTGCGCG GCACAAACTG
AAGATGGCAT TGGGTGAAGA CTTGTCGGAT AAACTCGTGG AGCATATAGA CCAGACTAAT
GAGAGTGACG GCAGTTTTTT GATGCGGCTG GCGCGCCAGT ACGGTGCTAT TGCGTCGGTG
AAAAATGGCA ATCTGTTATT CATCCGGCAG GGGCAGGGTA AAAGCGCCAG CGGTAAACCA
CTACCGGTGA TCACTATCAC ACGCAAGGAC GGCGACAGTC ACCGCTTTAC CCTGGCAGAT
CGCGGAGCTT ACACGGGCGT CATTGCCAGC TGGTTGCATA CCCGCGAACC CGCGAAGAAA
GAAAGCACCA CGGTGAAGCG TAAGCGCAGG ACTAAGAAGC AGAAGAAAGA GCCGGAAGCG
AAGCAGGGCG ATTACCTGGT GGGTACGGAT GAAAACGTGC TGGTACTTAA TCGCACTTAT
GCCAACCGGA GCAACGCCGA ACGGGCAGCG AAAATGCAGT GGGAACGCCT GCAACGCGGC
GTTGCATCAT TCTCGCTACA ACTGGCGGAA GGGCGGGCAG ATCTCTACAC AGAAATGCCT
GTGAAGGTCA GTGGCTTTAA ACAGCCGATA GATGATGCGG AATGGACCAT TACGACTCTG
ACGCATACTG TCAGCCCGGA TAACGGTTTT ACGACCAGTC TGGAGCTTGA AGTGAGGATT
GATGATTTCG AAATGGAATG A
 
Protein sequence
MNFSSELLNK GNKTPAFSIS IEGRDITTVL DNRLMGLTLT DNRGFEADQL DLELDDADGK 
IVLPRRGAVI TLALGWKGQP LFPKGAFTVD EIEHTGAPDR LTIRARSADF RETLNTRREK
SWHKTTVGEV VKEIAARHKL KMALGEDLSD KLVEHIDQTN ESDGSFLMRL ARQYGAIASV
KNGNLLFIRQ GQGKSASGKP LPVITITRKD GDSHRFTLAD RGAYTGVIAS WLHTREPAKK
ESTTVKRKRR TKKQKKEPEA KQGDYLVGTD ENVLVLNRTY ANRSNAERAA KMQWERLQRG
VASFSLQLAE GRADLYTEMP VKVSGFKQPI DDAEWTITTL THTVSPDNGF TTSLELEVRI
DDFEME