Gene SbBS512_E4766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4766 
Symbol 
ID6273260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4447266 
End bp4448270 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content49% 
IMG OID641728520 
Producthypothetical protein 
Protein accessionYP_001882915 
Protein GI187731231 
COG category[S] Function unknown 
COG ID[COG5464] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01784] conserved hypothetical protein (putative transposase or invertase) 


Plasmid Coverage information

Num covering plasmid clones63 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAACT TCACGACCAG CACGCCACAT GACGCATTAT TTAAATCCTT TCTCACTCAC 
CCTGACACCG CGCGGGATTT TATGGAGATC CACATACCCA AAGATTTACG TGAACTGTGC
GATCTCGACA GCTTAAAACT GGAATCCGCC AGCTTTGTCG ATGAAAAATT GCGGGTGCTA
CACTCCGATA TTCTGTGGTC GGTAAAGACC CGTGAAGGTG ATGGTTATAT TTACGTAGTG
ATTGAACATC AGAGCCGCGA GGATATTCAT ATGGCATTTC GCCTGATGCG ATATTCCATG
GCGGTGATGC AGCGCCATAT CGAGCATGAT AAACGCCGGC CGCTACCGCT GGTTATCCCG
ATGCTGTTTT ATCACGGTAG CCGTAGTCCT TACCCCTGGT CCCTGTGCTG GCTGGACGAA
TTTGCCGCCC CGACTACCGC ACGGAAACTT TATAGCGCAG CGTTCCCGCT GGTGGATGTC
ACTGTCGTGC CAGACGACGA GATTGTGCAG CATCGCAGAG TCGCCCTGTT GGAGTTGATC
CAAAAGCATA TTCGCCAGCG CGATTTGATG GGGCTTATTG ACCAACTGGT AGTATTACTG
GTTACAGAGT GTGCTAATGA CAGCCAGATA ACTGCGCTGT TAAATTACAT TTTACTGACT
GGCGATGAAG CGCGTTTTAA GGCGTTTATC AGCGAACTTA CCAGGCGAAT GCCACACCAC
AGGGAGCGAA TAATGACAAT TGCAGAGCGA ATTCATAATG ATGGATGGCT GTTGGGAAGG
GAGAGGGGGA GGAAAGAAGG GAAAGTAGAA GGGGAACGGA GCCTCCTCCG ATTGTTGTTG
CAGAATGGGG CCGATCCTGA ATGGATACAA CGATATACCG GACTTTCGGC AGAGCAAATG
CAGGCATTAG ATCTGAAGTG GCACACTGAA TTTGGCCACC TGAACAGAGG TGATATGCTC
ACCTCAGAAC AACACAGGTG CTCCAATGAA AAAAAGAAAT TTTAG
 
Protein sequence
MTNFTTSTPH DALFKSFLTH PDTARDFMEI HIPKDLRELC DLDSLKLESA SFVDEKLRVL 
HSDILWSVKT REGDGYIYVV IEHQSREDIH MAFRLMRYSM AVMQRHIEHD KRRPLPLVIP
MLFYHGSRSP YPWSLCWLDE FAAPTTARKL YSAAFPLVDV TVVPDDEIVQ HRRVALLELI
QKHIRQRDLM GLIDQLVVLL VTECANDSQI TALLNYILLT GDEARFKAFI SELTRRMPHH
RERIMTIAER IHNDGWLLGR ERGRKEGKVE GERSLLRLLL QNGADPEWIQ RYTGLSAEQM
QALDLKWHTE FGHLNRGDML TSEQHRCSNE KKKF