Gene SbBS512_E3249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3249 
Symbol 
ID6268708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3031259 
End bp3032569 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content56% 
IMG OID641727160 
Producthypothetical protein 
Protein accessionYP_001881613 
Protein GI187731595 
COG category[S] Function unknown 
COG ID[COG3681] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.916046 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGATT CGACTTTAAA TCCGTTATGG CAGCGTTACA TCCTCGCCGT TCAGGAGGAA 
GTAAAACCGG CGCTGGGATG TACTGAACCG ATTTCACTGG CGCTGGCGGC GGCGGTTGCT
GCGGCAGAAC TGGAAGGTCC GGTTGAACGT GTAGAAGCCT GGGTTTCGCC AAATCTGATG
AAGAACGGTC TGGGCGTCAC CGTTCCCGGC ACGGGAATGG TGGGGCTGCC GATTGCGGCG
GCGCTGGGGG CGTTAGGTGG AAATGCCAAC GCCGGGCTGG AAGTGCTGAA AGACGCAACT
GCGCAGGCAA TTGCCGATGC CAAAGCACTG CTGGCGGCGG GGAAAGTCTC CGTTAAGATC
CAGGAACCTT GCAATGAAAT CCTCTTCTCA CGCGCCAAAG TCTGGAACGG TGAGAAGTGG
GCGTGTGTCA CCATCGTCGG CGGGCATACC AACATTGTGC ATATTGAGAC GCACAATAGT
GTGGTGTTTA CCCAGCAGGC GTGTGTGGCA GAGGGCGAGC AAGAGTCTCC GCTGACGGTG
CTTTCCAGAA CGACGCTGGC TGAGATCCTG AAGTTCGTCA ATGAAGTCCC GTTTGCGGCG
ATCCGCTTTA TTCTCGATTC CGCGAAGCTA AATTGTGCGT TATCGCAGGA AGGTTTGAGC
GGTAAGTGGG GGCTGCATAT TGGCGCGACG CTGGAAAAAC AGTGCGAGCG CGGTTTGCTG
GCGAAAGATC TCTCTTCATC CATTGTGATT CGTACCAGCG CGGCATCCGA TGCGCGTATG
GGCGGCGCTA CGCTTCCGGC TATGAGTAAC TCCGGCTCGG GTAACCAGGG GATCACCGCA
ACAATGTCTG TGGTGGTTGT AGCAGAACAC TTCGGAGCGG ATGATGAACG ACTGGCGCGT
GCGCTGATGC TTTCTCATTT GAGCGCAATT TACATCCATA ACCAGTTACC GCGTTTGTCT
GCACTTTGTG CCGCAACGAC CGCAGCAATG GGGGCCGCCG CCGGGATGGC ATGGCTGGTG
GATGGGCGTT ATGAAACTAT CTCGATGGCG ATCAGCAGTA TGATCGGCGA TGTCAGCGGC
ATGATTTGCG ATGGTGCGTC GAACAGCTGC GCGATGAAGG TTTCGACCAG TGCTTCGGCT
GCGTGGAAAG CGGTGTTAAT GGCGCTGGAT GATACCGCCG TGACCGGCAA TGAAGGGATC
GTGGCGCATG ATGTTGAGCA GTCGATTGCC AACCTGTGTG CGTTAGCAAG CCATTCGATG
CAGCAAACGG ATCGGCAGAT TATCGAGATT ATGGCGAGCA AGGCCAGATA A
 
Protein sequence
MFDSTLNPLW QRYILAVQEE VKPALGCTEP ISLALAAAVA AAELEGPVER VEAWVSPNLM 
KNGLGVTVPG TGMVGLPIAA ALGALGGNAN AGLEVLKDAT AQAIADAKAL LAAGKVSVKI
QEPCNEILFS RAKVWNGEKW ACVTIVGGHT NIVHIETHNS VVFTQQACVA EGEQESPLTV
LSRTTLAEIL KFVNEVPFAA IRFILDSAKL NCALSQEGLS GKWGLHIGAT LEKQCERGLL
AKDLSSSIVI RTSAASDARM GGATLPAMSN SGSGNQGITA TMSVVVVAEH FGADDERLAR
ALMLSHLSAI YIHNQLPRLS ALCAATTAAM GAAAGMAWLV DGRYETISMA ISSMIGDVSG
MICDGASNSC AMKVSTSASA AWKAVLMALD DTAVTGNEGI VAHDVEQSIA NLCALASHSM
QQTDRQIIEI MASKAR