Gene SbBS512_E3996 details


Gene Information

Locus tag: SbBS512_E3996
Symbol:
ID: 6272573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3726096
End bp: 3727424
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 52%
IMG OID: 641727842
Product: hypothetical protein
Protein accession: YP_001882274
Protein GI: 187731127
COG category:
COG ID:
TIGRFAM ID: [TIGR03368] cellulose synthase operon protein YhjU
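
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 3726096
END_BP = 3727424


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1329 442
```

Both results match the Gene Length (1329 bp) and Protein Length (442 aa) fields.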


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGGGG CCATTTTTGT TTTATTAGTG GCCTGGTTAT TCCTGTCACA ATGGATTCGC 
ATTACCGTTT TTGTGGTTGC CATACTGCTA TGGCTGAACG TACTTACCCT GGCGGGACCA
AGTTTCTCCT TGTGGCCAGC CGGACAACCG ACGACCACTG TAACAACGAC GGGTGGTAAC
GCAGCGGCAA CCGTTGCGAC GACGGGTGGC GCACCGGTAG TGGGTGATAT GCCCGCACAA
ACTACACCGC CAACAACGGC GAACCTTAAC GCCTGGCTGA ATAATTTCTA TAACGCGGAG
GCGAAACGTA AATCGACCTT CCCGTCTTCG CTGCCCGCTG ATGCTCAGCC ATTTGAACTA
CTGGTGATTA ACATCTGTTC GCTTTCCTGG TCGGATATAG AAGCCGCCGG GTTGATGTCG
CATCCACTGT GGTCGCATTT CGATATTGAG TTCAAGAACT TTAACTCCGC CACCTCCTAC
AGTGGCCCGG CGGCGATCCG TTTACTGCGC GCCAGCTGCG GGCAGACTTC GCACACTAAT
CTGTATCAAC CGGCAAATAA CGACTGCTAT CTGTTTGATA ACCTTTCGAA ACTGGGCTTT
ACCCAGCACC TGATGATGGG ACATAACGGC CAGTTCGGCG GTTTTTTGAA AGAAGTTCGC
GAAAATGGCG GCATGCAGAC TGAATTGATG GATCAAACAA ATCTGCCGGT TATTTTGCTG
GGCTTTGATG GTTCGCCGGT TTATGACGAT ACCGCCGTGC TTAACCGCTG GCTGGACGTT
ACCGAAAAAG ATAAAAACAG CCGTAGTGCC ACGTTCTACA ACACGCTTCC ACTGCATGAC
GGCAACCATT ATCCGGGGGT CAGCAAAACA GCGGATTACA AAGCGCGGGC GCAGAAATTC
TTTGATGAAC TGGACGCCTT CTTTACTGAA CTGGAGAAAT CGGGTCGTAA AGTGATGGTG
GTCGTGGTGC CGGAACACGG CGGCGCGCTG AAGGGCGACA GAATGCAGGT ATCTGGCCTA
CGTGATATCC CTAGCCCGTC TATCACAGAC GTCCCCGTTG GGGTGAAATT CTTCGGCATG
AAGGCACCAC ATCAGGGGGC ACCGATTGTC ATCGACCAAC CGAGCAGCTT CCTGGCTATC
TCCGATCTGG TGGTTCGCGT TCTTGATGGC AAGATTTTCA CCGAAGACAA TGTTGACTGG
AAAAAACTCA CCAGTGGGTT GCCACAAACA GCACCGGTCT CCGAGAACTC AAATGCAGTA
GTTATTCAAT ACCAGGATAA ACCGTACGTT CGCCTGAACG GCGGCGACTG GGTGCCTTAC
CCGCAGTAA
 
Protein sequence
MIGAIFVLLV AWLFLSQWIR ITVFVVAILL WLNVLTLAGP SFSLWPAGQP TTTVTTTGGN 
AAATVATTGG APVVGDMPAQ TTPPTTANLN AWLNNFYNAE AKRKSTFPSS LPADAQPFEL
LVINICSLSW SDIEAAGLMS HPLWSHFDIE FKNFNSATSY SGPAAIRLLR ASCGQTSHTN
LYQPANNDCY LFDNLSKLGF TQHLMMGHNG QFGGFLKEVR ENGGMQTELM DQTNLPVILL
GFDGSPVYDD TAVLNRWLDV TEKDKNSRSA TFYNTLPLHD GNHYPGVSKT ADYKARAQKF
FDELDAFFTE LEKSGRKVMV VVVPEHGGAL KGDRMQVSGL RDIPSPSITD VPVGVKFFGM
KAPHQGAPIV IDQPSSFLAI SDLVVRVLDG KIFTEDNVDW KKLTSGLPQT APVSENSNAV
VIQYQDKPYV RLNGGDWVPY PQ
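
The sequences above are printed in blocks of ten characters, so they need to be rejoined before use. The following sketch shows one way to do that, and to translate the gene sequence and compute its GC fraction. It is a minimal illustration, not the database's own method: the helper names are hypothetical, and the codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to each codon and differs only in which start codons are permitted.

```python
# Standard genetic code in NCBI ordering (bases T, C, A, G).
# Assumption: translation table 11 uses the same codon-to-residue
# assignments and differs only in the permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Join the space- and newline-separated blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First line of the gene sequence from this record.
FIRST_LINE = "ATGATTGGGG CCATTTTTGT TTTATTAGTG GCCTGGTTAT TCCTGTCACA ATGGATTCGC"
print(translate(clean(FIRST_LINE)))  # prints: MIGAIFVLLVAWLFLSQWIR
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` would allow the GC content and Protein Length fields to be checked the same way.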