Gene SbBS512_E1554 details


Gene Information

Locus tag: SbBS512_E1554
Symbol:
ID: 6270900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1416633
End bp: 1418030
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 55%
IMG OID: 641725648
Product: hypothetical protein
Protein accession: YP_001880154
Protein GI: 187732603
COG category: [R] General function prediction only
COG ID: [COG3106] Predicted ATPase
TIGRFAM ID:
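
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1416633, 1418030)
print(length)                     # 1398
print(protein_length_aa(length))  # 465
```

Under these assumptions, 1418030 - 1416633 + 1 = 1398 bp, and 1398 / 3 - 1 = 465 aa, matching the Gene Length and Protein Length fields.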


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGAC TTAAAAATGA ACTTAATGCG CTGGTGAATC GGGGTGTCGA CAGACATCTG 
CGCCTCGCCG TAACCGGACT AAGCCGCAGC GGCAAAACAG CGTTTATCAC TGCGATGGTC
AATCAGTTGC TCAATATTCA CGCCGGAGCA CGCTTGCCGC TGTTAAGCGC GGTGCGTGAA
GAGCGCCTGC TGGGCGTAAA ACGCATTCCT CAGCGTGACT TTGGCATTCC GCGCTTCACA
TATGATGAAG GACTGGCGCA GTTATACGGC GATCCTCCCG CCTGGCCGAC GCCAACGCGC
GGCGTCAGTG AAATCCGCCT GGCGCTACGT TTTAAATCGA ACGATTCGCT GCTACGCCAC
TTCAAGGACA CCTCCACGCT GTATCTGGAG ATTGTGGATT ATCCCGGCGA ATGGTTGCTC
GACCTGCCGA TGCTGGCGCA GGACTATTTA AGCTGGTCAC GCCAGATGAC GGGCTTACTC
AATGGTCAGC GCGGCGAATG GTCGGTCAAA TGGCGAATGA TGTGCGAAGG GCTGGACCCG
CTAGCACCTG CCGACGAAAA CCGGCTGGCA GACATTGCCG CCGCGTGGAC CGATTATCTC
CACCACTGTA AACAGCAGGG GCTGCACTTT ATTCAGCCAG GGCGCTTTGT CTTGCCAGGA
GATATGGCAG GTGCGCCCGC GCTGCAATTC TTCCCGTGGC CGGATGTCGA TACCTGGGGC
GAGTCCAAAC TGGCGCAGGC CGATAAGCAC ACCAATGCCG GAATGCTGCG CGAGCGGTTT
AATTATTACT GCGAGAAGGT GGTGAAGGGG TTCTATAAGA ATCATTTTCT GCGCTTTGAC
CGCCAGATTG TGCTGGTTGA TTGCCTGCAA CCTCTCAACA GTGGGCCACA GGCATTTAAT
GATATGCGTC TGGCACTGAC GCAGCTGATG CAAAGTTTCC ACTACGGGCA GCGTACCTTG
TTCCGGCGTT TGTTTTCGCC GGTTATCGAT AAGCTATTGT TTGCTGCCAC TAAAGCGGAC
CATGTGACCA TCGATCAGCA CGCCAATATG GTTTCATTAC TGCAACAACT GATTCAGGAT
GCCTGGCAAA ATGCGGCGTT TGAAGGGATT AGTATGGATT GTCTGGGGCT GGCCTCGGTG
CAGGCGACCA CCAGCGGCAT TATTGACGTC AACGGCGAGA AAATTCCGGC ATTGCGCGGT
AACCGGCTCA GCGATGGTGC GCCACTCACC GTTTATCCCG GTGAAGTTCC TGCGCGTTTG
CCCGGTCAGG CGTTCTGGGA CAAACAAGGG TTCCAGTTTG AAGCGTTTCG CCCGCAGGTG
ATGGATGTCG ACAAACCGCT GCCGCATATT CGTCTTGATG CCGCGCTGGA ATTTTTAATA
GGAGATAAAT TGCGATGA
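
The gene sequence is printed in space-separated groups of 10 bases, 60 bases per row, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the sequence could be cleaned and how a GC percentage like the one in the GC content field could be computed. The helper names are hypothetical, and the exact rounding the database applies is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAAGCGAC TTAAAAATGA ACTTAATGCG CTGGTGAATC GGGGTGTCGA CAGACATCTG"
print(len(clean_sequence(first_row)))  # 60
```

Passing the full gene sequence block to gc_content would give the percentage for the whole gene, which can then be compared with the GC content field.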
 
Protein sequence
MKRLKNELNA LVNRGVDRHL RLAVTGLSRS GKTAFITAMV NQLLNIHAGA RLPLLSAVRE 
ERLLGVKRIP QRDFGIPRFT YDEGLAQLYG DPPAWPTPTR GVSEIRLALR FKSNDSLLRH
FKDTSTLYLE IVDYPGEWLL DLPMLAQDYL SWSRQMTGLL NGQRGEWSVK WRMMCEGLDP
LAPADENRLA DIAAAWTDYL HHCKQQGLHF IQPGRFVLPG DMAGAPALQF FPWPDVDTWG
ESKLAQADKH TNAGMLRERF NYYCEKVVKG FYKNHFLRFD RQIVLVDCLQ PLNSGPQAFN
DMRLALTQLM QSFHYGQRTL FRRLFSPVID KLLFAATKAD HVTIDQHANM VSLLQQLIQD
AWQNAAFEGI SMDCLGLASV QATTSGIIDV NGEKIPALRG NRLSDGAPLT VYPGEVPARL
PGQAFWDKQG FQFEAFRPQV MDVDKPLPHI RLDAALEFLI GDKLR
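
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the standard NCBI amino acid string, which translation table 11 shares with the standard code for sense and stop codons (the two tables differ in which codons may act as start codons). The function name and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored; any character other than A, C, G, or T
    raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the gene sequence, copied from the record.
print(translate("ATGAAGCGAC TTAAAAATGA A"))  # MKRLKNE
print(translate("GGAGATAAAT TGCGATGA"))      # GDKLR
```

The two fragments translate to the first seven and last five residues of the listed protein sequence, and the terminal TGA is read as a stop codon.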