Gene SbBS512_E1800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1800 
Symbol 
ID6268947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1645947 
End bp1647455 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content49% 
IMG OID641725870 
Producthypothetical protein 
Protein accessionYP_001880368 
Protein GI187732713 
COG category[S] Function unknown 
COG ID[COG5339] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAT CGCTGGTAGC GGTAGGCGTC ATTGTTGCGC TAGGCGTAGT CTGGACAGGC 
GGCGCATGGT ATACAGGCAA GAAGATTGAA ACCCATCTCG AAGACATGGT CGCGCAGGCG
AACGCGCAAC TCAAACTGAC AGCTCCTGAA TCCAACCTGG AAGTGAGTTA TCAAAACTAT
CATCGCGGCG TATTCAGCAG CCAGTTGCAA CTGTTGGTGA AACCCATTGC CGGGAAAGAA
AATCCGTGGA TTAAAAGCGG TCAGAGCGTC ATCTTCAACG AATCGGTTGA TCATGGTCCC
TTCCCGCTTG CCCAGCTTAA AAAACTGAAC CTGATCCCGT CGATGGCATC AATTCAAACC
ACGCTGGTTA ATAACGAAGT AAGCAAACCA CTGTTTGATA TGGCAAAAGG TGAAACGCCT
TTTGAGATTA ACTCGCGCAT TGGTTACAGC GGTGATTCCA GTTCCGATAT TTCGCTCAAG
CCACTGAATT ACGAGCAAAA GGATGAAAAA GTCGCCTTTA GCGGCGGCGA GTTCCAGTTA
AATGCTGACA GAGACGGCAA AGCCATCTCC CTTTCCGGGG AGGCGCAAAG TGGTCGGATA
GACGCAGTTA ACGAATACAA CCAGAAAGTG CAGTTGACCT TTAATAATCT GAAAACCGAC
GGTTCCAGCA CGCTGGCAAG TTTTGGTGAG CGCGTAGGAA ACCAAAAACT GTCACTGGAA
AAAATGACCA TTTCAGTGGA AGGCAAAGAA CTGGCACTGC TGGAAGGCAT GGAGATCAGC
GGTAAATCGG ATCTGGTCAA TGACGGTAAA ACGATCAATA GCCAACTGGA TTACTCGCTA
AACAGCCTGA AGGTACAGAA TCAGGATCTG GGCAGCGGCA AGCTGACTTT AAAAGTCGGC
CAGATTGATG GTGAAGCCTG GCATCAGTTC AGCCAGCAAT ATAACGCGCA AACTCAGGCG
CTGCTGGCAC AGCCAGAAAT TGCCAACAAT CCCGAACTTT ATCAGGAGAA AGTGACGGAA
GCCTTCTTTA GCGCCCTGCC GCTGATGTTG AAAGGCGATC CGGTGATTAC TGTCGCGCCG
CTAAGCTGGA AAAACAGTCA GGGTGAAAGT GCGCTGAATC TGTCGCTGTT CCTGAAAGAT
CCGGCAACGA CTAAAGAAGC GCCGCAAACG CTGGCGCAGG AAGTAGATCG TTCGGTTAAA
TCTCTGGATG CGAAACTGAC CATTCCGGTG GATATGGCAA CTGAGTTTAT GACTCAGGTA
GCGAAGCTGG AAGGTTATCA GGAAGATCAA GCGAAAAAAC TGGCGAAACA GCAAGTTGAA
GGTGCATCAG CAATGGGGCA GATGTTCCGT CTGACCACCT TGCAGGACAA TACCATCACC
ACCAGCCTGC AATATACTAA CGGTCAGATA ACGTTAAATG GGCAGAAAAT GTCGTTAGAA
GATTTTGTTG GCATGTTTGC GATGCCAGCT CTTAACGTTC CGGCTGTACC GGCAATTCCG
CAGCAGTAA
 
Protein sequence
MNKSLVAVGV IVALGVVWTG GAWYTGKKIE THLEDMVAQA NAQLKLTAPE SNLEVSYQNY 
HRGVFSSQLQ LLVKPIAGKE NPWIKSGQSV IFNESVDHGP FPLAQLKKLN LIPSMASIQT
TLVNNEVSKP LFDMAKGETP FEINSRIGYS GDSSSDISLK PLNYEQKDEK VAFSGGEFQL
NADRDGKAIS LSGEAQSGRI DAVNEYNQKV QLTFNNLKTD GSSTLASFGE RVGNQKLSLE
KMTISVEGKE LALLEGMEIS GKSDLVNDGK TINSQLDYSL NSLKVQNQDL GSGKLTLKVG
QIDGEAWHQF SQQYNAQTQA LLAQPEIANN PELYQEKVTE AFFSALPLML KGDPVITVAP
LSWKNSQGES ALNLSLFLKD PATTKEAPQT LAQEVDRSVK SLDAKLTIPV DMATEFMTQV
AKLEGYQEDQ AKKLAKQQVE GASAMGQMFR LTTLQDNTIT TSLQYTNGQI TLNGQKMSLE
DFVGMFAMPA LNVPAVPAIP QQ