Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4230 |
Symbol | |
ID | 6270819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3955642 |
End bp | 3956712 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641728050 |
Product | hypothetical protein |
Protein accession | YP_001882471 |
Protein GI | 187733845 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGGCAAC AACAGGACGA ACAGGGGCGG TTTTCAATTT GCAGCCGTCA GGCTGCCGTG GTTCATCCGA AGTCAGAGAA ATATGTTTTC ATCCATGGCC CGGAAAATCC TGATGAAACA TGGTATTACG ATTTCCATCA TCGACGCGGA GTCATTGCTG AAGGCGGCAA GGTGAGCAAT CTCGATGCAA TGGATATTAC CGCGCCGTAT ACGCCAGGAG CGCTGCGCGG CGGCAGCCAT GTGCATGTCT TTAGCCCGAA CGGTGAAAGG GTGAGCTTTA CCTATAACGA CCATGTAATG CATGAACTCG ATCCGGCGCT GGATTTGCGA AACGTCGGTG TTGCTGCGCC GTTTGGCCCG GTCAACGTAC AAAAGCAGCA TCCGCGTGAA TACAGCGGTA GCCACTGGTG CGTGCTGGTG AGTAAAACCA CGCCCACGCC GCAGCCTGGC AGCGATGAAA TCAATCGTGC TTATGAAGAA GGATGGGTAG GAAATCACGC GCTGGCGTTT ATTGGCGACA CACTTTCGCC AAAGGGCGAG AAAGTGCCGG AGCTGTTTAT CGTTGAGTTA CCGCAAGATG AAGCTGGCTG GAAAGCGGCA GGTGATGCGC CGTTAAGTGG AACGGAAACC ACCCTGCCCG CGCCACCGCG CGGCGTCGTG CAGCGACGTT TAACCTTTAC CCACCATCGG GCTTATCCGG GGTTAGTCAA CGTCCCGCGC CACTGGGTGC GCTGTAATCC GCAGGGTACG CAAATCGCGT TTTTAATGCG TGATGATAAC GGCATTGTGC AACTGTGGCT TATCTCGCCA CAGGGCGGCG AGCCGCGCCA GTTAACCCAT AACAAAACGG ATATTCAGTC TGCATTTAAC TGGCATCCGT CAGGAGAATG GTTGGGCTTT GTGCTGGATA ATCGAATTGC TTGTGCCCAT GCGCAAAGTG GCGAGGTTGA GTATTTAACC GAAAACCACG CCAATCCACC TTCTGCGGAC GCCGTGGTCT TCTCGCCGGA TGGTCAATGG CTGGCGTGGA TGGAAGGCGG CCAGCTGTGG ATCACCGAAA CTGATCGCTA A
|
Protein sequence | MRQQQDEQGR FSICSRQAAV VHPKSEKYVF IHGPENPDET WYYDFHHRRG VIAEGGKVSN LDAMDITAPY TPGALRGGSH VHVFSPNGER VSFTYNDHVM HELDPALDLR NVGVAAPFGP VNVQKQHPRE YSGSHWCVLV SKTTPTPQPG SDEINRAYEE GWVGNHALAF IGDTLSPKGE KVPELFIVEL PQDEAGWKAA GDAPLSGTET TLPAPPRGVV QRRLTFTHHR AYPGLVNVPR HWVRCNPQGT QIAFLMRDDN GIVQLWLISP QGGEPRQLTH NKTDIQSAFN WHPSGEWLGF VLDNRIACAH AQSGEVEYLT ENHANPPSAD AVVFSPDGQW LAWMEGGQLW ITETDR
|
| |