Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1387 |
Symbol | |
ID | 6271896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1263325 |
End bp | 1264719 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641725493 |
Product | hypothetical protein |
Protein accession | YP_001880003 |
Protein GI | 187730867 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCCGTT TCGTTCCTCG CATTATCCCG TTTTATTTAC TCTTGCTTGT AGCAGGCGGT ACAGCTAACG CACAATCTAC CTTCGAGCAA AAAGCGGCAA ATCCCTTTGA TAATAACAAT GATGGTCTGC CGGATTTAGG CATGGCTCCC GAAAATCATG ATGGGGAAAA ACACTTTGCT GAAATTGTGA AAGATTTCGG CGAAACCAGT ATGAATGATA ACGGGCTGGA TACTGGCGAG CAGGCAAAAG CTTTCGCATT GGGAAAAGTC CGCGACGCGC TTAGTCAACA GGTTAATCAG CACGTAGAGT CCTGGCTATC ACCGTGGGGA AATGCCAGTG TTGACGTCAA AGTGGATAAC GAAGGACATT TCACCGGCAG TCGTGGAAGC TGGTTTGTGC CGTTACAAGA TAATGATCGT TATCTCACCT GGAGCCAGCT TGGTCTTACT CTGCAGGATA ATGGGTTGGT GAGCAATGTG GGCGTTGGGC AACGCTGGGC GCGCGGCAAC TGGCTGGTGG GTTATAACAC TTTTTATGAC AACTTGCTGG ACGAAAATCT TCAGCGAGCG GGCTTTGGTG CCGAAGCGTG GGGCGAATAT TTGCGATTAT TGGCAAACTT TTATCAGCCG TTTGCTGCAT GGCATGAACA GACAGCCACG CAGGAACAAC GGATGGCGCG CGGGTACGAC CTGACAGCTC GGATGCGCAT GCCGTTCTAT CAACACCTCA ATACCAGTGT CAGCCTAGAA CAGTATTTTG GTGATCGTGT TGATTTGTTT AACTCTGGTA CGGGTTATCA CAATCCCGTC GCGTTGAGTC TGGGATTAAA TTACACCCCT GTGCCATTAG TCACTGTGAC GGCCCAGCAT AAACAGGGTG AAAGTGGCGA GAATCAAAAT AACCTCGGGC TGAATCTTAA TTACCGCTTT GGTGTACCGC TCAAAAAACA ACTTTCTGCG GGCGAGGTTG CCGAAAGTCA GTCGTTACGT GGTAGTCGCT ATGACAATCC GCAGCGAAAT AATCTACCGA CTCTTGAGTA CCGACAGCGA AAAACGTTAA CGGTGTTTCT GGCGACACCG CCGTGGGATC TAAAACCTGG CGAAACAGTG CCGCTGAAAT TACAAATCCG CAGTCGTTAC GGTATTCGGC AACTGATTTG GCAGGGCGAT ACGCAGATAT TAAGTTTGAC GCCGGGCGCA CAAGCCAACA GCGCGGAGGG CTGGACGCTG ATCATGCCTG ACTGGCAGAA CGGGGAAGGG GCAAGCAATC ACTGGCGATT GTCGGTGGTG GTGGAAGATA ACCAGGGGCA GCGTGTCTCC TCCAATGAGA TCACGCTAAC GCTTGTCGAA CCGTTCGATG CATTGTCAAA CGACGAACTG CGCTGGGAAC CGTAA
|
Protein sequence | MSRFVPRIIP FYLLLLVAGG TANAQSTFEQ KAANPFDNNN DGLPDLGMAP ENHDGEKHFA EIVKDFGETS MNDNGLDTGE QAKAFALGKV RDALSQQVNQ HVESWLSPWG NASVDVKVDN EGHFTGSRGS WFVPLQDNDR YLTWSQLGLT LQDNGLVSNV GVGQRWARGN WLVGYNTFYD NLLDENLQRA GFGAEAWGEY LRLLANFYQP FAAWHEQTAT QEQRMARGYD LTARMRMPFY QHLNTSVSLE QYFGDRVDLF NSGTGYHNPV ALSLGLNYTP VPLVTVTAQH KQGESGENQN NLGLNLNYRF GVPLKKQLSA GEVAESQSLR GSRYDNPQRN NLPTLEYRQR KTLTVFLATP PWDLKPGETV PLKLQIRSRY GIRQLIWQGD TQILSLTPGA QANSAEGWTL IMPDWQNGEG ASNHWRLSVV VEDNQGQRVS SNEITLTLVE PFDALSNDEL RWEP
|
| |