Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1902 |
Symbol | |
ID | 6271919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1743031 |
End bp | 1744320 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641725965 |
Product | hypothetical protein |
Protein accession | YP_001880459 |
Protein GI | 187731234 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGATA ACAAATTTGA TGCCATTGTG GTCGGTGCGG ACGTTGCTGG TAGCGTTGCC GCACTGGTCA TGGCGCGAGC CGGACTGGAT GTCCTGGTGA TAGAACGCGG CGACAGTGCC GGATGTAAAA ACATGACCGG CGGGCGTCTT TATGCCCACA CACTTGAAGC AATCATTCCA GGCTTTGCAG TATCAGCGCC GGTAGAACGC AAGGTCACAC GCGAGAAAAT CTCCTTCTTA ACCGAAGAAA GCGCCGTTAC CCTCGATTTT CACCGCGAGC AACCAGATGT TCCGCAACAC GCATCTTATA CCGTATTGCG TAATCGTCTG GACCCGTGGT TGATGGAACA AGCCGAGCAG GCGGGCGCAC AGTTTATCCC GGGCGTTCGC GTCGATGCGT TGGTTCGTGA AGGAAACAAG GTCACTGGCG TGCAGGCCGG GGATGATATT CTCCAAGCGA ATGTGGTGGT TCTAGCTGAT GGCGTAAACT CGATGCTTGG CCGCTCGCTG GGAATGGTTC CCGCTTCCGA TCCGCATCAT TACGCTGTTG GTGTTAAAGA GGTTATTGGC CTCACACCAG AACAGATCAA CGATCGCTTT AATATTACGG GCGAGGAAGG TGCCGCCTGG CTGTTTGCCG GTTCCCCTTC TGACGGCCTG ATGGGCGGGG GATTTCTCTA TACCAACAAG GATTCCATAT CCTTGGGGCT GGTTTGTGGA TTGGGTGATA TCGCCCATGC GCAAAAAAGC GTGCCGCAAA TGCTGGAAGA TTTTAAACAA CACCCCGCCA TTCGCCCACT AATTAGCGGC GGCAAACTGC TTGAATATTC CGCGCATATG GTGCCAGAAG GCGGTCTGGC AATGGTGCCG CAGATGGTTA ACGATGGCGT GATGATCGTT GGTGACGCCG CAGGCTTCTG CCTGAATTTG GGTTTTACAG TTCGTGGCAT GGATTTAGCC ATTGCATCGG CTCAGGCTGC CGCCACAACG GTGATCGCCG CCAAAGAACG CGAGGATTTC TCCGCCAGCA GTCTGGCGCA ATACAAACGT GAGCTGGAAC AAAGCTGCGT CATGCGCGAT ATGCAGCATT TCCGCAAGAT CCCGGCGCTG ATGGAAAACC CGCGCCTGTT TAGCCAATAC CCACGAATGG TCGCCGACAT CATGAACGAG ATGTTCACCA TTGACGGCAA ACCAAACCAG CCGGTACGCA AAATGATCAT GGGACACGCG AAGAAAATTG GGCTGATCAA CTTGCTGAAA GATGGCATTA AGGGAGCAAC CGCGCTATGA
|
Protein sequence | MSDNKFDAIV VGADVAGSVA ALVMARAGLD VLVIERGDSA GCKNMTGGRL YAHTLEAIIP GFAVSAPVER KVTREKISFL TEESAVTLDF HREQPDVPQH ASYTVLRNRL DPWLMEQAEQ AGAQFIPGVR VDALVREGNK VTGVQAGDDI LQANVVVLAD GVNSMLGRSL GMVPASDPHH YAVGVKEVIG LTPEQINDRF NITGEEGAAW LFAGSPSDGL MGGGFLYTNK DSISLGLVCG LGDIAHAQKS VPQMLEDFKQ HPAIRPLISG GKLLEYSAHM VPEGGLAMVP QMVNDGVMIV GDAAGFCLNL GFTVRGMDLA IASAQAAATT VIAAKEREDF SASSLAQYKR ELEQSCVMRD MQHFRKIPAL MENPRLFSQY PRMVADIMNE MFTIDGKPNQ PVRKMIMGHA KKIGLINLLK DGIKGATAL
|
| |