Gene Information |
Locus tag | SbBS512_E3107 |
Symbol | |
ID | 6272374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2903081 |
End bp | 2904349 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641727034 |
Product | pyridine nucleotide-disulphide oxidoreductase family protein |
Protein accession | YP_001881493 |
Protein GI | 187732774 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
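The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Hypothetical helpers for checking the record's length fields.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(2903081, 2904349)
    print(length)                  # 1269
    print(protein_length(length))  # 422
```

Strand does not affect these lengths; on the minus strand only the reading direction along the replicon changes.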
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.916046 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGACG ACTGCGACAT TATTATTATT GGTGCCGGTA TTGCAGGCAC CGCTTGCGCG TTACGCTGCG CGCGAGCGGG TTTATCCGTT TTGTTACTGG AACGCGCTGA AATCCCCGGC AGCAAAAATC TTTCCGGCGG ACGGTTATAT ACCCATGCAC TCGCGGAACT CCTCCCTCAG TTTCATCTGA CCGCGCCTCT TGAACGACGC ATCACTCACG AAAGCCTTTC CCTGTTAACG CCCGATGGCG TAACGACGTT TTCCAGCTTA CAGCCCGGCG GTGAATCCTG GAGTGTATTA CGTGCACGGT TCGATCCGTG GCTGGTTGCC GAAGCCGAAA AAGAAGGTGT CGAATGTATC CCCGGAGCGA CGGTGGATGC ACTGTATGAA GAAAACGGCA GAGTCTGTGG CGTTATTTGT GGTGACGATA TTCTCCGCGC CCGTTATGTG GTGCTGGCAG AAGGTGCCAA CAGCGTCCTG GCTGAACGTC ACGGGTTAGT GACTCGTCCT GCTGGCGAAG CGATGGCGTT GGGGATCAAA GAAGTGCTGT CGCTGGAAAC ATCCGCTATT GAAGAACGTT TTCATCTGGA GAATAACGAA GGCGCAGCGT TGCTGTTCAG CGGCGGAATC TGTGATGACT TACCCGGCGG CGCATTTCTT TATACTAATC AACAAACGCC CTCGTTAGGG ATTGTTTGCC CGCTCTCTTC CCTTACGCAA AGTCGTGTTC CGGCAAGCGA GCTGCTGACT CGCTTTAAAG CGCATCCGGC AGTGCGCCCG CTTATCAAAA ACACGGAATC ACTGGAGTAT GGTGCGCATC TGGTGCCAGA AGGTGGCTTG CACAGTATGC CGGTGCAATA CGCCGGTAAC GGCTGGCAGC TGGTGGGCGA TGCGTTGCGC AGTTGCGTCA ATACCGGAAT TTCCGTGCGC GGCATGGATA TGGCGCTGAC TGGCGCGCAG GCGGCACAAA CGCTGATAAG CGCCTGCCAG CACCGCGAGC CGCAAAATCT GTTTGCGCTT TATCATCACA ACGTAGAGCG CAGCCTGCTA TGGGATGTTC TACAACGTTA TCAGCATGTT CCGGCGCTTT TGCAACGCCC TGGCTGGTAT CGGGCGTGGC CTGCGTTAAT GCAGGATATT TCCCGCGATT TATGGGATCA GGGTGATAAA CCTGTTCCAC CGCTGCGCCA GTTATTCTGG CGTCATTTAC GTCGTCATGG CCTGTGGCAT CTGGCGGGCG ATGTTATCAG GAGTCTGCGA TGTCTGTAG
|
Protein sequence | MEDDCDIIII GAGIAGTACA LRCARAGLSV LLLERAEIPG SKNLSGGRLY THALAELLPQ FHLTAPLERR ITHESLSLLT PDGVTTFSSL QPGGESWSVL RARFDPWLVA EAEKEGVECI PGATVDALYE ENGRVCGVIC GDDILRARYV VLAEGANSVL AERHGLVTRP AGEAMALGIK EVLSLETSAI EERFHLENNE GAALLFSGGI CDDLPGGAFL YTNQQTPSLG IVCPLSSLTQ SRVPASELLT RFKAHPAVRP LIKNTESLEY GAHLVPEGGL HSMPVQYAGN GWQLVGDALR SCVNTGISVR GMDMALTGAQ AAQTLISACQ HREPQNLFAL YHHNVERSLL WDVLQRYQHV PALLQRPGWY RAWPALMQDI SRDLWDQGDK PVPPLRQLFW RHLRRHGLWH LAGDVIRSLR CL
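The gene and protein sequences above are printed in blocks of ten residues, and the gene sequence is already in coding orientation (it begins with ATG and ends with TAG). The sketch below shows one way to strip the grouping spaces, translate the DNA, and compute a GC percentage. It assumes that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons, which does not matter for a gene starting with ATG. All names are hypothetical.

```python
# Illustrative helpers; not part of the source database.

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (index = 16*b1 + 4*b2 + b3).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the whitespace used to group residues and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        b1, b2, b3 = dna[i:i + 3]
        # str.index raises ValueError for any character other than T, C, A, G.
        idx = 16 * BASES.index(b1) + 4 * BASES.index(b2) + BASES.index(b3)
        aa = AMINO_ACIDS[idx]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGGAAGACG ACTGCGACAT TATTATTATT"))  # MEDDCDIIII
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.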