Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4198 |
Symbol | |
ID | 6273109 |
Type | |
Is gene spliced | No |
Is pseudo gene | Yes |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3920144 |
End bp | 3925112 |
Gene Length | 4969 bp |
Protein Length | |
Translation table | |
GC content | 52% |
IMG OID | |
Product | |
Protein accession | |
Protein GI | |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACCA GAATAGTGGT TGGCCTCACG GCAGGGACGT GTCTGATTTT CTCGCAAAAC CTGATGGCCG AGGTCAGTGT ATTCAATCCG GCGCTTCTGG AAATCGACCA TCAATCCGGA GTCGATATTC GCCAGTTTAA TCGGGCAAAC CTGATGCCCC CAGGTGTTTA TAGCGTTGAT ATTTTTATCA ACGGTAAAAT GTTTGAACGT CAGGATGTGA CATTTGTTCA GAATAATCCA GATGCTGATC TGCACGCTTG CTTTATTGCC ATTAAAAAAA CACTGTCCTC CTTTGGCATA AAAGTTGATG CGCTCAAATC GTTCAATGAT GTGGATGAGA CGGTTTGCCT CGATCCTGCT CCACGTATTG AAGGCTCATC CTGGCAGTTT GACAGTGATA AATTGCAGCT GAATATATCC ATTCCCCAAA TCTACATGGA CGCGATGGCT TATGATTACA TCAGCCCCAC GCGTTGGGAT GAGGGGATTA ACGCGCTCAC CATCAACTAC GATTTTTCTG GTTCACATAC ACTACGTTCA GATTATGGTT CACAAGAGAC AGATACCAGT TATCTCAATC TGCGCAATGG ACTGAATATT GGACGTAAGC GTACAGCCTG AACCGTCTGG TCAGAATCTG ACGAATTAGA CAAAGTGGTG TCCACCAAAT AAGTAGTGGG AACCAAAGTG TCAGATATGC AGAAAAATGT GAATCCCGGC AGGCGAAAAG GCTGCCCTAA TTATCCTCCC GAATTTAAAC AGCAGCTCGT TGCTGCCTCC TGTGAACCCG GGATATCCAT CTCAAAACTT GCTCTTGAAA ATGGCATTAA TGCCAATCTG TTGTTCAAAT GGCGACAACA ATGGCACGAG GGAAAGCTGC TATTACCTTC TTCAGAGAGC CCCCAGCTAC TTCCTGTGAC TCTCGATGCA GCTGCCGAAC AGCCAGAATC GCTCGCAGAG GATCCGGAAA CCCTCAGTAT CAGCTGTGAG GTAACGTTCC GGCACGGGAC GCTCCGCTTC AATGGCAATG TCAGCGAAAA GCTCCTGACT CTGCTGATAC AGGAACTGAA GCGATGATCC CGTTACCTTC CGGGACCAAA ATTTGGCTGG TTGCCGGTAT CACCGATATG AGAAATGGCT TCAACGGCCT GGCGGCAAAG GTGCAGACGA CGCTGAAAGA CGATCCGATG TCAGGTCACG TTTTTATCTT CCGTGGGCGT AATGGCAGTC AGGTAAAGCT CCTCTGGTCT ACCGGCGATG GACTGTGTCT GCTGACCAAA CGGCTGGAGC GCGGCCGCTT CGCCTGGCCG TCAGCCCGGG ATGGCAAAGT GTTCCTCACA CCGGCACAGC TGGCGATGCT CCTTGAAGGT ATCGACTGGC GGCAGCTTAA AAGACTGCTT ACGTCCCTGA CTATGTTGTA AGCCTCTTTA TCCTGGTCGA CGCTGAATGA GCCTGGTAAT ATACCCGGTA TGAGCGGCTC ACTTCCTGAC GATATCAATG CACTGAAACG TCTCCTTGCC GAACAGGAGG CGCTGAACCG TGCCCTGCTG GAAAAGCTGA ACGAGCGTGA ACGCGAAATA GACCATCTGC AGGCACAGCT GGATAAGCTG CGCCGGATGA ACTTCGGCAG CCGCTCCGAA AAAGTCTCCC GTCGTATCGC ACAGATGGAA GCTGACCTGA AGGCACTTCA GAAAGAAAGT GATACCCTTA CCGGTCGGGT TGACGACCCG GCCGTGCAGC GCCCGCTGCG TCAAACCCGC ACCCGCAAAC CGTTCCCCGA ATCACTCCCC CGCGATGAAA AACGGCTGCT GCCGGCAGCG TCATGCTGCC CGGAATGTGG AGGCTCACTG AGCTATCTGG GTGAGGATGC CGCCGAACAG CTGGAGTTGA TGCGCAGCGT CTTCCGGGTT ATCCGGACTG TACGTGAAAA GCATGCCTGT ACTCAGTGCG ATGCCATCGT GCAGGCCCCC GCGCCTTCAC GGCCCATCGA GCGGGGTATC GCAGGACCGG GGCTGCTGGC CCGCGTGCTG ATCTCAAAGT ATGCAGAGCA CACCCCGCTG TACCGCCAGT CTGAAATGTA CGGCCGCCAG GGCGTGGAGC TGAGTCGTTC ACTGCTGTCG GGCTGGGTGG ATGCATGCTG CCGGCTACTG TCACCGCTGG AAGAAGCGCT TCAGGACTAT GTGCTGACTG ACGGTAAGCT CCATGCTGAT GACACGCCTG TCCCGGTGCT GTTGCCAGGC AATAAGAAAA CGAAGACCGG GCGGTTATGG ACCTACGTTC GTGACGACCG TAACGCCGGG TCAACGCTGG CGCCGGCGGT GTGGTTCGCT TACAGCCCGG ACAGAAAAGG CATCCATCCG CAGACCCATC TTGCGGGGTT CAGTGGTGTA CTGCAGGCGG ATGCATACGC CGGGTTCAAC GAGCTGTACC GGGATGGCCG GATAACGGAA GCCGCCTGTT GGGCTCACGC CCGCCGTAAA ATCCACGATG TGCACGTTCG CACCCCGTCA GCCCTGACGG AGGAAGCGCT GAAACGGATC GGCGAACTGT ACGCCATCGA GGCAGAGATA AGGGGAATGA CGGCGGAGCA GCGCCTTGCC GAACGTCAGT TGAAAACGAA ACCGCTGCTG AAATCCCTGG AAAGCTGGCT GCGTGAAAAG ATGAAAACCC TGTCGCGACA CTCAGAACTG GCGAAAGCGT TCGCATACGC CCTGAACCAG TGGCCGGCGC TGACGTACTA TGCAGATGAT GGCTGGGCTG AGGCGGACAA TAACATCGCT GAAAATGCGT TGCGGATGGT CAGTCTGGGC CGCAAAAACT ACCTGTTCTT CGGATCGGAT CATGGAGGAG AGCGGGGAGC GCTGCTGTAC AGCCTGATCG GGACGTGCAA ACTGAACGGA GTGGAGCCAG AAAGCTACCT CCGCTATGTC CTTGACGTCA TAGCCGACTG GCCGATAAAC CGGGTCGGCG AACTGCTCCC CTGGCGCGTA GCACTGCCGA CTGAATAACA CATCCCCGTC AATACGGTTC TTGCTGCACG CTTACTATTG GACCGTGGCG GCTACGTAAT TACAGTACTT TAAACACCAG CGATGGCCGT GCGGAATACA ACTCCATTAG TACCTGGATA CAGCGCGATA TTGCCGCGTT AAGAAGCCAG ATTATGATTG GTGATACGTG GACGGCGAGC GATATTTTCG ACAGTACGCA AATTCGCGGC GCGCGTTTGT ATACTGATAA CGATATGCTA CCCGCCAGCC AGAATGGCTT TGCTCCTGTG GTTCGTGGGA TTGCAAAGTC CAACGCCACC GTCATCATTC GGCAGAATGG CTACGTGATT TATCAGTCTG CCGTTCCAGA AGGTGCTTTT GAGATCACCG ATCTCAACAC CGCCAGTACA GGTGGCGATT TGGACGTAAC CATCAAAGAA GAAGACGGTA GCGAACAACG ATTCACCCAA CCTTATGCTT CATTGGCGAT TCTTAAACGT GAAGGTCAGA CAGATGTTGA TGTCAGCGTG GGTGAATTGC GCGATGAAGA CGGATTTACA CCGGACGTCC TTCAGGCGCA AATACTTCAT GGTTTTTCCC ACGGGATCAC TTTATATGGA GGTATGCAGG CTGCTGAAAA TTATGGTTCT GCAGCTCTGG GTGTCGGTAA AGATCTTGGC GCTTTGGGCG CAATTTCTTT CGATGTGACA CATGCTCGTG CGAATTTTAG CCATGATGAT ACAGAAACGG GTCAGTCATA TCGCTTTCTC TATTCAAAAC GATTTGACGA CACAGACACT AGCTTGCGCC TGGTTGGCTA TCGTTACTCC ACCGAGGGCT ACTATACCCT CAATGAGTGG GCATCGCGGC GCAACAGCCC TGAAGACTTT TGGGAAACAG GTAACCGACG TAGTCGCGTG GAGGGAACGC TAACGCAGTC GTTGGGGAGA GATTATGGCA ATTTATACCT GACATTAAGC CGGCAACAAT ACTGGCATAC CGATGATGTC GAACGATTAA TGCAATTTGG CTACAGCAGT AGCTGGAAGC GTCTCTCGTG GAACGTCTCC TGGAGTTATT CCAATACTGC CAGACAGGGG ACGGGGAACA ACCATGCCAG TGATAACACC AGTGAGCAGA TCTACATGCT CTCTTTATCT GTTCCTTTAT CGGGCTGGTG GGGTAATAGT TACGCCACCT ATTCTGTTTC GCAAAACGAT AATTCCGGTA GCTCACATCA ACTCGGACTC AGCGGTACGG CGCTGGAAAG AAATAACCTT TCATGGAATT TAATGCAGTC CTATAACAGT CATGATGATG AGGTTGGCGG TAATATGTCC CTGACCTATG ATGGCTCTTA TGGCACGGTG AACGGCAGCT ATAACTACAG CCAAAATTCC CAGAGGCTGA ATTATGGTAT CAGAGGGGGA ATTCTGGCAC ACAGCGAAGG GGTAACGTTA AGTCAGGAGT TAGGTGAAAC TATTGCTCTT GTTAAAGCAC CTGGGGCCGC CGGGTTAGAA ATAGATAATA TGCGCGGTGC TGCGACGGAC TGGCGCGGCT ATACGGTCAA GACACAGCTA AACCCTTATG ATGAAAATCG GGTAGCAATC AGCGATAACT ATTTCTCGAA GTCGAATATA GAACTTGATA ATACCGTCGT TACGATGGTT CCCACGCGTG GTGCAGTGGT TAAAGCGGAG TTTGTGACTC ATGTGGGTTA TCGCGTTCTC TTCAGAGTGT TAAATGCAAA TGGTAAACCG GTACCTTTTG GAGCCATTGC TGCGATACAA GATGCAAGTT TGGCAGATTC AGGAATTGTC GGTGACCGTG GCGAACTTTA TCTTTCTGGT CTACCAGAAA AAGGACAGGT TACGTTATCC TGGGGAGAAA ACGCCTCAAC AAAATGCATC TTCAATTATT CACTTTCGAC ACCAGAAAGT GAGAGCGGAT TAATTGAACA GGGTGTGACA TGTCATTAA
|
Protein sequence | |
| |