Gene Information |
Locus tag | SbBS512_E2348 |
Symbol | |
ID | 6269614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2136361 |
End bp | 2137551 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641726352 |
Product | hypothetical protein |
Protein accession | YP_001880834 |
Protein GI | 187732748 |
COG category | [R] General function prediction only |
COG ID | [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | |
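
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(2136361, 2137551)
print(length)                  # 1191
print(protein_length(length))  # 396
```

Both values agree with the Gene Length and Protein Length rows of this record.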
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.287324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGTGTAC GTTTAGTGTT AGCCAAAGGG CGCGAAAAAT CATTACTTCG TCGCCATCCG TGGGTCTTTT CCGGGACCGT TGCCCGCATG GAAGGTAAAG CCAGCCTCGG TGAAACCATC GATATTGTTG ATCATCAGGG AAAATGGTTA GCACGCGGCG CTTATTCGCC AGCTTCGCAA ATCCGGGCGC GCGTCTGGAC GTTTGACCCG TCTGAGTCTA TCGACATTGC TTTTTTTTCC CGCCGTTTGC AACAAGCACA AAAATGGCGT GACTGGCTGG CGCAAAAAGA TGGCCTCGAC AGCTATCGTT TAATCGCCGG AGAATCTGAT GGCCTGCCGG GTATTACTAT CGATCGTTTC GGTAATTTTC TGGTGCTGCA ACTGCTGAGT GCTGGGGCAG AATATCAGCG CGCGGCATTA GTTAGTGCCC TGCAAACGCT GTACCCGGAA TGTGCGATTT ACGATCGCAG CGATGTTGCG GTACGTAAAA AAGAAGGGAT GGAGCTGACC CTGGGCCTCG TCACCGGCGA GTTGCCGCCT GCCCTGCTGC CGATTGAAGA ACACGGCATG AAGCTGCTGG TGGATATTCA GCACGGGCAC AAAACGGGCT ACTACCTGGA CCAGCGAGAC AGCCGCCTGG CTACCCGCCG CTACGTTGAA AATAAACGTG TGCTGAACTG TTTCTCCTAT ACCGGTGGTT TCGCCGTATC GGCACTGATG GGCGGTTGCA GCCAGGTTGT CAGCGTTGAT ACCTCCCAGG AAGCGCTGGA TATTGCACGG CAGAACGTTG AGCTGAACAA ACTGGATCTG AGCAAGGCTG AGTTTGTCCG TGATGATGTC TTTAAATTGC TGCGTACTTA TCGCGATCGC GGTGAAAAAT TTGACGTTAT CGTGATGGAC CCGCCGAAGT TTGTTGAGAA TAAAAGCCAG TTGATGGGCG CGTGTCGTGG CTATAAAGAT ATCAACATGC TGGCGATTCA GTTGCTGAAT GAAGGCGGTA TTCTCCTGAC TTTCTCCTGT TCCGGTCTGA TGACCAGCGA TTTATTTCAG AAAATCATCG CGGATGCCGC AATTGATGCC GGCCGTGATG TACAATTTAT AGAGCAGTTC CGTCAGGCAG CCGATCATCC GGTGATCGCT ACCTATCCGG AAGGGCTATA TCTGAAAGGG TTTGCCTGTC GCGTCATGTA A
Protein sequence | MSVRLVLAKG REKSLLRRHP WVFSGTVARM EGKASLGETI DIVDHQGKWL ARGAYSPASQ IRARVWTFDP SESIDIAFFS RRLQQAQKWR DWLAQKDGLD SYRLIAGESD GLPGITIDRF GNFLVLQLLS AGAEYQRAAL VSALQTLYPE CAIYDRSDVA VRKKEGMELT LGLVTGELPP ALLPIEEHGM KLLVDIQHGH KTGYYLDQRD SRLATRRYVE NKRVLNCFSY TGGFAVSALM GGCSQVVSVD TSQEALDIAR QNVELNKLDL SKAEFVRDDV FKLLRTYRDR GEKFDVIVMD PPKFVENKSQ LMGACRGYKD INMLAIQLLN EGGILLTFSC SGLMTSDLFQ KIIADAAIDA GRDVQFIEQF RQAADHPVIA TYPEGLYLKG FACRVM
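
The relationship between the gene sequence, the translation table, the GC content, and the protein sequence can be checked with a short script. The sketch below is an illustration under stated assumptions: it uses the standard codon assignments (which translation table 11 shares with table 1 for elongation), it does not handle the alternative start codons that table 11 allows, and it strips the whitespace that separates the 10-base blocks shown above. The names `clean`, `gc_content`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGAGTGTAC GTTTAGTGTT AGCCAAAGGG"
print(translate(prefix))  # MSVRLVLAKG
```

Passing the full Gene sequence value to `translate` and `gc_content` would allow comparison against the Protein sequence and GC content rows of the record.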