Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2551 |
Symbol | |
ID | 6271390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2352152 |
End bp | 2353237 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641726533 |
Product | hypothetical protein |
Protein accession | YP_001881013 |
Protein GI | 187731992 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGTG GTCATCGCTT TGATGCTCAG ACGCTGCACA GTTTTATTCA GGCTGTATTT CGTCAGATGG GTAGCGAGGA ACAAGAAGCG AAATTAGTGG CCGATCATTT AATCGCGGCA AACCTAGCAG GGCATGATTC TCATGGTATT GGCATGATCC CAAGCTATGT GCGTTCCTGG AGTCAGGGGC ATCTGCAAAT TAACCATCAT GCCAAAATCG TTAAAGAGGC GGGGGCGGCA GTCACGCTCG ATGGCGATCG CGCATTCGGT CAGGTCGCGG CACACGAAGC GATGGCGCTG GGGATTGAGA AAGCGCATCA GCACGGTATT GCTGCCGTGG CGCTACATAA CTCGCATCAT ATCGGCCGTA TCGGTTACTG GGCGGAGCAG TGTGCAGCGG CGGGGTTTGT CTCTATCCAC TTTGTTAGCG TGGTCGGTAT TCCAATGGTC GCACCGTTCC ACGGTCGCGA CAGCCGCTTT GGCACCAATC CGTTCTGTGT GGTTTTCCCT CGTAAAGATA ATTTCCCGCT GTTGCTTGAT TACGCCACCA GCGCCATCGC ATTTGGTAAA ACTCGCGTCG CCTGGCATAA AGGCGTCCCC GTGCCGCCAG GTTGCCTGAT TGACGTTAAC GGCGTGCCGA CGACCAATCC GGCGGTAATG CAGGAGTCGC CGTTGGGGTC GCTGTTGACC TTTGCCGAAC ATAAAGGCTA CGCCCTTGCA GCGATGTGTG AAATTCTTGG CGGGGCGCTT TCTGGCGGTA AAACGACGCA TCAGGAAACG TTACAAACCA GTCCCGATGC CATTCTTAAC TGCATGACCA CTATCATCAT CAACCCGGAA CTGTTCGGCG CGCCGGATTG TAGCGCGCAG ACTGAAGCCT TTGCCGAGTG GGTGAAAGCC TCGCCGCATG ATGACGATAA GCCGATTTTG CTACCGGGCG AGTGGGAAGT GAACACGCGT CGCGAACGGC AGGAGCAGGG GATTCCATTG GATGCGGGAA GCTGGCAGGC CATTTGTGAT GCGGCGCGGC AGATTGGTAT GTCGGAAGAG ACGTTACAGG CTTTCTGTCA GCAGTTAGCC AGCTAA
|
Protein sequence | MESGHRFDAQ TLHSFIQAVF RQMGSEEQEA KLVADHLIAA NLAGHDSHGI GMIPSYVRSW SQGHLQINHH AKIVKEAGAA VTLDGDRAFG QVAAHEAMAL GIEKAHQHGI AAVALHNSHH IGRIGYWAEQ CAAAGFVSIH FVSVVGIPMV APFHGRDSRF GTNPFCVVFP RKDNFPLLLD YATSAIAFGK TRVAWHKGVP VPPGCLIDVN GVPTTNPAVM QESPLGSLLT FAEHKGYALA AMCEILGGAL SGGKTTHQET LQTSPDAILN CMTTIIINPE LFGAPDCSAQ TEAFAEWVKA SPHDDDKPIL LPGEWEVNTR RERQEQGIPL DAGSWQAICD AARQIGMSEE TLQAFCQQLA S
|
| |