Gene Information |
Locus tag | SbBS512_E4768 |
Symbol | |
ID | 6271909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4450010 |
End bp | 4451290 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641728522 |
Product | putative membrane protein |
Protein accession | YP_001882917 |
Protein GI | 187730651 |
COG category | [S] Function unknown |
COG ID | [COG2733] Predicted membrane protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAC TCATTGAACT CAGACGCGCC AAAATGTTGG CGCTCTCTTT ACTGCTTATC GCCGCTGCTA CCTTTGTCGT TACGCTGTTT TTGCCGCCCA ATTTTTGGGT GAGCGGCGTG AAGGCGATTG CTGAAGCGGC GATGGTCGGC GCGCTGGCGG ACTGGTTTGC GGTGGTGGCG CTGTTTCGCC GCGTGCCGAT TCCGATCATT TCTCGTCATA CGGCGATTAT CCCGCGTAAT AAAGACCGGA TTGGCGAAAA TCTCGGTCAG TTCGTGCAGG AAAAATTTCT CGATACCCAG TCGCTGGTGG CATTGATTCG ACGTCACGAA CCGGCGTTGT TGATTGGCAA CTGGTTTAGT CAGCCGGAGA ACGCCCGCCG CGTTGGTCAG CATCTGTTGC AGATCATGAG CGGTTTTCTT GAACTGACCG ATGATGCGCG TATTCAGCGC CTGCTTAAGC GCGCGGTCCA TCGGGCGATT GATAAGGTCG ATCTTTCCGG AACCAGTGCG TTGATGCTGG AGAGTATGAC CAAAAACGAT CGTCATCAGG TGCTGCTCGA TACGCTGATC GCACAGTTGA TCGCCCTTCT CCAGCGCGAT AAATCGCGCA AGTTTATTGC CCAGCAAATT GTTCGCTGGC TGGAGAGCGA GCATCCGCTG AAAGCCAAAA TTTTGCCTAC TGAATGGCTG GGCGAACATA GCGCGGAGTT GGTTTCTGAC GCGGTGAATT CTTTGCTTGA TGATATCAGC CGCGATCGTG CGCATCAGAT CCGTCATGCG TTTGATCGCG CCACTTTTGC CCTGATCGAC AAGCTGAAAA ACGATCCGGA AATGGCAGCG CGAGCCGATG CCGTAAAAAG TTATCTGAAA GAAGATGAAG CTTTTAACCG CTATCTCAGT GAATTGTGGG GGGATTTACG GGAATGGCTG AAAGCGGATA TCAACAGTGA AGATTCTCGT GTGAAAGAAC GTATCGCGCG GGCGGGTCAA TGGTTTGGCG AAACGTTAAT TGCCGATGAT GCCTTGCGGG CATCGTTAAA TGGTCACCTG GAACAAGCCG CGCACCGCGT CGCGCCTGAG TTTTCCGCAT TCCTGACGCG CCACATCAGC GATACAGTAA AAAGCTGGGA TGCTCGGGAT ATGTCGCGGC AAATCGAGTT AAATATCGGC AAAGATCTGC AGTTTATCCG TGTCAACGGT ACGCTGGTTG GCGGTTGTAT TGGGCTAATT TTATATTTGT TGTCGCAGCT CCCGGCCTTG TTCCCCCTCA GCAATTTTTA G
|
Protein sequence | MNKLIELRRA KMLALSLLLI AAATFVVTLF LPPNFWVSGV KAIAEAAMVG ALADWFAVVA LFRRVPIPII SRHTAIIPRN KDRIGENLGQ FVQEKFLDTQ SLVALIRRHE PALLIGNWFS QPENARRVGQ HLLQIMSGFL ELTDDARIQR LLKRAVHRAI DKVDLSGTSA LMLESMTKND RHQVLLDTLI AQLIALLQRD KSRKFIAQQI VRWLESEHPL KAKILPTEWL GEHSAELVSD AVNSLLDDIS RDRAHQIRHA FDRATFALID KLKNDPEMAA RADAVKSYLK EDEAFNRYLS ELWGDLREWL KADINSEDSR VKERIARAGQ WFGETLIADD ALRASLNGHL EQAAHRVAPE FSAFLTRHIS DTVKSWDARD MSRQIELNIG KDLQFIRVNG TLVGGCIGLI LYLLSQLPAL FPLSNF
|
| |
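The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the gene sequence ends with a stop codon that is not counted in the protein length (the sequence shown ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 4450010
END_BP = 4451290


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 1281, matching "Gene Length"
print(protein_length(length))  # 426, matching "Protein Length"
```

Under these assumptions, both computed values agree with the 1281 bp and 426 aa reported in the record.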