Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0713 |
Symbol | |
ID | 6270253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 669100 |
End bp | 670431 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641724902 |
Product | invasion plasmid antigen |
Protein accession | YP_001879431 |
Protein GI | 187734068 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.29021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCAGTCAC TCTCAGCCCT TCTCAATAGC CTGGAGACGC TACCTGATCT TCCCCCGGCT CTACAAAAAC TTTCTGTTGG CAACAACCAG CTTACTGCCT TACCAGAATT ACCATGTGAA CTACAGGAAC TAAGTGCTTT TGATAACAGA TTACAAGAGC TACCGCCCCT TCCTCAAAAT CTGAGGCTTT TAAACGTTGG GGAAAACCAA CTACACAGAC TGCCCGAACT TCCACAACGT CTGCAATCAC TATATATCCC TAACAATCAG CTGAACACAT TGCCAGACAG TATCATGAAT CTGCACATTT ATGCAGATGT TAATATTTAT AACAATCCAT TGTCGACTCG CACTCTGCAA GCCCTGCAAA GATTAACCTC TTCGCCGGAC TACCACGGCC CACGGATTTA CTTCTCCATG AGTGACGGAC AACAGAATAC ACTCCATCGC CCCCTGGCTG ATGCCGTGAC AGCATGGTTC CCGGAAAACA AACAATCTGA TGTATCACAG ATATGGCATG CTTTTGAACA TGAAGAGCAC GCCAACACCT TTTCCGCGTT CCTTGACCGC CTTTCCGATA CCGTCTCTGC ACGCAATACC TCCGGATTCC GTGAACAGGT CGCTGCATGG CTGGAAAAAC TCAGTGCCTC TGCGGAGCTT CGACAGCAGT CTTTCGCTGT TGCTGCTGAT GCCACTGAGA GCTGTGAGGA CCGTGTCGCG CTCACATGGA ACAATCTCCG GAAAACCCTC CTGGTCCATC AGGCATCAGA AGGCCTTTTC GATAATGATA CCGGCGCTCT GCTCTCCCTG GGCAGGGAAA TGTTCCGCCT CGAAATTCTG GAGGACATTG CCCGGGATAA AGTCAGAACT CTCCATTTTG TGGATGAGAT AGAAGTCTAC CTGGCCTTCC AGACCATGCT CGCAGAGAAA CTTCAGCTCT CCACTGCCGT GAAGGAAATG CGTTTCTATG GCGTGTCGGG AGTGACAGCA AATGACCTCC GCACTGCCGA AGCCATGGTC AGAAGCCGTG AAGAGAATGA ATTTACGGAC TGGTTCTCCC TCTGGGGACC ATGGCATGCT GTACTGAAGC GTACGGAAGC TGACCGCTGG GCGCTGGCAG AAGAGCAGAA ATATGAGATG CTGGAGAATG AGTACCCTCA GAGGGTGGCT GACCGGCTGA AAGCATCAGG TCTGAGCGGT GATGCGGATG CGGAGAGGGA AGCCGGTGCA CAGGTGATGC GTGAGACTGA ACAGCAGATT TACCGTCAGC TGACTGACGA GGTACTGGCC CTGCGATTGT CTGAAAACGG CTCACAACTG CACCATTCAT AA
|
Protein sequence | MQSLSALLNS LETLPDLPPA LQKLSVGNNQ LTALPELPCE LQELSAFDNR LQELPPLPQN LRLLNVGENQ LHRLPELPQR LQSLYIPNNQ LNTLPDSIMN LHIYADVNIY NNPLSTRTLQ ALQRLTSSPD YHGPRIYFSM SDGQQNTLHR PLADAVTAWF PENKQSDVSQ IWHAFEHEEH ANTFSAFLDR LSDTVSARNT SGFREQVAAW LEKLSASAEL RQQSFAVAAD ATESCEDRVA LTWNNLRKTL LVHQASEGLF DNDTGALLSL GREMFRLEIL EDIARDKVRT LHFVDEIEVY LAFQTMLAEK LQLSTAVKEM RFYGVSGVTA NDLRTAEAMV RSREENEFTD WFSLWGPWHA VLKRTEADRW ALAEEQKYEM LENEYPQRVA DRLKASGLSG DADAEREAGA QVMRETEQQI YRQLTDEVLA LRLSENGSQL HHS
|
| |