Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_A0087 |
Symbol | |
ID | 6273472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010660 |
Strand | + |
Start bp | 53892 |
End bp | 55619 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641728738 |
Product | IpaH1.4 |
Protein accession | YP_001883129 |
Protein GI | 187734415 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 87 |
Plasmid unclonability p-value | 0.00799741 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAAAT CAACCAATAT ACAGGCAATC GGTTCTGGTA TTATGCATCA AATAAACAAT ATATACTCGT TAACTCCATT TCCTTTACCT ATGGAACTGA CTCCATCTTG TAATGAATTT TATTTAAAAG CCTGGAGTGA ATGGGAAAGG AACGGTACCC CAGGCGAGCA ACGCAATATC GCCTTCAATA GGCTGAAAAT ATGTTTACAA AATCAAGAGG CAGAATTAAA TTTATCTGAG TTAGATTTAA AAACATTACC AGATTTACCG CCTCAGATAA CAACACTGGA AATAAGAAAA AACCTATTAA CACATCTCCC TGATTTACCA CCAATGCTTA AGGTAATACA TGCTCAATTT AATCAACTGG AAAGCTTACC TGCCTTACCC GAGACGTTAG AAGAGCTTAA TGCGGGTGAT AACAAGATAA AAGAATTACC ATTTCTTCCT GAAAATCTAA CTCATTTACG GGTTCATAAT AACCGATTGC ATATTCTGCC ACTATTGCCA CCGGAACTAA AATTACTGGT AGTTTCTGGA AACAGATTAG ACAGCATTCC CCCCTTTCCA GATAAGCTTG AAGGGCTGGC TCTGGCTAAT AATTTTATAG AACAACTACC GGAATTACCT TTTAGTATGA ACAGGGCTGT GCTAATGAAT AATAATCTGA CAACACTTCC GGAAAGTGTC CTGAGATTAG CTCAGAATGC CTTCGTAAAT GTTGCAGGTA ATCCACTGTC TGGCCATACC ATGCGTACAC TACAACAAAT AACCACCGGA CCAGATTATT CTGGTCCTCG AATATTTTTC TCTATGGGAA GTTCTGCCAC AATTTCCGCT CCAGAACACT CCCTGGCTGA TGCCGTGACA GCATGGTTCC CGGAAAACAA ACAATCTGAT GTATCACAGA TATGGCATGC TTTTGAACAT GAAGAGCATG CCAACACCTT TTCCGCGTTC CTTGACCGCC TTTCCGATAC CGTCTCTGCA CGCAATACCT CCGGATTCCG TGAACAGGTC GCTGCATGGC TGGAAAAACT CAGTGCCTCT GCGGAGCTTC GACAGCAGTC TTTCGCTGTT GCTGCTGATG CCACTGAGAG CTGTGAGGAC CGTGTCGCGC TCACATGGAA CAATCTCCGG AAAACCCTCC TGGTCCATCA GGCATCAGAA GGCCTTTTCG ATAATGATAC CGGCGCTCTG CTCTCCCTGG GCAGGGAAAT GTTCCGCCTC GAAATTCTGG AGGACATTGC CCGGGATAAA GTCAGAACTC TCCATTTTGT GGATGAGATA GAAGTCTACC TGGCCTTCCA GACCATGCTC GCAGAGAAAC TTCAGCTCTC CACTGCCGTG AAGGAAATGC GTTTCTATGG CGTGTCGGGA GTGACAGCAA ATGACCTCCG CACTGCCGAA GCCATGGTCA GAAGCCGTGA AGAGAATGAA TTTAAGGACT GGTTCTCCCT CTGGGGACCA TGGCATGCTG TACTGAAGCG TACGGAAGCT GACCGCTGGG CGCAGGCAGA AGAGCAGAAG TATGAGATGC TGGAGAATGA GTACTCTCAG AGGGTGGCTG ACCGGCTGAA AGCATCAGGT CTGAGCGGTG ATACGGATGC GGAGAGGGAA GCCGGTGCAC AGGTGATGCG TGAGACTGAA CAGCAGATTT ACCGTCAGTT GACTGACGAG GTACTGGCCC TGCGATTGTC TGAAAACGGC TCAAATCATA TCGCATAA
|
Protein sequence | MIKSTNIQAI GSGIMHQINN IYSLTPFPLP MELTPSCNEF YLKAWSEWER NGTPGEQRNI AFNRLKICLQ NQEAELNLSE LDLKTLPDLP PQITTLEIRK NLLTHLPDLP PMLKVIHAQF NQLESLPALP ETLEELNAGD NKIKELPFLP ENLTHLRVHN NRLHILPLLP PELKLLVVSG NRLDSIPPFP DKLEGLALAN NFIEQLPELP FSMNRAVLMN NNLTTLPESV LRLAQNAFVN VAGNPLSGHT MRTLQQITTG PDYSGPRIFF SMGSSATISA PEHSLADAVT AWFPENKQSD VSQIWHAFEH EEHANTFSAF LDRLSDTVSA RNTSGFREQV AAWLEKLSAS AELRQQSFAV AADATESCED RVALTWNNLR KTLLVHQASE GLFDNDTGAL LSLGREMFRL EILEDIARDK VRTLHFVDEI EVYLAFQTML AEKLQLSTAV KEMRFYGVSG VTANDLRTAE AMVRSREENE FKDWFSLWGP WHAVLKRTEA DRWAQAEEQK YEMLENEYSQ RVADRLKASG LSGDTDAERE AGAQVMRETE QQIYRQLTDE VLALRLSENG SNHIA
|
| |