Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1759 |
Symbol | |
ID | 6270404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1599430 |
End bp | 1601802 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641725833 |
Product | TonB-dependent receptor |
Protein accession | YP_001880331 |
Protein GI | 187733928 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.949304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGAG TTCTTATTCC TGGCGTCATT TTATGTGGCG CTGATGTGGC GCAGGCCGTC GATGACAAAA ACATGTACAT GCATTTTTTT GAAGAGATGA CGGTCTATGC TCCTGTCCCT GTACCCGTAA ACGGCAACAC GCATTACACC AGTGAAAGCA TCGAGCGTTT ACCGACCGGG AATGGCAATA TCAGCGATCT GCTGAGAACC AACCCTGCGG TACGCATGGA TTCAACGCAA AGTACCTCGT TGAACCAGGG AGATATTCGC CCTGAGAAAA TCTCTATTCA CGGTGCGTCG CCCTACCAGA ATGCCTATTT GATTGACGGT ATTAGTGCAA CTAATAACCT GAACCCAGCG AATGAGTCCG ATGCCAGTAG TGCAACCAAT ATTAGCGGGA TGTCACAGGG GTATTATCTT GATGTCAGCT TACTGGACAA TGTGACGCTT TATGACAGTT TTGTGCCGGT TGAATTTGGT CGCTTCAATG GCGGGGTAAT TGATGCAAAG ATCAAACGCT TCAACGCTGA TGATAGCAAG GTGAAATTGG GTTATCGCAC TACGCGTTCG GACTGGTTAA CATCGCATAT CGATGAGAAT AACAAGAGCG CATTTAATCA AGGTTCTTCA GGAAGTACTT ATTTCTCTCC AGATTTTAAA AAGAACTTTT ATACCTTGTC GTTTAATCAG GAGCTGGCTG ATAACTTTGG CGTTACCGCC GGTTTATCGC GCCGCCAGTC TGATATCACC CGCGCGGATT ATGTTTCGAA TGACGGCATT GTCGCCGGTC GGGCACAGTA TAAAAACGTT ATCGATACTG CATTGAGCAA ATTTACCTGG TTTGCCAGCG ACCGCTTTAC CCACGATTTA ACCTTAAAAT ATACCGGCTC CAGCCGTGAT TATAATACCA GCACCTTCCC GCAGTCTGAT CGCGAAATGG GTAATAAATC CTATGGTCTT GCATGGGATA TGGATACGCA ACTCGCATGG GCCAAACTAC GTACCACCGT TGGTTGGGAT CATATTAGTG ATTATACCCG TCACGATCAT GACATCTGGT ACACCGAACT TTCATGTACA TATGGTGATA TTACAGGGCG TTGTACCCGT GGCGGATTAG GACACATTTC CCAGGCTGTA GATAATTACA CCTTCAAAAC ACGCCTGGAC TGGCAAAAAT TCGCCGTGGG TGATGTTTCG CATCAACCCT ACTCCGGCGC GGAATACATC TATTCCGATG CATGGACTGA ACGCCATAAC CAGTCTGAAT CCTATGTGAT TAATGCTGCA GGAAAGAAAA CTAACCATAC CATTTACCAT AAAGGTAAAG GCAGCCTAGG AATTGACAAC TACACGCTGT ATATGGCGGA TCACATTAGC TGGAGGAATG TGTCGTTAAT GCCCGGTGTG CGTTATGACT ATGACAACTA TCTGTCAAAC CACAATATCT CCCCGCGCTT TATGACGGAA TGGGATATTT TTGCTGATCA AACCTCAATG ATTACCGCCG GTTATAACCG TTACTATGGC GGGAATATTC TTGATATGGG ATTACGTGAT ATCCGCAATA GCTGGACGGA ATCGGTATCA GGTAATAAAA CCCTGACGCG TTATCAGAAT TTGAAAACGC CTTATAACGA TGAACTGGCA ATGGGATTGC AGCAAAAAAT CGGTAAGAAC GTTATTGCAC GCGCAAACTA TGTTTACCGT GCAGCGCATG ATCAAATCAG CAAAAGCAGT CGTACCGACA GCGCGACTAA AACCACCATT ACTGAATATA ACAACGATGG CAAAACCAAA ACGCATTCGT TTAACCTCAG TTTTGAACTG GCCGAACCCC TGCATATCCG CCAGGTAGAT ATTAACCCGC AAATTGTCTT TAGCTATATC AAGAGCAAGG GCAACTTGTC GTTAAACAAT GGTTATGAGG AGAGCAATAC CGGTGATAAC CAGGTGGTTT ATAACGGTAA TCTGGTCTCT TACGATAGCG TTCCAGTGGC AGATTTTAAT AACCCATTAA AGATCTCCTT AAACATGGAT TTCACGCATC AACCGAGCGG GTTAGTGTGG GCGAATACGC TGGCCTGGCA AGAAGCGCGT AAAGCTCGCA TTATCCTTGG TAAGACAAAT GCGCAATACA TCAGCGAATA TTCAGATTAC AAGCAGTATG TTGACGAAAA ACTGGATAGC AGCCTGACCT GGGACACCCG CTTGTCCTGG ACGCCACAAT TTCTGAAACA ACAAAACCTG ACGATCAGTG CCGATATTCT CAATGTACTG GATAGCAAAA CCGCTGTTGA TACAACGAAT ACCGGTGTGG CGACCTACGC CAGTGGCCGT ACTTTCTGGC TTGATGTCAG CATGAAATTT TAA
|
Protein sequence | MKRVLIPGVI LCGADVAQAV DDKNMYMHFF EEMTVYAPVP VPVNGNTHYT SESIERLPTG NGNISDLLRT NPAVRMDSTQ STSLNQGDIR PEKISIHGAS PYQNAYLIDG ISATNNLNPA NESDASSATN ISGMSQGYYL DVSLLDNVTL YDSFVPVEFG RFNGGVIDAK IKRFNADDSK VKLGYRTTRS DWLTSHIDEN NKSAFNQGSS GSTYFSPDFK KNFYTLSFNQ ELADNFGVTA GLSRRQSDIT RADYVSNDGI VAGRAQYKNV IDTALSKFTW FASDRFTHDL TLKYTGSSRD YNTSTFPQSD REMGNKSYGL AWDMDTQLAW AKLRTTVGWD HISDYTRHDH DIWYTELSCT YGDITGRCTR GGLGHISQAV DNYTFKTRLD WQKFAVGDVS HQPYSGAEYI YSDAWTERHN QSESYVINAA GKKTNHTIYH KGKGSLGIDN YTLYMADHIS WRNVSLMPGV RYDYDNYLSN HNISPRFMTE WDIFADQTSM ITAGYNRYYG GNILDMGLRD IRNSWTESVS GNKTLTRYQN LKTPYNDELA MGLQQKIGKN VIARANYVYR AAHDQISKSS RTDSATKTTI TEYNNDGKTK THSFNLSFEL AEPLHIRQVD INPQIVFSYI KSKGNLSLNN GYEESNTGDN QVVYNGNLVS YDSVPVADFN NPLKISLNMD FTHQPSGLVW ANTLAWQEAR KARIILGKTN AQYISEYSDY KQYVDEKLDS SLTWDTRLSW TPQFLKQQNL TISADILNVL DSKTAVDTTN TGVATYASGR TFWLDVSMKF
|
| |