Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_A0115 |
Symbol | |
ID | 6273507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010660 |
Strand | - |
Start bp | 72670 |
End bp | 74931 |
Gene Length | 2262 bp |
Protein Length | 753 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641728762 |
Product | tonB-dependent receptor |
Protein accession | YP_001883153 |
Protein GI | 187734496 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 111 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTTA TAAAACTGGC TATCGGCTCA GGCATATTAT TGCTCAGCTG CGGTGCTTAC TCACAATCCA TCAGTGAAAA AACTAATTCC GACAAAAAAG GAGCGGCAGA ATTCAGTCCG CTCAGCGTTT CTGTCGGGAA GACGACCAGT GAGCAGGAAG CTCTCGAGAA AACAGGCGCG ACCAGTTCCC GGACAACGGA CAAAAACCTG CAATCACTTG ACGCAACAGT GCGTAGTATG CCTGGTACTT ATACTCAAAT AGATCCTGGT CAGGGAGCAA TCAGTGTGAA TATTCGAGGC ATGAGCGGAT TTGGTCGTGT AAACACTATG GTCGATGGTA TTACCCAGAG TTTTTACGGA ACCTCTACCT CCGGAACAAC GGCGCATGGT TCAACTAACA ATATGGCTGG CGTACTTATA GATCCTAACT TACTGGTAGC AGTTGATGTT ACACGCGGTG ACAGCAGTGG CTCTGAAGGG ATCAACGCCC TTGCCGGTAG TGCAAATATG CGTACTATTG GCGTTGACGA TGTAATATTT AACGGTAATA CATATGGCCT TCGTTCACGT TTCTCTGTCG GTAGTAATGG GCTGGGACGC AGCGGAATGA TCGCCCTTGG TGGAAAAAGC GACGCTTTTA CGGATACGGG AAGCATTGGC GTTATGGCTG CTGTGAGCGG CAGTTCTGTG TACTCTAATT TCTCAAATGG TTCTGGAATT AACAGCAAAG AGTTTGGTTA TGATAAATAT ATGAAGCAGA ACCCCAAATC CCAACTGTAT AAAATGGATA TCAGACCAGA CGAATTTAAC AGCTTCGAAC TTTCCGCTCG AACCTATGAA AATAAATTTA CACGTCGTGA TATAACCAGT GACGACTATT ACATTAAATA TCATTACACC CCTTTTTCTG AATTAATTGA CTTTAACGTA ACGGCCAGTA CCAGTCGCGG TAATCAAAAG TATCGTGATG GCTCGCTGTA TACTTTCTAC AAAACCTCAG CGCAAAATCG TTCTGACGCG CTGGATATCA ACAATACCAG CCGGTTCACT GTCGCGGACA ATGAACTGGA GTTTATGCTG GGCAGCAAAC TGATGCGTAC CCGCTATGAC CGGACCATTC ACTCAGCGGC GGGCGACCCG AAAGCGAACC AGGAATCGAT CGAGAACAAT CCGTTCGCAC CCTCCGGCCA GCAGGATATT TCAGCGCTGT ATACCGGGCT GAAGGTTACG CGCGGCATCT GGGAGGCAGA TTTCAATCTC AACTACACAC GTAACAGGAT CACAGGGTAC AAGCCCGCCT GCGATTCACG CGTTATCTGC GTGCCACAGG GTAGCTACGA TATTGACGAT AAAGAGGGTG GCTTCAACCC TTCAGTTCAG CTTTCTGCTC AGGTAACACC ATGGCTTCAG CCGTTCATTG GCTACAGCAA ATCCATGCGT GCCCCGAACA TCCAGGAGAT GTTCTTCTCT AATTCAGGAG GCGCATCCAT GAACCCATTC CTGAAGCCTG AACGTGCAGA AACCTGGCAG GCGGGTTTTA ACATTGATAC CAGAGATTTA CTGGTCGAAC AGGATGCCCT GCGCTTTAAG GCTCTGGCGT ACCGCAGCAG GATCCAGAAC TACATCTACA GCGAGTCTTA TCTGGTTTGT TCTGGAGGTC GTAAATGCAG TATGGCTGAG GTGATTGGCA ATGACTGGGA GGGCATTAGC GATGAATACA GCGACAATAT GTACATCTAC GTTAACTCGG CAAGCGACGT TATTGCAAAG GGCTTCGAAC TGGAGATGGA TTATGATGCA GGTTTTGCTT TTGGCCGACT CTCTTTCAGC CAGCAGCAAA CAGACCAGCC AACCTCCATC GCCAGCACCT ACTTTGGCGC AGGGGATATG ACCGAACTGC CCAGAAAATA CATGACGCTG GATACTGGTG TTCGCTTCTT CGATAACGCG TTGACCCTGG GCACTATCAT AAAATACACA GGCAAGGCTC GTCGCCTGTC GCCTGATTTT GAGCAGGACG AACATACCGG CGCAATAATC AAACAGGATT TGCCGCAGAT CCCAACGATT ATCGATCTCT ATGGTACTTA CGAGTACAAC CGCAACCTGA CACTGAAACT TTCGGTACAA AACCTGATGA ACAGAGATTA TTCGGAGGCG CTGAATAAGC TCAACATGAT GCCAGGTCTT GGTGACGAGA GCCACCCAGC CAATTCCGCG CGTGGCAGAA CATGGATATT TGGCGGGGAC ATTCGTTTCT GA
|
Protein sequence | MNVIKLAIGS GILLLSCGAY SQSISEKTNS DKKGAAEFSP LSVSVGKTTS EQEALEKTGA TSSRTTDKNL QSLDATVRSM PGTYTQIDPG QGAISVNIRG MSGFGRVNTM VDGITQSFYG TSTSGTTAHG STNNMAGVLI DPNLLVAVDV TRGDSSGSEG INALAGSANM RTIGVDDVIF NGNTYGLRSR FSVGSNGLGR SGMIALGGKS DAFTDTGSIG VMAAVSGSSV YSNFSNGSGI NSKEFGYDKY MKQNPKSQLY KMDIRPDEFN SFELSARTYE NKFTRRDITS DDYYIKYHYT PFSELIDFNV TASTSRGNQK YRDGSLYTFY KTSAQNRSDA LDINNTSRFT VADNELEFML GSKLMRTRYD RTIHSAAGDP KANQESIENN PFAPSGQQDI SALYTGLKVT RGIWEADFNL NYTRNRITGY KPACDSRVIC VPQGSYDIDD KEGGFNPSVQ LSAQVTPWLQ PFIGYSKSMR APNIQEMFFS NSGGASMNPF LKPERAETWQ AGFNIDTRDL LVEQDALRFK ALAYRSRIQN YIYSESYLVC SGGRKCSMAE VIGNDWEGIS DEYSDNMYIY VNSASDVIAK GFELEMDYDA GFAFGRLSFS QQQTDQPTSI ASTYFGAGDM TELPRKYMTL DTGVRFFDNA LTLGTIIKYT GKARRLSPDF EQDEHTGAII KQDLPQIPTI IDLYGTYEYN RNLTLKLSVQ NLMNRDYSEA LNKLNMMPGL GDESHPANSA RGRTWIFGGD IRF
|
| |