Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0781 |
Symbol | |
ID | 6271195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 733002 |
End bp | 734816 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641724962 |
Product | ABC transporter, periplasmic solute-binding protein |
Protein accession | YP_001879489 |
Protein GI | 187731668 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGTGC GCATACTGCT GCTGTTTATC GCCCTGTTCA CCTTCGGTGC GCAGGCGCAG GCTATCAAGG AAAGCTATGC CTTTGCCGTG CTGGGCGAAC CCCGGTACGC ATTTAATTTC AACCATTTTG ATTATGTGAA CCCCGCCGCG CCAAAAGGTG GGCAGATAAC GTTGTCAGCC CTCGGTACCT TCGATAATTT CAACCGCTAT GCACTGCGCG GCAACCCGGG CGCACGCACC GAGCAGTTGT ACGACACGCT ATTTACGACT TCCGATGACG AACCAGGCAG TTATTACCCG CTGATTGCTG AAAGTGCACG CTATGCTGAC GATTATTCCT GGGTGGAGGT CGCTATTAAT CCGCGCGCCC GTTTTCATGA TGGTTCGCCC ATTACTGCCC GCGATGTAGA GTTTACTTTT CAAAAATTTA TGACCGAAGG CGTGCCGCAA TTTCGTCTGG TCTACAAAGG CACCACCGTC AAAGCCATTG CACCGTTAAC CGTGCGCATT GAGTTAGCTA AACCCGGCAA AGAAGATATG CTGAGTCTGT TTTCGCTGCC GGTATTTCCA GAAAAGTACT GGAAGGATCA CAAACTTAGC GACCCTCTCG CCACGCCTCC GCTTGCCAGT GGTCCGTACC GCGTTACGTC CTGGAAAATG GGGCAAAATA TTGTCTATTC CCGTGTGAAA GATTACTGGG CAGCAAACTT ACCGGTAAAC CGTGGACGCT GGAATTTCGA CACCATTCGC TACGATTATT ACCTCGATGA TAATGTCGCC TTTGAAGCGT TTAAAGCAGG TGCCTTTGAT TTGCGTATGG AAAACGACGC TAAAAACTGG GCCACGCGTT ATACCGGTAA AAATTTCGAT AAAAAATACA TCATCAAAGA TGAGCAAAAG AACGAATCAG CCCAGGATAC GCGCTGGCTG GCGTTTAATA TCCAACGTCC GGTATTCAGC GATCGCCGGG TCCGGGAAGC AATCACTCTC GCCTTTGACT TTGAATGGAT GAACAAGGCG TTGTTTTACA ACGCCTGGAG TCGCACAAAC AGTTATTTTC AGAATACCGA ATACGCGGCC AGAAATTACC CCGACGCCGC GGAGCTGGTG CTTCTGGCAC CAATGAAAAA AGATCTACCG CCAGAAGTCT TCACACAAAT CTACCAGCCG CCGGTATCTA AAGGCGATGG CTACGATCGT GACAACCTGT TAAAAGCCGA CACACTTCTC AACGAAGCGG GCTGGGTGCT GAAGGGTCAG CAACGCGTTA ATGCAACAAC GGGTCAGCCA CTCAGCTTTG AATTATTGCT TCCCGCAAGC AGCAATAGTC AGTGGGTATT GCCGTTCCAG CACAGCCTGC AACGGCTGGG TATCAACATG GACATTCGCA AGGTGGATAA CTCGCAAATC ACCAACCGCA TGCGCAGTCG CGACTATGAC ATGATGCCGC GCCTATGGCG GGCGATGCCG TGGCCCAGTT CCGATTTACA GATTTCCTGG TCATCGGAAT ATATCAATTC CACTTATAAT GCCCCCGGCG TGCAAAGCCC GGTTATCGAC TCACTGATCA ACCAAATTAT TGCCGCGCAG GGAAATAAAG AAAAATTATT GCCGTTGGGG CGAGCACTGG ATCGCGTATT AACGTGGAAT TATTACATGC TGCCAATGTG GTACATGGCG GAAGACCGTC TCGCCTGGTG GGATAAATTC TCCCAACCCG CTGTACGCCC TGTTTACAGA CTGGGTATCG ATACCTGGTG GTATGACGTT AATAAAGCGG CCAAACTGCC GTCAGCCAGG CAACAGGGAG AGTAG
|
Protein sequence | MIVRILLLFI ALFTFGAQAQ AIKESYAFAV LGEPRYAFNF NHFDYVNPAA PKGGQITLSA LGTFDNFNRY ALRGNPGART EQLYDTLFTT SDDEPGSYYP LIAESARYAD DYSWVEVAIN PRARFHDGSP ITARDVEFTF QKFMTEGVPQ FRLVYKGTTV KAIAPLTVRI ELAKPGKEDM LSLFSLPVFP EKYWKDHKLS DPLATPPLAS GPYRVTSWKM GQNIVYSRVK DYWAANLPVN RGRWNFDTIR YDYYLDDNVA FEAFKAGAFD LRMENDAKNW ATRYTGKNFD KKYIIKDEQK NESAQDTRWL AFNIQRPVFS DRRVREAITL AFDFEWMNKA LFYNAWSRTN SYFQNTEYAA RNYPDAAELV LLAPMKKDLP PEVFTQIYQP PVSKGDGYDR DNLLKADTLL NEAGWVLKGQ QRVNATTGQP LSFELLLPAS SNSQWVLPFQ HSLQRLGINM DIRKVDNSQI TNRMRSRDYD MMPRLWRAMP WPSSDLQISW SSEYINSTYN APGVQSPVID SLINQIIAAQ GNKEKLLPLG RALDRVLTWN YYMLPMWYMA EDRLAWWDKF SQPAVRPVYR LGIDTWWYDV NKAAKLPSAR QQGE
|
| |