Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0438 |
Symbol | |
ID | 6271920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 429485 |
End bp | 430438 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641724663 |
Product | DNA-binding transcriptional activator AllS |
Protein accession | YP_001879211 |
Protein GI | 187731924 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.0774203 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGATC CAGAAACCTT GCGGACTTTC ATTGCGGTTG CTGAAACAGG AAGTTTTTCA AAAGCGGCAG AACGATTATG TAAAACCACG GCGACGATCA GTTATCGCAT TAAACTTCTG GAAGAGAATA CCGGAGTAGC GCTGTTTTTC CGTACGACTC GTAGCGTAAC GTTGACAGCG GCTGGCGAGC ATCTACTTTC CCAGGCCAGA GACTGGCTGA GCTGGCTGAG CTGGCTGAGC TGGCTGAGCT GGCTGGAAAG TATGCCCAGC GAGCTGCAAC AGGTGAATGA TGGCGTGGAA CGCCAGGTGA ATATTGTCAT CAACAACCTG CTCTATAACC CCCAGGCCGT CGCCCAGTTG CTAGCGTGGC TGAATGAACG TTACCCCTTT ACCCAGTTTC ACATCTCCCG ACAAATCTAT ATGGGCGTCT GGGACTCGCT ATTGTACGAA GGTTTTTCGC TGGCTATCGG CGTCACGGGA ACTGAGGCGC TGGCAAATAC CTTTAGTCTT GATCCCTTAG GATCGGTGCA ATGGCGATTT GTCATGGCGG CGGATCATCC GCTGGCGAAC GTTGAAGAGC CGCTAACAGA AGCGCAGTTG CGTCGCTTTC CGGCGGTCAA TATTGAAGAC AGCGCCCGCA CCTTAACCAA ACGCGTCGCC TGGCGATTGC CAGGGCAAAA AGAGATTATT GTTCCTGATA TGGAAACGAA AATCGCCGCC CATCTGGCGG GCGTTGGCAT TGGTTTTTTG CCAAAATCGC TTTGCCAGTC AATGATCGAT AATCAACAAC TGGTCAGCCG GGTAATCCCA ACGATGCGCC CTCCTTCGCC ATTGAGTCTG GCATGGCGCA AATTTGGCAG CGGCAAAGCG GTAGAAGATA TTGTGACCTT GTTTACCCAG CGCAGGCCGG AAATCAGCGG ATTTTTAGAA ATTTTCGGCA ACCCACGCAG TTAA
|
Protein sequence | MFDPETLRTF IAVAETGSFS KAAERLCKTT ATISYRIKLL EENTGVALFF RTTRSVTLTA AGEHLLSQAR DWLSWLSWLS WLSWLESMPS ELQQVNDGVE RQVNIVINNL LYNPQAVAQL LAWLNERYPF TQFHISRQIY MGVWDSLLYE GFSLAIGVTG TEALANTFSL DPLGSVQWRF VMAADHPLAN VEEPLTEAQL RRFPAVNIED SARTLTKRVA WRLPGQKEII VPDMETKIAA HLAGVGIGFL PKSLCQSMID NQQLVSRVIP TMRPPSPLSL AWRKFGSGKA VEDIVTLFTQ RRPEISGFLE IFGNPRS
|
| |