Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2527 |
Symbol | |
ID | 6268980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2327651 |
End bp | 2328916 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641726511 |
Product | hypothetical protein |
Protein accession | YP_001880991 |
Protein GI | 187732419 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.00600019 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCTA CTTTTACCAG CGACACATTG CCAGCCGATC ACAAAGCAGC TATCCGTCAG ATGAAGCACG CGCTGCGGGC GCAGCTTGGC GACGTCCAGC AGATCTTTAA TCAGCTAAGC GATGACATTG CCACGCGAGT GGCTGAAATC AACGCACTCA AAGCACAGGG CGATGCCGTC TGGCCGGTGC TGTCTTATGC CGATATCAAA GCAGGTCATG TTACTGCAGA GCAGCGCGAA CAGATTAAAC GTCGCGGTTG TGCGGTGATA AAAGGCCATT TCCCCCGCGA ACAAGCGCTA GGCTGGGATC AGTCGATGCT GGACTATCTG GACCGCAACC GCTTTGACGA GGTCTACAAA GGCCCCGGCG ATAATTTCTT CGGGACGCTC AGCGCTTCAC GTCCCGAGAT TTACCCCATC TACTGGTCGC AGACGCAAAT GCAGGCCCGC CAGAGTGAAG AAATGGCGAA TGCGCAGTCG TTTCTCAATC GTCTGTGGAC ATTTGAAAGT GATGGAAAGC AATGGTTTAA CCCGGATATG AGCGTCATCT ACCCTGACCG TATCCGCCGC CGTCCGCCCG GAACGACCTC CAAAGGTCTT GGAGCGCATA CCGACTCCGG GGCGCTGGAA CGCTGGCTGC TTCCAGCGTA TCAGCGCGTT TTCGCCAACG TCTTTAATGG CAATCTGGCG CAATATGATC CCTGGCATGC GGCACATCGT ACGGAAGTTG AAGAGTACAC GGTGGACAAC ACCACCAAAT GTTCCGTGTT TCGGACATTC CAGGGCTGGA CAGCGCTCTC TGATATGCTG CCTGGTCAGG GGTTGCTGCA CGTTGTGCCC ATTCCTGAAG CCATGGCGTA CGTACTGTTA CGTACGCTGC TTGATGATGT GCCGGAGGAT GAACTGTGCG GCGTAGCGCC CGGAAGAGTG TTGCCGGTAT CAGAGCAATG GCATCCACTG TTAATTGAGG CGTTAACCAG CATTCCAAAA CTCGAGGCCG GAGACTCCGT CTGGTGGCAC TGCGACGTCA TCCATTCCGT TGCCCCCGTT GAAAATCAAC AGGGTTGGGG CAACGTGATG TACATTCCTG CGGCACCGAT GTGCAAGAAA AATCTTGCCT ACGCGCACAA GGTGAAGGCC GCACTGGAAA AAGGCGCATC GCCGGGCGAC TTCCCGCGCG AGGACTATGA AACAAACTGG GAAGGACGCT TTACGCTTGC CGACCTCAAC ATTCACGGTA AGCGAGCGTT GGGCATGGAT GTTTGA
|
Protein sequence | MASTFTSDTL PADHKAAIRQ MKHALRAQLG DVQQIFNQLS DDIATRVAEI NALKAQGDAV WPVLSYADIK AGHVTAEQRE QIKRRGCAVI KGHFPREQAL GWDQSMLDYL DRNRFDEVYK GPGDNFFGTL SASRPEIYPI YWSQTQMQAR QSEEMANAQS FLNRLWTFES DGKQWFNPDM SVIYPDRIRR RPPGTTSKGL GAHTDSGALE RWLLPAYQRV FANVFNGNLA QYDPWHAAHR TEVEEYTVDN TTKCSVFRTF QGWTALSDML PGQGLLHVVP IPEAMAYVLL RTLLDDVPED ELCGVAPGRV LPVSEQWHPL LIEALTSIPK LEAGDSVWWH CDVIHSVAPV ENQQGWGNVM YIPAAPMCKK NLAYAHKVKA ALEKGASPGD FPREDYETNW EGRFTLADLN IHGKRALGMD V
|
| |