Gene Information |
Locus tag | SbBS512_E1797 |
Symbol | tus |
ID | 6271282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1640354 |
End bp | 1641283 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641725866 |
Product | DNA replication terminus site-binding protein |
Protein accession | YP_001880364 |
Protein GI | 187734206 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02648] DNA replication terminus site-binding protein |
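The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though one consistent with the stated gene length) and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1640354
END_BP = 1641283
GENE_LENGTH_BP = 930
PROTEIN_LENGTH_AA = 309


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: the span covers 930 bp, which is 310 codons, of which the last is the stop codon.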
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.00623944 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGTT ACGATCTCGT AGACCGACTC AACACTACCT TTCGCCAGAT GGAACAAGAG CTGGCTGCAT TTGCCGCTCA TCTTGAGCAA CACAAGCTAT TGGTTGCCCG CGTGTTCTCT TTGCCGGAGG TAAAAAAAGA GGATGAGCAT AATCCGCTTA ATCGTATTGA GGTGAAACAA CATCTCGGCA ACGACGCGCA GTCGCTGGCG TTGCGTCATT TCCGCCATTT ATTTATTCAA CAACAGTCCG AAAATCGCAG CAGCAAAGCC GCTGTCCGTC TGCCTGGCGT GTTGTGTTAC CAGGTCAATA ACCTTTCGCA AGCAGCGTTG GTCAGTCATA TTCAGCACAT CAATAAACTC AAGACCACGT TCGAGCATAT CGTCACGGTT GAGTCAGAAC TCCCCACCGC GGCACGTTTT GAATGGGTGC ATCGTCATTT GCCGGGGCTG ATCACCCTTA ATGCTTACCG CTCGCTCACC GTTCTGCACG ACCCCGCCAC TTTACGTTTT GGCTGGGCTA ATAAACATAT CATTAAAAAT TTGCATCGTG ATGAAGTCCT GGCACAGCTG GAAAAAAGCC TGAAATCACC ACGCAGTGTC GCACCGTGGA CGCGCGAGGA GTGGCAAAGA AAACTGGAGC GAGAGTATCA GGATATCGCT GCCCTGCCAC AGAACGCGAA GTTAAAAATC AAACGTCCGG TGAAGGTGCA GCCGATTGCC CGCGTCTGGT ACAAAGGAGA TCAAAAACAA GTGCAACACG CCTGCCCTAC ACCGCTGATT GCACTGATTA ATCGGGATAA TGGTGCGGGC GTGCCGGACG TTGGTGAGTT GTTAAATTAC GATGCAGACA ATGTGCAGCA CCGTTATAAA CCTCAGGCGC AGCCGCTTCG TTTGATCATT CCACGGCTGC ACCTGTATGT TGCAGATTAA
|
Protein sequence | MARYDLVDRL NTTFRQMEQE LAAFAAHLEQ HKLLVARVFS LPEVKKEDEH NPLNRIEVKQ HLGNDAQSLA LRHFRHLFIQ QQSENRSSKA AVRLPGVLCY QVNNLSQAAL VSHIQHINKL KTTFEHIVTV ESELPTAARF EWVHRHLPGL ITLNAYRSLT VLHDPATLRF GWANKHIIKN LHRDEVLAQL EKSLKSPRSV APWTREEWQR KLEREYQDIA ALPQNAKLKI KRPVKVQPIA RVWYKGDQKQ VQHACPTPLI ALINRDNGAG VPDVGELLNY DADNVQHRYK PQAQPLRLII PRLHLYVAD
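The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the pipeline that produced this record. It assumes the sequence is pasted as space-separated 10-base groups, as shown above, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. It does not handle the alternative start codons that table 11 permits, since this gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Standard amino-acid assignments, ordered by codon with bases in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence in this record.
    print(translate("ATGGCGCGTT ACGATCTCGT AGACCGACTC"))
```

Applied to the first three groups of the gene sequence, the sketch yields the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the protein sequence and GC content fields to be checked against the record.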
|
| |