Gene Information |
Locus tag | SbBS512_E4070 |
Symbol | |
ID | 6268955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3800493 |
End bp | 3801812 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641727910 |
Product | site-specific recombinase, phage integrase family |
Protein accession | YP_001882342 |
Protein GI | 187731467 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
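The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
START_BP, END_BP = 3800493, 3801812

length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1320
print(protein_length_aa(length))  # 439
```

Under these assumptions both results agree with the Gene Length (1320 bp) and Protein Length (439 aa) fields.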
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTTTG GGTACATTGC TTTGGGTACA CCCCCTATTC AAAGTTTGGA TATTCGCAAA ATGCCAAAAC TAACAGACAT GCAGATCCGC GCATGGATTA AGAGCGGAGA GCGATTCGAG GGGCGGGCAG ACGGTAACGG TTTATACCTA CGTTACCGTG AAGCCGACAA AACCCCCACA TGGAGATTCC GCTATAAACT CGCAGGGAAG TCCCGCGCCA TGCTAATTGG TTCGTATAGC GAGCTATCAC TATCAAAGGC CAGAGAGACA GCCAAAGAGC TATCGGCTCG CGTTGCGCTG GGCTATGACG TTGCAGGAGA GAAGCAGAAG CACAAAACCG AAGCACTGGC GAAGATGGAA GCAGAGAAGA ACGCTATGCG CGTTTCAGAG CTTGCCGCTG AATACTTTGA GCGTCAGATC CTCCCGCGCT GGAAGCACCC CGATATACTC CGCCGCCGTA TCGACAAAGA TATAAACCCC TGCATTGGCA GCATGAAGGT AGAGGACGTG AAACCGCGCC ATATCGATGA CATGCTGAAA GGTATTGTTG ACCGTGGAGC GCCGACCATA GCAACGGACG TGCTGAGATG GACGCGCCGC ATATTCGACT ACGGAATCAA ACGGCACGCG CTAGAGATTA ACCCCTGTTC AGCCTTTGAG GTGGCAGACG CCGGAGGGAA AGAAGCTGCC CGTGACCGCT GGTTAACCCG CGATGAGTTA ATCCAGCTAT TCAAAGCCAT GCGCACGGCT AAGGGATTCA GTCGCCAGAA CGAAATCACG TTCAAATTAC TGTTAGCGTT ATGCGTCCGC AAAATGGAAT TATGCGCCGC ACGATGGGAA GAGTTTGATT TAGATGGTGC GGTATGGCAT TTGCCGGAAG AACGCAGCAA AAACGGAGAC CCTATTGATA TACCTCTACC TTCCCCAGCC GTTGAATGGT TGAGAGAGCT ACACACCTTT TCATGTAATA GCGCATGGGT GCTTCCGGCC AGGAAGATGC AAAACAGAAT GATCCCACAT ATTCAGGAAA GCACTTTACC CGTAGCACTG GCTAAGGTTC GCGCCGAAAT GCCGGATGTG CCTAATTTCA CGATTCACGA CTTTCGACGC ACCGCACGTA CTCATTTAGC AGCGTTGGGT GTTGATCCTG TTGTGGCGGA ACGATGCCTC AATCATCGCA TTAAGGGCGT AGAGGGGATT TATAACCGCC ATCAGTATTT TGATGAGCGT AAAGCAGCAC TGGCACAGTG GGCTGATCTG CTAGTGGCAC TGGAAAGCGG AAAAGACTAC AACGTAACGC CTCTCAGAAG GGCGAACTAA
|
Protein sequence | MRFGYIALGT PPIQSLDIRK MPKLTDMQIR AWIKSGERFE GRADGNGLYL RYREADKTPT WRFRYKLAGK SRAMLIGSYS ELSLSKARET AKELSARVAL GYDVAGEKQK HKTEALAKME AEKNAMRVSE LAAEYFERQI LPRWKHPDIL RRRIDKDINP CIGSMKVEDV KPRHIDDMLK GIVDRGAPTI ATDVLRWTRR IFDYGIKRHA LEINPCSAFE VADAGGKEAA RDRWLTRDEL IQLFKAMRTA KGFSRQNEIT FKLLLALCVR KMELCAARWE EFDLDGAVWH LPEERSKNGD PIDIPLPSPA VEWLRELHTF SCNSAWVLPA RKMQNRMIPH IQESTLPVAL AKVRAEMPDV PNFTIHDFRR TARTHLAALG VDPVVAERCL NHRIKGVEGI YNRHQYFDER KAALAQWADL LVALESGKDY NVTPLRRAN
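The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that the space-separated 10-letter blocks are display formatting only. It also relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it allows. The helper names (`clean`, `translate`, `gc_content`) are hypothetical, and only the first 30 bases of the record are used as sample input.

```python
from itertools import product

# Codon table built in TCAG order; stop codons map to "*".
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGAGGTTTG GGTACATTGC TTTGGGTACA"
print(translate(prefix))  # MRFGYIALGT
```

The translated prefix matches the first block of the protein sequence shown above. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.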
|
| |