Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_A0078 |
Symbol | |
ID | 6273404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010660 |
Strand | - |
Start bp | 48513 |
End bp | 50114 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641728735 |
Product | transposase family |
Protein accession | YP_001883126 |
Protein GI | 187734384 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 7.38697e-26 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAATG AACTCCCCGA TGATATTGAG CTGCTTAAAG CCATGTTGCG TAAGCAACAG AGTCGGCTTC GACAGTATGC CTGTCAGGTC GCGGGCTATG AGCAGGAAAT TGAACGGCTG AAAGCGCAAC TCGACAGGTT GCGTCGTATG CTGTTCGGCC AGAGTTCAGA GAAAAAGCGT CATAAGCTTG AAAATCAGAT CCGACAGGCA GAAAAACGAC TGTCGGAACT GGAAAACCGA CTGAACACAG CCAGAAATCT TCTGGAAGAT GCATCGTCAG TCACAGATTC ACCTGACACC AGTCCCCCGT CAGAAAACCC GATCGCCAGT AAGCCTGAAT CCCCGGGACG AAAATCTTCA CGAAAACCGC TGCCGGCAGA ACTTCCCCGG GAGACACATC GCCTTCTGCC TGCTGAAACC AGTTGCCCGG CCTGTGGAGG TGTTCTGAAA GAAATGGGGG AAACAATCTC AGAGCAACTG GATATCATTA ATACCGCCTT TAAAGTTATC GAAACCATAC GTCCCAAACT GGCCTGTAGC CGGTGTGATG TCATCGTTCA GGCACCACTT CCCCCTAAAC CGATCGAACG CGGTTATGCC AGTGCAGGGT TACTTGCACG GATCCTGGTC AGCAAATATA TGGAACATAT CCCTTTATAT CGCCAGTCAG AAATATACGC GCGACAGGGC GTGGAGCTGA GCCGTAATAC CATGGTGCGC TGGGTATCAG AAATGGCAGA CAAACTCCGT CCTCTGTATA TAGCGCTGAA TGACTATGTT CTGGAGGCAG GAAAGGTGCA CGCAGATGAC ACTCCGGTGA AAGTACTGGC CCCGGGGAAC GGAAAGACGA AAACGGGTCG TCTGTGGGTA TACGTCAGGG ATGATCGTAA TGCGGGTTCA TCCCTGCCGG CAGCCGTCTG GTTCGCGTAT TCGGCAGATC GCAAAGGAGA ACATCCGCAG CTCCACCTGG CAAAGTATCA GGGCGTACTG CAGGCTGATG CCTATGCAGG TTATAACGTA CTGTACGAAA CGGGCCGGGT GAAGGAAGCC GGGTGCCTGG CCCACGCCCG CCGAAAAATC CATGACGAGG ATGTGCGCCG TCCGACAGAA ATGACTCAGG AAGCGCTCAG ACGGATAGCA GAGTTATACG ACATAGAAGC GGAGATACGT GGCAGTCCGG CAGAGGAACG GCTTGCAGTC AGAAAAGCCA GAAGCGTCCA GTTGATGCAG TCATTGTACG ACTGGATACA GTTGCAGAGG AAAACGCTGT CGAAACATGC GGAGATGGCG AAGGCGTTCG ACTATATCCT GAATCACTGG AATGCGCTGA ACGAGTTCTG TCGTGACGGC TGGGTGGAAA TAGACAACAA CATCGGTGAA AACGCGTTAC GATCGGTGGC GGTTGGAAGA AAAAATTATC TCTTTTTCGG CTCAGACAAG GGAGGAGAAA GTGCGGCGAT CATCTACAGT CTGCTGGTCA CCTGCAAACA GAACGAAGTG GAGCCGGAGG ACTGGTTGCG CGAAGTGATC GAGAAGCTCA ATGACTGGCC GTCGAACCAA GTGCATGAAC TGCTGCCCTG GAACTTCTCG TCTGTAAAAT AA
|
Protein sequence | MNNELPDDIE LLKAMLRKQQ SRLRQYACQV AGYEQEIERL KAQLDRLRRM LFGQSSEKKR HKLENQIRQA EKRLSELENR LNTARNLLED ASSVTDSPDT SPPSENPIAS KPESPGRKSS RKPLPAELPR ETHRLLPAET SCPACGGVLK EMGETISEQL DIINTAFKVI ETIRPKLACS RCDVIVQAPL PPKPIERGYA SAGLLARILV SKYMEHIPLY RQSEIYARQG VELSRNTMVR WVSEMADKLR PLYIALNDYV LEAGKVHADD TPVKVLAPGN GKTKTGRLWV YVRDDRNAGS SLPAAVWFAY SADRKGEHPQ LHLAKYQGVL QADAYAGYNV LYETGRVKEA GCLAHARRKI HDEDVRRPTE MTQEALRRIA ELYDIEAEIR GSPAEERLAV RKARSVQLMQ SLYDWIQLQR KTLSKHAEMA KAFDYILNHW NALNEFCRDG WVEIDNNIGE NALRSVAVGR KNYLFFGSDK GGESAAIIYS LLVTCKQNEV EPEDWLREVI EKLNDWPSNQ VHELLPWNFS SVK
|
| |