Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_3569 |
Symbol | |
ID | 8527456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013412 |
Strand | + |
Start bp | 7271 |
End bp | 9031 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | transposase IS4 family protein |
Protein accession | YP_003254595 |
Protein GI | 261420914 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGATCC TAGGAAAAAA CGCCGAATTT CATAGGCACA CGATTTATTC ATTTTTTCTT TTAAAAATAA AGTTTATACG TTATAATATA GGCATGTACA TACGACGAGT CACACGCAAA AACAAGGATG GAACAACCGT TGCTTATCTC CAGCTTGCTC ACAATGAATG GGATCCAAAG GCCAAATATG CGAAAGCGAA GGTGATTTAT TCGTTCGGGC GCGAAGACGA GGTGGATCGC GCCGTCTTGG AACGTCTGGC CAAAAGCATT TCGCGATTCC TTTCTCCTGA GCAGGCTTGG GAAGTCGAAA CGTTGACAGG AGAAGCTTCC GATGACTTTC AATTCCAGTC ATGCAAACAC CTCGGCGGCG TTTGGCTCTT GGATCAGCTC TGGAGACAAC TGGGGTTGGG AGAGATTCTC CACTCCTTGT TTACCTCCCG ACATCACCAG ATTTCGCTGG AACGGCTGAT TTTTGCCATG GTGGCCAATC GCGCCCTTCA TCCGTCAAGC AAGTTGGCGA TGGAGGAGTG GGTGGAGAAA GACGTGTATA TCCCTCACCT TCCTCAAGCC GCCAGCCACC AGTTGTACCG GGCGATGGAT GAGCTGCTGG CCGTGCAGCC GGAATTGGAA CGTCAAGTGT TCCATGCTGT GGCCGATTTA TTGAATTTGG AAGTCGACTT GATTTACTTC GATACAACTT CGTCGTACTT CGAAGTGGAT CCCTCTGAAA CACCGGAAGG AGAATCGCTT CGAAAACAAG GATTCTCGAA AGACAAACGC CCAGACTTGG TTCAAATCGT CATTGGGCTG GCTGTCACCC GGGAAGAAGT CCCGATTCGC GCTTGGGTAT GGCCTGGCAA TACCATGGAC ATGACGGTCA TCAAACAGGT GAAACAAGAC TTGATTGGCT GGAAGCTTGG ACGTGTGATC AGCGTCATGG ACCGCGGCTT TTCCTCTGAA GAGAATTTGC GAATCTTGCA ACAGGCCGGC GGACACTACA TTGTCGGCGA AAAAATGCGA TCCGGCAAAG CCGCCGTCAA AGAGGCCTTA AGCCGTCGCG GACGTTATCA TGAAGTGGAC GAGAATTTGC ACATCAAAGA AATCATCGTC GGCGACGGAG AAGCGCGTCA GCGCTATGTT CTCGTGTACA ATCCCAGCGA AGCCGAACGC CAACGCAAGG AGCGAGAAAA GCTGCTCGAA TCGCTGAAAG AGGAGTTAGA AGGGCTTCGC CAACTCCCAA ACGAAGCCCA TCATAAGGCG ACCTGCCGGC TGCGTTCCCA TCCGTCCTAC GGAAAATACT TGCGCCAGTT GAAGGACGGA ACCCTTCGCA TCGACAAGCA AGCGGTTCGT GACGCGGAAA AGTACGACGG CAAATATCTC ATCCGGACAT CCGATGACAC CTTGTCTGCC GAAGATGTCG CCATCGGGTA TAAGCAGCTG GTGGATATTG AGCAGGCCTT CCGAACATTG AAGTCTACAT TGGAATTGCG ACCTATGTAT CATCGCTTGG AAGACCGCAT TCGGGCGCAT GTGCTGCTCA GTTGGCTGGC TCTCTTGCTG GTTCGGATCG TGGAGATCCG AACCCATGAA TCGTGGCCGA AAGTAAGGGA TGAATGTGAG CGTCTTATGC TTGGACATTT TTCTTCCAAA AACGGCGACC TTTATCAACG AACCGAACTG ACGGCCAAAC AGGCTCAATT CTTTGCGGCT CTAGGGCTGG AGCCTCCTCC GAAGATCCTA GGCATCCATC CTCGCGCCTA G
|
Protein sequence | MRILGKNAEF HRHTIYSFFL LKIKFIRYNI GMYIRRVTRK NKDGTTVAYL QLAHNEWDPK AKYAKAKVIY SFGREDEVDR AVLERLAKSI SRFLSPEQAW EVETLTGEAS DDFQFQSCKH LGGVWLLDQL WRQLGLGEIL HSLFTSRHHQ ISLERLIFAM VANRALHPSS KLAMEEWVEK DVYIPHLPQA ASHQLYRAMD ELLAVQPELE RQVFHAVADL LNLEVDLIYF DTTSSYFEVD PSETPEGESL RKQGFSKDKR PDLVQIVIGL AVTREEVPIR AWVWPGNTMD MTVIKQVKQD LIGWKLGRVI SVMDRGFSSE ENLRILQQAG GHYIVGEKMR SGKAAVKEAL SRRGRYHEVD ENLHIKEIIV GDGEARQRYV LVYNPSEAER QRKEREKLLE SLKEELEGLR QLPNEAHHKA TCRLRSHPSY GKYLRQLKDG TLRIDKQAVR DAEKYDGKYL IRTSDDTLSA EDVAIGYKQL VDIEQAFRTL KSTLELRPMY HRLEDRIRAH VLLSWLALLL VRIVEIRTHE SWPKVRDECE RLMLGHFSSK NGDLYQRTEL TAKQAQFFAA LGLEPPPKIL GIHPRA
|
| |