Gene GYMC61_3569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_3569 
Symbol 
ID8527456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013412 
Strand
Start bp7271 
End bp9031 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content51% 
IMG OID 
Producttransposase IS4 family protein 
Protein accessionYP_003254595 
Protein GI261420914 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGATCC TAGGAAAAAA CGCCGAATTT CATAGGCACA CGATTTATTC ATTTTTTCTT 
TTAAAAATAA AGTTTATACG TTATAATATA GGCATGTACA TACGACGAGT CACACGCAAA
AACAAGGATG GAACAACCGT TGCTTATCTC CAGCTTGCTC ACAATGAATG GGATCCAAAG
GCCAAATATG CGAAAGCGAA GGTGATTTAT TCGTTCGGGC GCGAAGACGA GGTGGATCGC
GCCGTCTTGG AACGTCTGGC CAAAAGCATT TCGCGATTCC TTTCTCCTGA GCAGGCTTGG
GAAGTCGAAA CGTTGACAGG AGAAGCTTCC GATGACTTTC AATTCCAGTC ATGCAAACAC
CTCGGCGGCG TTTGGCTCTT GGATCAGCTC TGGAGACAAC TGGGGTTGGG AGAGATTCTC
CACTCCTTGT TTACCTCCCG ACATCACCAG ATTTCGCTGG AACGGCTGAT TTTTGCCATG
GTGGCCAATC GCGCCCTTCA TCCGTCAAGC AAGTTGGCGA TGGAGGAGTG GGTGGAGAAA
GACGTGTATA TCCCTCACCT TCCTCAAGCC GCCAGCCACC AGTTGTACCG GGCGATGGAT
GAGCTGCTGG CCGTGCAGCC GGAATTGGAA CGTCAAGTGT TCCATGCTGT GGCCGATTTA
TTGAATTTGG AAGTCGACTT GATTTACTTC GATACAACTT CGTCGTACTT CGAAGTGGAT
CCCTCTGAAA CACCGGAAGG AGAATCGCTT CGAAAACAAG GATTCTCGAA AGACAAACGC
CCAGACTTGG TTCAAATCGT CATTGGGCTG GCTGTCACCC GGGAAGAAGT CCCGATTCGC
GCTTGGGTAT GGCCTGGCAA TACCATGGAC ATGACGGTCA TCAAACAGGT GAAACAAGAC
TTGATTGGCT GGAAGCTTGG ACGTGTGATC AGCGTCATGG ACCGCGGCTT TTCCTCTGAA
GAGAATTTGC GAATCTTGCA ACAGGCCGGC GGACACTACA TTGTCGGCGA AAAAATGCGA
TCCGGCAAAG CCGCCGTCAA AGAGGCCTTA AGCCGTCGCG GACGTTATCA TGAAGTGGAC
GAGAATTTGC ACATCAAAGA AATCATCGTC GGCGACGGAG AAGCGCGTCA GCGCTATGTT
CTCGTGTACA ATCCCAGCGA AGCCGAACGC CAACGCAAGG AGCGAGAAAA GCTGCTCGAA
TCGCTGAAAG AGGAGTTAGA AGGGCTTCGC CAACTCCCAA ACGAAGCCCA TCATAAGGCG
ACCTGCCGGC TGCGTTCCCA TCCGTCCTAC GGAAAATACT TGCGCCAGTT GAAGGACGGA
ACCCTTCGCA TCGACAAGCA AGCGGTTCGT GACGCGGAAA AGTACGACGG CAAATATCTC
ATCCGGACAT CCGATGACAC CTTGTCTGCC GAAGATGTCG CCATCGGGTA TAAGCAGCTG
GTGGATATTG AGCAGGCCTT CCGAACATTG AAGTCTACAT TGGAATTGCG ACCTATGTAT
CATCGCTTGG AAGACCGCAT TCGGGCGCAT GTGCTGCTCA GTTGGCTGGC TCTCTTGCTG
GTTCGGATCG TGGAGATCCG AACCCATGAA TCGTGGCCGA AAGTAAGGGA TGAATGTGAG
CGTCTTATGC TTGGACATTT TTCTTCCAAA AACGGCGACC TTTATCAACG AACCGAACTG
ACGGCCAAAC AGGCTCAATT CTTTGCGGCT CTAGGGCTGG AGCCTCCTCC GAAGATCCTA
GGCATCCATC CTCGCGCCTA G
 
Protein sequence
MRILGKNAEF HRHTIYSFFL LKIKFIRYNI GMYIRRVTRK NKDGTTVAYL QLAHNEWDPK 
AKYAKAKVIY SFGREDEVDR AVLERLAKSI SRFLSPEQAW EVETLTGEAS DDFQFQSCKH
LGGVWLLDQL WRQLGLGEIL HSLFTSRHHQ ISLERLIFAM VANRALHPSS KLAMEEWVEK
DVYIPHLPQA ASHQLYRAMD ELLAVQPELE RQVFHAVADL LNLEVDLIYF DTTSSYFEVD
PSETPEGESL RKQGFSKDKR PDLVQIVIGL AVTREEVPIR AWVWPGNTMD MTVIKQVKQD
LIGWKLGRVI SVMDRGFSSE ENLRILQQAG GHYIVGEKMR SGKAAVKEAL SRRGRYHEVD
ENLHIKEIIV GDGEARQRYV LVYNPSEAER QRKEREKLLE SLKEELEGLR QLPNEAHHKA
TCRLRSHPSY GKYLRQLKDG TLRIDKQAVR DAEKYDGKYL IRTSDDTLSA EDVAIGYKQL
VDIEQAFRTL KSTLELRPMY HRLEDRIRAH VLLSWLALLL VRIVEIRTHE SWPKVRDECE
RLMLGHFSSK NGDLYQRTEL TAKQAQFFAA LGLEPPPKIL GIHPRA