Gene GYMC61_1559 details


Gene Information

Locus tag: GYMC61_1559
Symbol:
ID: 8525422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 1588906
End bp: 1590573
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: transposase IS4 family protein
Protein accession: YP_003252678
Protein GI: 261418996
COG category:
COG ID:
TIGRFAM ID:
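
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (this is consistent with the listed gene length, but the record does not state it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1588906, 1590573)
print(length)                  # 1668
print(protein_length(length))  # 555
```

Both values agree with the Gene Length and Protein Length fields of this record.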


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTACATAC GACGAGTCAC ACGCAAAAAC AAGGATGGAA CAACCGTTGC TTATCTCCAG 
CTTGCTCACA ATGAATGGGA TCCAAAGGCC AAATATGCGA AAGCGAAGGT GATTTATTCG
TTCGGGCGCG AAGACGAGGT GGATCGCGCC GTCTTGGAAC GTCTGGCCAA AAGCATTTCG
CGATTCCTTT CTCCTGAGCA GGCTTGGGAA GTCGAAACGT TGACAGGAGA AGCTTCCGAT
GACTTTCAAT TCCAGTCATG CAAACACCTC GGCGGCGTTT GGCTCTTGGA TCAGCTCTGG
AGACAACTGG GGTTGGGAGA GATTCTCCAC TCCTTGTTTA CCTCCCGACA TCACCAGATT
TCGCTGGAAC GGCTGATTTT TGCCATGGTG GCCAATCGCG CCCTTCATCC GTCAAGCAAG
TTGGCGATGG AGGAGTGGGT GGAGAAAGAC GTGTATATCC CTCACCTTCC TCAAGCCGCC
AGCCACCAGT TGTACCGGGC GATGGATGAG CTGCTGGCCG TGCAGCCGGA ATTGGAACGT
CAAGTGTTCC ATGCTGTGGC CGATTTATTG AATTTGGAAG TCGACTTGAT TTACTTCGAT
ACGACTTCGT CGTACTTCGA AGTGGATCCC TCTGAAACAC CGGAAGGAGA ATCGCTTCGA
AAACAAGGAT TCTCGAAAGA CAAACGCCCA GACTTGGTTC AAATCGTCAT TGGGCTGGCT
GTCACCCGGG AAGGAGTCCC GATTCGCGCT TGGGTATGGC CTGGCAATAC CATGGACATG
ACGGTCATCA AACAGGTGAA ACAAGACTTG ATTGGCTGGA AGCTTGGACG TGTGATCAGC
GTCATGGACC GCGGCTTTTC CTCTGAAGAG AATTTGCGAA TCTTGCAACA GGCCGGCGGA
CACTACATTG TCGGCGAAAA AATGCGATCC GGCAAAGCCG CCGTCAAAGA GGCCTTAAGC
CGTCGCGGAC GTTATCATGA AGTGGACGAG AATTTGCACA TCAAAGAAAT CATCGTCGGC
GACGGAGAAG CGCGTCAGCG CTATGTTCTC GTGTACAATC CCAGCGAAGC CGAACGCCAA
CGCAAGGAGC GAGAAAAGCT GCTCGAATCG CTGAAAGAGG AGTTAGAAGG GCTTCGCCAA
CTCCCAAACG AAGCCCATCA TAAGGCGACC TGCCGGCTGC GTTCCCATCC GTCCTACGGA
AAATACTTGC GCCAGTTGAA GGACGGAACC CTTCGCATCG ACAAGCAAGC GGTTCGTGAC
GCGGAAAAGT ACGACGGCAA ATATCTCATC CGGACATCCG ATGACACCTT GTCTGCCGAA
GATGTCGCCA TCGGGTATAA GCAGCTGGTG GATATTGAGC AGGCCTTCCG AACATTGAAG
TCTACATTGG AATTGCGACC TATGTATCAT CGCTTGGAAG ACCGCATTCG GGCGCATGTG
CTGCTCAGTT GGCTGGCTCT CTTGCTGGTT CGGATCGTGG AGATCCGAAC CCATGAATCG
TGGCCGAAAG TAAGGGATGA ATGTGAGCGT CTTATGCTTG GACATTTTTC TTCCAAAAAC
GGCGACCTTT ATCAACGAAC CGAACTGACG GCCAAACAGG CTCAATTCTT TGCGGCTCTA
GGGCTGGAGC CTCCTCCGAA GATCCTAGGC ATCCATCCTC GCGCCTAG
 
Protein sequence
MYIRRVTRKN KDGTTVAYLQ LAHNEWDPKA KYAKAKVIYS FGREDEVDRA VLERLAKSIS 
RFLSPEQAWE VETLTGEASD DFQFQSCKHL GGVWLLDQLW RQLGLGEILH SLFTSRHHQI
SLERLIFAMV ANRALHPSSK LAMEEWVEKD VYIPHLPQAA SHQLYRAMDE LLAVQPELER
QVFHAVADLL NLEVDLIYFD TTSSYFEVDP SETPEGESLR KQGFSKDKRP DLVQIVIGLA
VTREGVPIRA WVWPGNTMDM TVIKQVKQDL IGWKLGRVIS VMDRGFSSEE NLRILQQAGG
HYIVGEKMRS GKAAVKEALS RRGRYHEVDE NLHIKEIIVG DGEARQRYVL VYNPSEAERQ
RKEREKLLES LKEELEGLRQ LPNEAHHKAT CRLRSHPSYG KYLRQLKDGT LRIDKQAVRD
AEKYDGKYLI RTSDDTLSAE DVAIGYKQLV DIEQAFRTLK STLELRPMYH RLEDRIRAHV
LLSWLALLLV RIVEIRTHES WPKVRDECER LMLGHFSSKN GDLYQRTELT AKQAQFFAAL
GLEPPPKILG IHPRA
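
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that, not the database's own method. It builds the codon table from the NCBI base ordering (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons, which does not matter here because this gene starts with ATG. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# NCBI ordering: first base varies slowest, third base fastest, bases in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
print(translate("ATGTACATAC GACGAGTCAC ACGCAAAAAC"))  # MYIRRVTRKN
```

Passing the full gene sequence to `translate` in the same way should yield the 555-residue protein listed under Protein sequence.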