Gene GYMC61_1923 details


Gene Information

Locus tag: GYMC61_1923
Symbol:
ID: 8525787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 1943286
End bp: 1944944
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: transposase
Protein accession: YP_003253028
Protein GI: 261419346
COG category:
COG ID:
TIGRFAM ID:
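
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1943286
END_BP = 1944944
GENE_LENGTH_BP = 1659
PROTEIN_LENGTH_AA = 552


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(GENE_LENGTH_BP)
print(computed_gene_length, computed_protein_length)
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.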


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGTGA AAGTCAAAGG GGTCTATCGT AATGGTTATT TGAATATCAT AAGTGTTCTA 
GTCAAAAAAC TCGACATCCC CCACTTGATT GACCATCTTG TGCCCGTCGA TCCACAGTGC
CAAACGCGAG TCAGCGATGC CGTTCAAGCC ATCCTTTACC ATCTATTTGA CGGCCGGCAA
GCCCTGGTTC ACTTAGAACG ATGGGCTCAG GAGATCGATC TAGAGAAACT CATCCGTCCC
GGTCTCCAGC CTTCCTGGTT GAACGACGAT GCCTTGGCCC GTCATCTCGA CCGCTTGTAT
GAGGCCGATA TTCACAAGGT CATCAGCACT TGCTTGATTC ACATTTATCG CAAAGAAGGC
CTTTCCCTCC GAGCCTTTCA CGCTGATACG ACGGACAAGA CGGTTTACGG AGCGTATGAG
TCGGCCTCAT TAGAGGCTTT GCAGATCACA CATGGCTATA ACCGCCATCA TCGTTGGCAA
AAACCAATCG GCTTCGGACT GGTCGGCAAC GAGGATGGCA TCCCGTTTTA CGGCGATGTG
CATGACGGCA ATCTATCGGA TAAGGCGTGG AATCCTGAGG TGCTGTCGCG GATCCACGAA
CAGTTCAAAC AAGCCAAGAT CGACGACGAA TGGATTTACG TGGCCGATTC CGCCGCGATG
ACGAAAGACA CCTTGGCGCA GACAAAAGCC GCCCGTGCCT TTTTGATCAC CAGAGGCCCT
TCGTCGCTTC GGATCGTGAA ACAGGCGCTC GCAGAGGCCG ATTCGCCTCA CATCCCGTGG
AGCGAACCCT TTACCCTGGC GGAGAGAAAC GGCGCCACGC ATCGGGTATG GGAAACGGCC
TCGACCTATG AAGGCCACCC GGTTCGGCTG ATCGTCGTCG AATCGAGCGC GCTCGACCAG
CGCAAAGGAA AGACGCTCGA AAAAGAGCGG GTCAAAGAAG CGGAGCTTCT TCGCGAGGAA
CAAGTCCGTT GGGAGCGCCA CCCCTTCTCC TGCCGGGAAG ACGCCGAACA AGCCTTGACC
TCCCTCAAGG CGTCCCTTCG CCCTCGGTTT CATCGAGTGG AGGCCGCGGT CGAAGAGATC
GTGCGTCCGA AAAAACGGCG AGGACGGCCG AAAAAAGGGG CGGAACCCGA GACGGAGACG
CTGTACACCC TTCGCCTAAA CGTCGAATTC GACCAACAGG CATGGGAACA GGCAAGACGG
AAAGCGTCCC GGTTTGTCCT CGTCACGACC GTTCCGAAGG AATGGAAGGG CCAACAAATG
GATGCCCAAG AGATCTTGAA GCTGTATAAA GGGCAAATCT CGGTGGAAAT GAATTTCGCT
TTTTTGAAAG ATCCGTTTTT CACGGATGAG ATTTATGTCA AAAAACCAGA ACGGGTCGCC
GTATTGGGCT ATCTATTTCT CTTGGCCTTG GCGATTTACC GCGTTTTTCA GCGCCGGGTG
CGTCAGTTCA TCACACCAGA ACGCCCGTTA AAGGGCGCGG GAGGCCGCAA GCTGACCCGG
CCAACGGGAC AGGTGATTTT TCAGCTGTTT CAATATGTGA ACGTCATCCT GCTGGAGCTG
CCAGACGGGC GCATCCAACG CGCACTCGAT CGCTCCCTCA ACCCGGATCA GCGAAGGATT
CTGCAGGGAT TGGGCATGGA TGAGAGCATC TACGTCTAA
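
The gene sequence is printed in blocks of 10 bases, so it needs to be joined before any computation such as GC content. Here is a minimal sketch, assuming the sequence contains only A, C, G and T plus whitespace. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the string in the example is a short toy input, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Toy input in the same block layout as the record (not the real gene).
toy = "ATGC ATGC\nGGCC"
print(gc_percent(clean_sequence(toy)))
```

Applied to the full gene sequence, the result could be compared with the GC content field of the record, which appears to be rounded to a whole percent.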
 
Protein sequence
MNVKVKGVYR NGYLNIISVL VKKLDIPHLI DHLVPVDPQC QTRVSDAVQA ILYHLFDGRQ 
ALVHLERWAQ EIDLEKLIRP GLQPSWLNDD ALARHLDRLY EADIHKVIST CLIHIYRKEG
LSLRAFHADT TDKTVYGAYE SASLEALQIT HGYNRHHRWQ KPIGFGLVGN EDGIPFYGDV
HDGNLSDKAW NPEVLSRIHE QFKQAKIDDE WIYVADSAAM TKDTLAQTKA ARAFLITRGP
SSLRIVKQAL AEADSPHIPW SEPFTLAERN GATHRVWETA STYEGHPVRL IVVESSALDQ
RKGKTLEKER VKEAELLREE QVRWERHPFS CREDAEQALT SLKASLRPRF HRVEAAVEEI
VRPKKRRGRP KKGAEPETET LYTLRLNVEF DQQAWEQARR KASRFVLVTT VPKEWKGQQM
DAQEILKLYK GQISVEMNFA FLKDPFFTDE IYVKKPERVA VLGYLFLLAL AIYRVFQRRV
RQFITPERPL KGAGGRKLTR PTGQVIFQLF QYVNVILLEL PDGRIQRALD RSLNPDQRRI
LQGLGMDESI YV
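
The protein sequence can be related back to the gene sequence by codon translation. Below is a minimal sketch using the amino acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative. The input is the first 30 bases of the gene sequence of this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence codon by codon, keeping '*' for stops."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases of the gene sequence, with block spacing removed.
first_codons = "ATGAATGTGAAAGTCAAAGGGGTCTATCGT"
print(translate(first_codons))  # MNVKVKGVYR
```

The output matches the first 10 residues of the protein sequence. The last 9 bases of the gene, TACGTCTAA, translate to YV followed by a stop, which agrees with the end of the protein.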