Gene GYMC61_0301 details

Gene Information

Locus tag: GYMC61_0301
Symbol:
ID: 8524107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 314086
End bp: 315459
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: transposase IS4 family protein
Protein accession: YP_003251475
Protein GI: 261417793
COG category:
COG ID:
TIGRFAM ID:
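
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the listed numbers agree with that reading) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(314086, 315459)
print(length)                     # 1374
print(protein_length_aa(length))  # 457
```

Both results match the Gene Length and Protein Length fields of this record.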


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGATT TCCCGATTCG GTTTGTGTTG ACAGATGAAG CCATTACCCC AAGTGCGGGG 
CTTGCCCTCG TTGGCTACTT GCTCCATCAA ACGAAACTGA ATAAACGAGT AAACGCGCTT
CGGCCTCCTA CCGTTCGTCG AGACGTGCAC ATTTCCCATA GCGACGTCAT TCGCTCGATG
ATCGGCTTGC TTGCCACAGG AAAAACGGAC TTTGATCATA TTGAAGCATA TCGTCAGGAC
GATATCTTTT CGACATCGAT GGGCATTCAG CACGTGCCTT CCTCCCCAAC GTTGAGACAG
CGACTCGATC AGCTCGCTTG TCTTCCGATG ACCGAAACGA TTCTTTGGGA GGAGTCCATG
CGTCTGTTGA TTCAACGACA TGCCACCTTG TCCCCTTGTT GGGCCAAAGG AAAGACGACA
TGGCTTCCCC TTGATATAGA TGGCTCCCCA TTTGACAACT CCGATACGAA AAAAGAAGGA
GTGAGTCGAA CGTATAAAGG ATTTGACGGT TTTACGCCGT TGTTTGCGTA TGCGGGGAAG
GAAGGCTATA TCGTTCATGC CGAGTTGCGT CCTGGAAAAC AACATGTGCA AGACAACATG
CCTTCGTTTT TAACCACCGC CATCCGTCGA GCTCGTCAAC TGACCTCGTC TCGTCTGCTT
GTTCGCATGG ATGCAGGAAA CGACGCGGAA GCAAACGTGC ACGTGTGTCT AAAGGAAGAC
GTGGACTTTG TCATCAAACG AAACTTGCGC CGAGAATCGA AAGCGCTTTG GTTCCAGATC
GCTTCGCAAA AGGGCAAACG CGTCGATGAT GGACAAACAG AAGGAGTCCA AACGTATGAG
TTATGCCTTC CACAGACGGC AGCGATCGAT GGAAATACGT ATACGTACGT TCAAGTCACC
CAAGTGACGG AACGGACGAT GGAACGCAAT GGACAGGTGA TGCTCGTTCC TAATTATGAA
GTGGAAAGCT ATTGGGTGCG GCTCAAAGGA TACGAGCATG TTCGAATGAG CGATGTGCTC
GCGTTGTATC ATGATCATGC GACATGCGAA CAGTTTCATA GCGAACTCAA GAGCGACTTA
GATTTAGAGC GGCTTCCATC TGGGAAGATG AAAACGAATG CGCTCGTGTT GGTCATGGGA
GCCTTCGTGT ACAACCTTCT TCGCCTGATC GGACAAGATC TATTAAGCGA CCCGAGACAT
CCATTACATC ACAAAGTGAA ACGCCGCCGC ATCAAGACGA TCATTCAGAC GGTGATCACG
ATGGCAGGGC GACTCGTTCG CCGATCACGG CAACTATGGA TGAAACTGAC GCGAAGGAGC
GGGTACAGTA TACTCCTACT GAATGTGTAT CAAAAATGGA AAGAAGCAAG ATAA
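
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. A minimal sketch of how one might do that and recompute the GC content field follows. The helper names are hypothetical, and only the first line of the sequence is used here to keep the example short; the full sequence would be passed in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGAAAGATT TCCCGATTCG GTTTGTGTTG ACAGATGAAG CCATTACCCC AAGTGCGGGG"
seq = clean_sequence(first_line)
print(len(seq))                   # number of bases in the first line
print(round(gc_percent(seq), 1))  # GC percentage of that line only
```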
 
Protein sequence
MKDFPIRFVL TDEAITPSAG LALVGYLLHQ TKLNKRVNAL RPPTVRRDVH ISHSDVIRSM 
IGLLATGKTD FDHIEAYRQD DIFSTSMGIQ HVPSSPTLRQ RLDQLACLPM TETILWEESM
RLLIQRHATL SPCWAKGKTT WLPLDIDGSP FDNSDTKKEG VSRTYKGFDG FTPLFAYAGK
EGYIVHAELR PGKQHVQDNM PSFLTTAIRR ARQLTSSRLL VRMDAGNDAE ANVHVCLKED
VDFVIKRNLR RESKALWFQI ASQKGKRVDD GQTEGVQTYE LCLPQTAAID GNTYTYVQVT
QVTERTMERN GQVMLVPNYE VESYWVRLKG YEHVRMSDVL ALYHDHATCE QFHSELKSDL
DLERLPSGKM KTNALVLVMG AFVYNLLRLI GQDLLSDPRH PLHHKVKRRR IKTIIQTVIT
MAGRLVRRSR QLWMKLTRRS GYSILLLNVY QKWKEAR
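
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration rather than the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted), and it assumes the input contains only the unambiguous bases A, C, G and T. The `translate` helper is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # tolerate block-formatted input
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First thirty bases of the gene sequence in this record.
print(translate("ATGAAAGATT TCCCGATTCG GTTTGTGTTG"))  # MKDFPIRFVL
```

The output matches the first ten residues of the protein sequence listed above.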