Gene GYMC61_1995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_1995 
Symbol 
ID8525859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp2008533 
End bp2010608 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content53% 
IMG OID 
ProductDNA topoisomerase I 
Protein accessionYP_003253093 
Protein GI261419411 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGACT ACTTAGTTAT CGTCGAATCG CCAACGAAAG CGAAGACGAT CGAACGATAC 
TTGGGAAAAA AATATACAGT CAAAGCGTCG ATGGGCCACG TCCGCGATTT GCCAAAAAGC
CAAATGGGCG TTGATATAGA CCACGGCTAC GAGCCGAAAT ACATTACGAT CCGCGGCAAA
GGCCAGGTGA TCAAAGAGTT GAAAACGGCG GCGAAAAAAG CGAAAAAAGT GTTTCTTGCC
GCCGACCCCG ATCGCGAAGG AGAGGCGATT GCCTGGCATT TGGCCCACAT GCTCGATCTG
GACATTCACT CTGAATGCCG CGTTGTCTTT CATGAGATCA CGAAGGATGC TATTCAGCAA
TCGTTTCAGC ACCCGCGCGC GATCAATATG AATCTTGTTG ACGCCCAGCA GGCGCGGCGC
GTGCTCGACC GCCTCGTCGG CTATAACATC AGCCCGCTCT TATGGAAAAA GGTGAAAAAA
GGATTAAGCG CCGGCCGCGT CCAATCGGTG GCGCTGCGCC TCATCATCGA TCGGGAACGA
GAAATTCGCG AATTTCAGCC GGAAGAATAT TGGACGATTC AAGCGACGTT TCAAAAAGAA
GGAGAGACGT TTGCCGCCTC GTTTTACAGT ATTGACGGAC AAAAGCGCGA TTTAAAGACG
GAGGCGGATG TGAAAGCCGT GCTGGACCGC TTAAACGGAA CGGCATTTGT GGTGAAAACG
GTGACAAAAC GAGAGCGCAA GCGCAGCCCC GTGCCGCCGT TTACGACGTC GTCGCTTCAG
CAGGAAGCGG CGCGCAAACT GAATTTCCGG ACGAAAAAGA CGATGATGAT CGCCCAGCAG
CTGTACGAAG GAATCGATCT TGGCAGCCAA GGGACGGTCG GTTTGATCAC GTATATGCGC
ACCGATTCAA CGCGCGTCGC CGAGACGGCG CAGCAAGAGG CGGCGGCATA TATCGAGGCG
ACGTTCGGCG CTATGTATGT CAACCAGGAA AAGCGGAAAG AGAAGAAAAG CACGAACGCC
CAAGACGCCC ATGAAGCGAT CCGCCCGACA TCGGCATTTC GCGATCCGGA CAAAGTGAAA
CCGTATTTGA CGCGCGACCA GTTTCGGTTG TATAAACTCA TTTGGGAACG GTTCATCGCC
AGTCAAATGG CGGCCGCCGT GCTTGACACC ATGAGCGTCG AGTTGGAAAA CAACGGTGTC
GTGTTCCGCG CCAGCGGCTC GAAGGTGAAA TTTCCAGGTT TTATGAAAGT GTATATTGAA
GGAACGGACG ATCAAACGGA AGAGCAGGAC CGTATCCTCC CGGATTTGGA AGAAGAGGAA
ACGGTTGAGA GCGAAACGAT CGAATCGAAG CAGCACTTTA CTCAGCCGCC GCCGCGCTAC
ACGGAAGCCC GCCTCGTCAA AACGTTGGAG GAACTCGGCA TCGGCCGCCC GTCGACGTAT
GCCCCGACGC TTGATACGAT CCAAAAACGC AACTATGTCG TCCTTGAGAA CAAGCGGTTT
GTCCCGACCG AGCTCGGCGA AATCGTTGTA GAGCTGATGC TCGAATTTTT CCCGGAAATT
ATCGATGTCG AGTTTACGGC GAAAATGGAA AAAGAATTGG ACGAAATTGA AGAAGGGAAA
GTGGAATGGA TCAAAGTCGT CGACGAGTTT TACCGCGAAT TTGAAAAACG GTTGAAAGTG
GCGGAAAAAG AAATGCGCGC AGTTGAGATT AAGGATGAGC CGGCCGGCAT TGACTGCGAG
GTGTGCGGCA GTCCGATGGT GTACAAAATG GGCCGTTTCG GCAAGTTTAT CGCCTGCTCG
AATTTTCCGG AATGCCGCCA TACGAAGCCG ATTGTCAAAG AAATCGGCGT CAAGTGCCCG
AAATGCCGCG AGGGCAACAT CGTCGAGCGC AGCACGAAAA GAAAGCGGGT GTTTTACGGC
TGCGACCGTT TCCCGGATTG CGACTTCGTC TCGTGGGATA AACCGCTCGC CCGCCCGTGT
CCGAAATGCG CGGGGCTGCT TGTGGAGAAA GAGCTGAAAA AAGGTGTGCA AGTGCAATGC
ACGGCGTGCG ACTATGAGGA GCGGCTAGAA GCCTGA
 
Protein sequence
MSDYLVIVES PTKAKTIERY LGKKYTVKAS MGHVRDLPKS QMGVDIDHGY EPKYITIRGK 
GQVIKELKTA AKKAKKVFLA ADPDREGEAI AWHLAHMLDL DIHSECRVVF HEITKDAIQQ
SFQHPRAINM NLVDAQQARR VLDRLVGYNI SPLLWKKVKK GLSAGRVQSV ALRLIIDRER
EIREFQPEEY WTIQATFQKE GETFAASFYS IDGQKRDLKT EADVKAVLDR LNGTAFVVKT
VTKRERKRSP VPPFTTSSLQ QEAARKLNFR TKKTMMIAQQ LYEGIDLGSQ GTVGLITYMR
TDSTRVAETA QQEAAAYIEA TFGAMYVNQE KRKEKKSTNA QDAHEAIRPT SAFRDPDKVK
PYLTRDQFRL YKLIWERFIA SQMAAAVLDT MSVELENNGV VFRASGSKVK FPGFMKVYIE
GTDDQTEEQD RILPDLEEEE TVESETIESK QHFTQPPPRY TEARLVKTLE ELGIGRPSTY
APTLDTIQKR NYVVLENKRF VPTELGEIVV ELMLEFFPEI IDVEFTAKME KELDEIEEGK
VEWIKVVDEF YREFEKRLKV AEKEMRAVEI KDEPAGIDCE VCGSPMVYKM GRFGKFIACS
NFPECRHTKP IVKEIGVKCP KCREGNIVER STKRKRVFYG CDRFPDCDFV SWDKPLARPC
PKCAGLLVEK ELKKGVQVQC TACDYEERLE A