Gene GYMC61_1111 details

Gene Information

Locus tag: GYMC61_1111
Symbol:
ID: 8524949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 1119171
End bp: 1120451
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 58%
IMG OID:
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_003252255
Protein GI: 261418573
COG category:
COG ID:
TIGRFAM ID:
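
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the record: it assumes the Start bp and End bp values are 1-based and inclusive (which matches the reported gene length) and that the protein length excludes the stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields.
START_BP = 1119171
END_BP = 1120451


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # 1281 426
```

Passing the full gene sequence from the Sequence section to `gc_percent` gives a figure to compare with the GC content field.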


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATTGTA CACAAAACTA TAAAATTGAT CAAGTAACCG AACAAACGCT TGTGGTGGGC 
ATCGATATCG CGAAACGAAC CCACTACGCC TGCTTCGTGG ATGACCGAGG GCGCGTGCTT
CGCAAATCGT TCCCGATCTT CCAGTCGAAA GAGGGCTTTC GCCAGCTGTA TGAAGCGATT
CAGGAGGCGA TGCAAGCGTT CGGAAAGCCG CAGGTGATCG TCGCGGTGGA ACCGACCGGG
CACTACTGGT TGAACCTGGC CTACTTCCTC GAGGAAAAAG GGATCCCGTT GGTGATGGTC
AACCCGGCGC ATGTGTGCCT GTCGAAAGAA CTCGATGACA ACCTGCCGAC GAAACATGAC
GCCAAAGACG CCCTGGTCAT CGCCAGACTG GCGAAAGACG GACGATTCCT CGTCCCCCGG
CTGTTGCACG AGATCGAAGC CGATTTGCGC GTCGGGAGCA CGCTCAAAGA GAAGCTCCGC
AAGGAACAGG CAGCGGTGAA AAACGCGATC ATCCGCTGGA CCGACCGGTA TTTTCCGGAG
TTTTGGACGG TGTTTCGCGA CGTGGGAAAA ACGGCGCTTT CGGTGCTGGA GTGGACGCCG
CTTCCGGCTG ATATGGCGGG CCGATCCGCC GAGGAGCTCA TCGAGGTGCA CCGGCAAAGC
GAAGGGTTGA AAAGCCCGCA GAAAACCAAG ATTCGGGCGT TGATCAACGC CGCAAAGGAC
TCGATTGGGG TGACGGAAGG GGCGACGATG GCCCGATTTG AGATCGCCGC GCTCGTCCGC
CGATACCGCC AATTGGAGGG CGAAATCGCG GCGTTGGACG CCGAGTTGAA GGTATTGGTT
CAAACGACGA TGGAGTATCA ATGGCTGAAA ACGGTCGACG GGTTGGGAGA CGCCACGATC
ATCGATCTGC TGGCGGAGAT TGGCAGCTTC GCCCATTATC GGGACCCGCG TCAATTGGTG
AAGTTGGCGG GCCTGACGCT CAAGGAGAAT TCCTCCGGCC AGCGCAAAGG GCAAAAGCAG
ATCTCCAAAC GGGGACGGAA ACGGCTGCGA TCGGTGCTGT TTCGGGCGAT GATTCCGCTG
ATCCGGCACA ACGAGGCGTT TCGCGAGCTG CACGACTATT ACACGACGCG CCCCGTCAAT
CCGCTGACCG GAAAGCAGTC CATCGTCGCG TTATGCCGAA AGCTGTTGAA TGTGCTGTTT
GCGATTTGTA CGAAGAAACA AGCCTTTGAC GCCGAGCGAA TGAAGCAGGA CGTCTTGTCC
CAGGTGCACC GGGCGGCCTA A
 
Protein sequence
MHCTQNYKID QVTEQTLVVG IDIAKRTHYA CFVDDRGRVL RKSFPIFQSK EGFRQLYEAI 
QEAMQAFGKP QVIVAVEPTG HYWLNLAYFL EEKGIPLVMV NPAHVCLSKE LDDNLPTKHD
AKDALVIARL AKDGRFLVPR LLHEIEADLR VGSTLKEKLR KEQAAVKNAI IRWTDRYFPE
FWTVFRDVGK TALSVLEWTP LPADMAGRSA EELIEVHRQS EGLKSPQKTK IRALINAAKD
SIGVTEGATM ARFEIAALVR RYRQLEGEIA ALDAELKVLV QTTMEYQWLK TVDGLGDATI
IDLLAEIGSF AHYRDPRQLV KLAGLTLKEN SSGQRKGQKQ ISKRGRKRLR SVLFRAMIPL
IRHNEAFREL HDYYTTRPVN PLTGKQSIVA LCRKLLNVLF AICTKKQAFD AERMKQDVLS
QVHRAA
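
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The `translate` helper is a hypothetical name, and the input shown is only the first 30 bases of the gene sequence.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, grouped as in the record.
    print(translate("ATGCATTGTA CACAAAACTA TAAAATTGAT"))  # MHCTQNYKID
```

The output matches the first ten residues of the protein sequence. Translating the full 1281 bp sequence the same way is a reasonable consistency check against the 426 aa entry.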