Gene GYMC61_2989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_2989 
Symbol 
ID8526867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp3047000 
End bp3048280 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content58% 
IMG OID 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_003254033 
Protein GI261420351 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTGTA CACAAAATCA GAAAATCAAT CAAGTCACCG AACAAACGCT TGTCGTGGGC 
ATCGATATCG CGAAACGAAC CCACTACGCC TGCTTCGTGG ATGACCGGGG GCGTGTGCTT
CGCAAATCGT TCCCGATCTT CCAGTCGAAA GAGGGCTTTC GCCAGCTGTA TGAAGCGATT
CAGGAGGCGA TGCAAGCGTT CGGAAAGCCG CAGGTGATCG TCGCGGTGGA GCCGACCGGG
CACTACTGGT TGAACCTGGC CTACTTCCTC GAGGAGCACG GGATCCCGTT GGTCATGGTC
AACCCGGCGC ATGTGTGCCG GTCGAAAGAA CTCGATGACA ACCTGCCGAC GAAACATGAC
GCCAAAGACG CCCTAGTCAT CGCCAGACTG GCGAAAGACG GACGATTCCT CGTCCCCCGG
CTGCTGCACG AGATCGAAGC CGATTTGCGC GTCGGGAGCA CGCTCAAAGA GAAGCTCCGC
AAGGAACAGA CGGCGGTGAA AAACGCGATC GTCCGCTGGA CCGACCGATA TTTTCCAGAG
TTTTGGACGG TGTTTCGCGA CCTGGGGAAA ACGGCGCTTT CGGTGCTGGA GTGGACGCCG
CTTCCGGCTG ATATGGCGGG CCGATCCGCC GAGGAGCTCA TCGAAGTGTA CCGGCAAAGC
GAAGGGCTGA AATGCCCGCA GAAGGCCAAA ATTCAGGCGT TGATCGACGC CGCGAAGGAC
TCGATTGGGG TGACGGAAGG GACGACGATG GCCCGGTTTG AAATCGCCGC GCTCGTCCGC
CGATACCGCC AATTGGAGGC TGAGATCGCC GCGTTGGACG CCGAGTTGAA GGCGTTGGTT
CAAACGACGA TGGAGTATCA ATGGCTGAAA ACGGTCGACG GGTTGGGAGA CGCCACGATC
ATCGATCTGC TGGCGGAGAT CGGCAGCTTC GCCCACTATC GGGACCCGCG TCAATTGGTG
AAGTTGGCGG GCCTGACGCT CAAAGAGAAC TCCTCCGGCC AGCGCAAAGG GCAAAAGCAC
ATCTCCAAAC GGGGACGGAA ACGGCTGCGA TCGGTGCTGT TTCGGGCGAT GATTCCGCTG
ATCCGGCACA ACGAGGCGTT TCGCGAGCTG CATGACTATT ACACGACCCG CCCCGTCAAT
CCGCTGACCG GAAAGCAGTC CATCGTCGCA TTATGCCGGA AGCTGTTGAA TGTGCTGTTT
GCGATTTGTA CGAAGAAACA AGCCTTTGAC GCGGAGCGAA TGAAACAGGA CGTCTTGTCC
CAGGTGCAAC GGGCGGCCTA A
 
Protein sequence
MNCTQNQKIN QVTEQTLVVG IDIAKRTHYA CFVDDRGRVL RKSFPIFQSK EGFRQLYEAI 
QEAMQAFGKP QVIVAVEPTG HYWLNLAYFL EEHGIPLVMV NPAHVCRSKE LDDNLPTKHD
AKDALVIARL AKDGRFLVPR LLHEIEADLR VGSTLKEKLR KEQTAVKNAI VRWTDRYFPE
FWTVFRDLGK TALSVLEWTP LPADMAGRSA EELIEVYRQS EGLKCPQKAK IQALIDAAKD
SIGVTEGTTM ARFEIAALVR RYRQLEAEIA ALDAELKALV QTTMEYQWLK TVDGLGDATI
IDLLAEIGSF AHYRDPRQLV KLAGLTLKEN SSGQRKGQKH ISKRGRKRLR SVLFRAMIPL
IRHNEAFREL HDYYTTRPVN PLTGKQSIVA LCRKLLNVLF AICTKKQAFD AERMKQDVLS
QVQRAA