Gene GYMC61_2158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_2158 
Symbol 
ID8526022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp2177380 
End bp2178861 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content59% 
IMG OID 
Producttwo component transcriptional regulator, AraC family 
Protein accessionYP_003253255 
Protein GI261419573 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAATTGG TCATTGCCGA ACGGGACGAC AAGGAACGAG AAGCGATCCG CTGGCTCGTT 
TCCGCGTATT CGCTGCCGAT CGAGCAGGTG TACACGGCGG CCAATGTGGA GGAGATGATG
GCTCTTCTTG AGCGGGAGGC GCCGGAGCTA TTGTACGTCG AACTGGATAT GATTCCGTAT
GAACGATGGT CAAAGGTGAC AGCCCACGTC CGCTTGTTTT GCCAACGCGT GATTGCCGCA
ACGGCCGAGG CGACGTTTGC GCGGGCGAAA CAAGCCATCG ACTGGCAATG CGTCGATTTG
CTTGTGAAGC CGCTTGAGCC CGCCAAGCTG AAACAAGCGT TGAGGACGGC GGTCTCGCTT
TCCTCCAACA GCCGCTCGAG CCTGACAGCG GGTTTTGACG GCCATGACGA CTACCGCGCA
TTATTTTACG ATGACCCCGC CGCAGCTTCC ACCCATGTAT GGCTTGTGCA GGCGGAACAG
CCTGCCTTGT CTCGGGAAGT GATCCGCTTT TTAACGAGCT ATCCGTTTCG CGGACGGGCG
CGCGTATTGC CGCTCACCCA TATGGCGGTT TGCCTGCTTC CCGATTTGCC GGGCGACGGG
AAAGAAGAGG CGTGGAAGCT GTTGCGCGAT TGGGAAACGG AGCATCATGA GCCGCTCGCG
GTTGTCGTGA TGCCGCCTGA TGGCCGGAAA ACGGTGCGCG GGCAATACCA AGCGGCCCGC
CGGTTGCTGG AGACGACGTT TTTCATTGGC TATCGCCAAG TGATCGCTCC CGCTTCGGAA
TACGAACAGT GGCGTGAGCT CGATCCGTTT TTGACGCCCG AGGAGCAGCG GCAATGGATC
GAGATGCTCG AGCGGTTCGA CCATGAAGGG GTGAAGCGAT GGCTGCGGCG CGAATTTTCC
CATTGGGGGC CGCCGTTTCC GAGCCCGGAG ATGGTGCGGA CGCGGCTGAC GAGCATCTTG
GCGCAAATCC GCCGGTTTAT GAAAACATAC GGCCTTGACC GCGGCGGAAC CGAGCGAGAG
TATATGCGCA TTTTTCAAGA CATTTTGTAC AATCCCGTCT TGTATCGCAT CGTTCAAGAA
TTGATTTTAT TTTTATATCG GTTGTTCCAT GAGGCCAAGC AGGCCGAGGA AGACGCGCGC
GTCGACGCCG TGGAGCGCGG CCTCCGCTAT ATGGAGGCGC ATTTTCGCGA TCCGTCGCTC
ACGCTTGAGC GGGTCGCGGC CGCCGCCGGG CGCAGCCCCG CGTATTTCAG CCATTTGTTG
TCGAAAAAGC GCGGCGTGAC GTTTCGCCAG TGGCTGACCA ACCGGCGGCT TGAGGAAGCG
AAACGGCTGC TTCGGCAGAC GGATTTGTCG ATTAAAGAAA TCGCCGAACA AACCGGGTTT
CGCACTGCCC ATTATTTGAT GCGCGTCTTT AAAGCCGAAC TGAACCAGAC GCCGACCGCC
TACCGCGATG AACAACGGCT GAAACCGTCA TCACCGCGGT AG
 
Protein sequence
MKLVIAERDD KEREAIRWLV SAYSLPIEQV YTAANVEEMM ALLEREAPEL LYVELDMIPY 
ERWSKVTAHV RLFCQRVIAA TAEATFARAK QAIDWQCVDL LVKPLEPAKL KQALRTAVSL
SSNSRSSLTA GFDGHDDYRA LFYDDPAAAS THVWLVQAEQ PALSREVIRF LTSYPFRGRA
RVLPLTHMAV CLLPDLPGDG KEEAWKLLRD WETEHHEPLA VVVMPPDGRK TVRGQYQAAR
RLLETTFFIG YRQVIAPASE YEQWRELDPF LTPEEQRQWI EMLERFDHEG VKRWLRREFS
HWGPPFPSPE MVRTRLTSIL AQIRRFMKTY GLDRGGTERE YMRIFQDILY NPVLYRIVQE
LILFLYRLFH EAKQAEEDAR VDAVERGLRY MEAHFRDPSL TLERVAAAAG RSPAYFSHLL
SKKRGVTFRQ WLTNRRLEEA KRLLRQTDLS IKEIAEQTGF RTAHYLMRVF KAELNQTPTA
YRDEQRLKPS SPR