Gene GYMC61_3305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_3305 
Symbol 
ID8527193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp3366659 
End bp3368200 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content58% 
IMG OID 
Producttwo component transcriptional regulator, AraC family 
Protein accessionYP_003254340 
Protein GI261420658 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAGTGG CGATCGTCGA TGATGAAGCG TTGGAGCGAA GAGCGCTTTG CAAAATGATC 
CATGACCACC TTCCGGACAT CGAGGTGGTC GCCGAGGGCG CCAACGGGCG CGAGGCGATT
GACATTGCCA AGCAATATCG CCCGGATGTG ATGCTTATAG ACATCAAAAT GCCCGGCCTT
GACGGGCTGC AAGCGATCGA AGCGATTCGC CAAGACGGCC TCGATCTGGA ATTCATTATT
GTGTCGGCGT TTGATTTGTT CGACTATGCG AAACAAGCGA TGCGGTTTGG CGTCAAGGAA
TATTTATTGA AACCGAGCCG GAAGGAGGAG GTCATTTCGG CTTTGGAACG GGTCAGCCAA
GAAGTGGCGG CCAAGCGGCG ACACGAGGAG AGCAGCCGCC AGCTCGAGGA GCAGATTCGC
CGGTTGCAGA CGCTTGTGGA AAGCGAATGG CTGTCTGTCC TCATGACAGA AGACGTATCG
GCTGATGAGT GGGAGCGGTG GAAGGAACTG CTGCCGTTTT CGATTGCGTC GGGGATGTTC
CTCGTCATTC AGTTCCCGGA TGCAGGGGTG GCTGACGAAT GGAAATCATG GCTGGACAAG
CAGCTTAGCG GGAAGGCGCC AACGCGCTAT TGGATCGGGC GGATGGCGAA CCGGCGCCTG
CCGGTCTTGT TTTTCCGCAG CCCAAACGAT GGCGAGCCGG CCTGGAAGCC CACCGTCCAG
GCATTGGCGC TCGATTTGGC GCGGCAGTTT TCGGCCCGGT ACGGCGCCGC GCTGTATATC
GGGCTCGGCT CCCCGTTTTC CCGCCTTGAC CAACTTCGTT CCTCGTACTA TGAGGCGCTG
TCGGCCGCTC ATTATTACGC CGACCGGCAA AAAGCACAAG TGGGGTTTCT GCCGGCGGAA
GCGACGCGCG CCGGCGGGGA AGCGGAACGG GATAAACAGC TGTTTGAAGC GCTGCGCCTT
GGCGACATCG AGCAAGCCCG GATGATTGGT CTGACGTATA TAGAGGAACT GGCCTCTTCT
CATTCACTGC CGGCCGCCGG TCGCAAAGCG GAAGAGACCT TTGTGTGGCT CGGGCGTCTT
CTATCGGAGC TGGGCATTCG TTATGAGCGG CTCGCTTCCT TTGCTTCCTG CCGGTCGGCG
GCAGAATTGA AGCGGGCGGC GCTTGATGAA CTGGACCGCA TCGCCGCTGA TCTCGAGGTT
TGGCGCCAGC AGCAAGCGTA TGGCAAACTC GGCAAGGCGA AAGACTACAT TGACCGCCAC
TACGCCGAGC CGCTGACGCT TGAGGAGGTG GCGGAACAAG CGGGCATCAG TCCGTACTAC
TTCAGCAAAC TGTTCAAAGA GCATTTTGGC ATCACCTTTA TCGACTACGT GACGAACGTG
CGCATCGAAC GGGCGAAAGA AGCGCTGGCT GAGACGGATC AAAGCTTAAA AGAAATTTGT
TTTTCAGTCG GCTACAACGA TCCCAACTAT TTCAGCCGCG TCTTTAAAAA GCAGACCGGC
CTGTCGCCGA GCGAATACCG GAAAAAAGTA CAGGCGCGCT GA
 
Protein sequence
MKVAIVDDEA LERRALCKMI HDHLPDIEVV AEGANGREAI DIAKQYRPDV MLIDIKMPGL 
DGLQAIEAIR QDGLDLEFII VSAFDLFDYA KQAMRFGVKE YLLKPSRKEE VISALERVSQ
EVAAKRRHEE SSRQLEEQIR RLQTLVESEW LSVLMTEDVS ADEWERWKEL LPFSIASGMF
LVIQFPDAGV ADEWKSWLDK QLSGKAPTRY WIGRMANRRL PVLFFRSPND GEPAWKPTVQ
ALALDLARQF SARYGAALYI GLGSPFSRLD QLRSSYYEAL SAAHYYADRQ KAQVGFLPAE
ATRAGGEAER DKQLFEALRL GDIEQARMIG LTYIEELASS HSLPAAGRKA EETFVWLGRL
LSELGIRYER LASFASCRSA AELKRAALDE LDRIAADLEV WRQQQAYGKL GKAKDYIDRH
YAEPLTLEEV AEQAGISPYY FSKLFKEHFG ITFIDYVTNV RIERAKEALA ETDQSLKEIC
FSVGYNDPNY FSRVFKKQTG LSPSEYRKKV QAR