Gene GYMC61_1964 details

Gene Information

Locus tag: GYMC61_1964
Symbol:
ID: 8525828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 1979620
End bp: 1981290
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: DAK2 domain fusion protein YloV
Protein accession: YP_003253063
Protein GI: 261419381
COG category:
COG ID:
TIGRFAM ID:
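
The coordinate and length fields above are consistent with each other, which a short check makes explicit. This is only a sketch: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1979620
end_bp = 1981290

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints "1671 556"
```

Both results match the Gene Length and Protein Length fields in the record.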


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
GTGACAATGA GGACGCTTGA CGGAAGACGA TTTGCCGATA TGGTGCAGCA AGGGGCCGCA 
CATTTGGCGA ACAACGCCAA GACGGTCGAT GCGCTGAACG TCTTTCCGGT TCCAGATGGC
GATACAGGAA CGAACATGAA CTTGTCGATG ACGTCCGGGG CGAAAGAAGT GAAGGCGAAT
GCCTCCGACC ATATCGGCAA CGTCGCCGCG GCGCTGGCGA AAGGGCTGTT GATGGGGGCG
CGCGGCAATT CCGGCGTCAT TTTGTCGCAG CTGTTCCGCG GATTTGCCAA GGCAGTCGAA
GGCAAACAAG CGGTGAACAG CTTCGAATTC GCCGCCGCTT TGCAGGCGGG GGTGGACACA
GCCTATAAGG CGGTGATGAA GCCGGTTGAG GGGACGATCC TCACCGTTGC CAAAGAAGCG
GCGCGCAAGG CGGTGGAGGT GGCAAAAAAA GAACGCGACG TGATCGCCGT GATGGAAGCG
GCGCTCGCCG AGGCGAAAGC GGCGCTTAAG CGCACACCCG AATTGCTCCC GATCTTAAAG
GAAGTCGGTG TCGTCGACAG CGGCGGTCAA GGGCTCGTAT ACATCTATGA AGGGTTTCTT
GCTGCTCTGA AAGGAGAAAT CGTGAGCGCC GCACGCGCTG AGGCGCGGAT GGACGATTTA
GTGAAAATGG TGCACCATCA AAGCGCGCAA AGTCATATTC ATACCGATGA GATCGAGTTT
GGCTACTGCA CGGAGTTCAT GGTCCGTTTT GAGCCGGAAA AGCTGGCCGA GCACCCGTTT
TCCGAAGAAA CGTTCCGCCG CGAGTTAAGC CAGTTCGGCG ACTCGTTGCT TGTTGTCGCG
GATGACGAGC TTGTTAAGGT GCACATCCAC TCGGAAACGC CGGGTGAGGT GCTGACATAC
GGTCAACGCT ACGGCAGCTT GATCAATATT AAAATTGAAA ACATGCGCGA ACAACATGCC
AACATCGTCG GCAAGGAGGC CAAAACGCTG ACTGGTGTTG CCAAAGAGGA AGCAAAGCCG
TACGGCATCG TCGCCGTCGC CATGGGCGCT GGCGTGGCTG AACTGTTTCG GAGCATCGGT
GCCCACGCCA TCATTGAAGG TGGGCAAACG ATGAACCCGA GCACGGAAGA AATCGCTGAT
GCCATCCGCC TCGCCAACGC GGAAACGGTG TTTGTGCTGC CAAACAACAA AAACATTGTG
ATGGCGGCCA AACAAGCGGC AGAGTTGTCT GAACAACGGG TTGTCGTCAT CCCGTCGAAA
ACGGTTCCGC AAGGTCTGGC GGCGCTCTTG GCGTTCAATC CGGCGCAATC GGCCGAGCAA
AATGAGCGGG CGATGACGGC GGCGCTGTCG CGAGTGAAAA CGGGGCAAGT GACATTTTCC
GTGCGCGATA CGACGATTGA CGGCATCGAG ATTCAAAAGG GCGATTACAT GGGGTTATGG
GATGACCGCA TTATTGCCGC TGACAAAGAC AAACTCACCG TAACGAAGCG GCTGCTTGAT
GCGCTCATTG ATGAAGAAAG CGAAATCGTG ACCATTTTGT ACGGCGAAGA CGCAACGGAG
ATCGATGTGG AAACAGTCGT TGCCTATTTG GAAACGAAAC ATGACGGGGT CGAAGTGGAA
GTGCATAACG GAAAGCAGCC GCTGTATCCA TTCATCATTT CCGTCGAATA A
 
Protein sequence
MTMRTLDGRR FADMVQQGAA HLANNAKTVD ALNVFPVPDG DTGTNMNLSM TSGAKEVKAN 
ASDHIGNVAA ALAKGLLMGA RGNSGVILSQ LFRGFAKAVE GKQAVNSFEF AAALQAGVDT
AYKAVMKPVE GTILTVAKEA ARKAVEVAKK ERDVIAVMEA ALAEAKAALK RTPELLPILK
EVGVVDSGGQ GLVYIYEGFL AALKGEIVSA ARAEARMDDL VKMVHHQSAQ SHIHTDEIEF
GYCTEFMVRF EPEKLAEHPF SEETFRRELS QFGDSLLVVA DDELVKVHIH SETPGEVLTY
GQRYGSLINI KIENMREQHA NIVGKEAKTL TGVAKEEAKP YGIVAVAMGA GVAELFRSIG
AHAIIEGGQT MNPSTEEIAD AIRLANAETV FVLPNNKNIV MAAKQAAELS EQRVVVIPSK
TVPQGLAALL AFNPAQSAEQ NERAMTAALS RVKTGQVTFS VRDTTIDGIE IQKGDYMGLW
DDRIIAADKD KLTVTKRLLD ALIDEESEIV TILYGEDATE IDVETVVAYL ETKHDGVEVE
VHNGKQPLYP FIISVE
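
The sequences above are printed in blocks of ten characters, so they need to be joined before any computation such as checking the reported length or GC content. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, not functions of the source database. It is shown on the first line of the gene sequence only.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = (
    "GTGACAATGA GGACGCTTGA CGGAAGACGA TTTGCCGATA TGGTGCAGCA AGGGGCCGCA"
)
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence to the same helpers would give a value to compare with the GC content field, which the record reports as 56%.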