Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_1964 |
Symbol | |
ID | 8525828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | + |
Start bp | 1979620 |
End bp | 1981290 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | DAK2 domain fusion protein YloV |
Protein accession | YP_003253063 |
Protein GI | 261419381 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACAATGA GGACGCTTGA CGGAAGACGA TTTGCCGATA TGGTGCAGCA AGGGGCCGCA CATTTGGCGA ACAACGCCAA GACGGTCGAT GCGCTGAACG TCTTTCCGGT TCCAGATGGC GATACAGGAA CGAACATGAA CTTGTCGATG ACGTCCGGGG CGAAAGAAGT GAAGGCGAAT GCCTCCGACC ATATCGGCAA CGTCGCCGCG GCGCTGGCGA AAGGGCTGTT GATGGGGGCG CGCGGCAATT CCGGCGTCAT TTTGTCGCAG CTGTTCCGCG GATTTGCCAA GGCAGTCGAA GGCAAACAAG CGGTGAACAG CTTCGAATTC GCCGCCGCTT TGCAGGCGGG GGTGGACACA GCCTATAAGG CGGTGATGAA GCCGGTTGAG GGGACGATCC TCACCGTTGC CAAAGAAGCG GCGCGCAAGG CGGTGGAGGT GGCAAAAAAA GAACGCGACG TGATCGCCGT GATGGAAGCG GCGCTCGCCG AGGCGAAAGC GGCGCTTAAG CGCACACCCG AATTGCTCCC GATCTTAAAG GAAGTCGGTG TCGTCGACAG CGGCGGTCAA GGGCTCGTAT ACATCTATGA AGGGTTTCTT GCTGCTCTGA AAGGAGAAAT CGTGAGCGCC GCACGCGCTG AGGCGCGGAT GGACGATTTA GTGAAAATGG TGCACCATCA AAGCGCGCAA AGTCATATTC ATACCGATGA GATCGAGTTT GGCTACTGCA CGGAGTTCAT GGTCCGTTTT GAGCCGGAAA AGCTGGCCGA GCACCCGTTT TCCGAAGAAA CGTTCCGCCG CGAGTTAAGC CAGTTCGGCG ACTCGTTGCT TGTTGTCGCG GATGACGAGC TTGTTAAGGT GCACATCCAC TCGGAAACGC CGGGTGAGGT GCTGACATAC GGTCAACGCT ACGGCAGCTT GATCAATATT AAAATTGAAA ACATGCGCGA ACAACATGCC AACATCGTCG GCAAGGAGGC CAAAACGCTG ACTGGTGTTG CCAAAGAGGA AGCAAAGCCG TACGGCATCG TCGCCGTCGC CATGGGCGCT GGCGTGGCTG AACTGTTTCG GAGCATCGGT GCCCACGCCA TCATTGAAGG TGGGCAAACG ATGAACCCGA GCACGGAAGA AATCGCTGAT GCCATCCGCC TCGCCAACGC GGAAACGGTG TTTGTGCTGC CAAACAACAA AAACATTGTG ATGGCGGCCA AACAAGCGGC AGAGTTGTCT GAACAACGGG TTGTCGTCAT CCCGTCGAAA ACGGTTCCGC AAGGTCTGGC GGCGCTCTTG GCGTTCAATC CGGCGCAATC GGCCGAGCAA AATGAGCGGG CGATGACGGC GGCGCTGTCG CGAGTGAAAA CGGGGCAAGT GACATTTTCC GTGCGCGATA CGACGATTGA CGGCATCGAG ATTCAAAAGG GCGATTACAT GGGGTTATGG GATGACCGCA TTATTGCCGC TGACAAAGAC AAACTCACCG TAACGAAGCG GCTGCTTGAT GCGCTCATTG ATGAAGAAAG CGAAATCGTG ACCATTTTGT ACGGCGAAGA CGCAACGGAG ATCGATGTGG AAACAGTCGT TGCCTATTTG GAAACGAAAC ATGACGGGGT CGAAGTGGAA GTGCATAACG GAAAGCAGCC GCTGTATCCA TTCATCATTT CCGTCGAATA A
|
Protein sequence | MTMRTLDGRR FADMVQQGAA HLANNAKTVD ALNVFPVPDG DTGTNMNLSM TSGAKEVKAN ASDHIGNVAA ALAKGLLMGA RGNSGVILSQ LFRGFAKAVE GKQAVNSFEF AAALQAGVDT AYKAVMKPVE GTILTVAKEA ARKAVEVAKK ERDVIAVMEA ALAEAKAALK RTPELLPILK EVGVVDSGGQ GLVYIYEGFL AALKGEIVSA ARAEARMDDL VKMVHHQSAQ SHIHTDEIEF GYCTEFMVRF EPEKLAEHPF SEETFRRELS QFGDSLLVVA DDELVKVHIH SETPGEVLTY GQRYGSLINI KIENMREQHA NIVGKEAKTL TGVAKEEAKP YGIVAVAMGA GVAELFRSIG AHAIIEGGQT MNPSTEEIAD AIRLANAETV FVLPNNKNIV MAAKQAAELS EQRVVVIPSK TVPQGLAALL AFNPAQSAEQ NERAMTAALS RVKTGQVTFS VRDTTIDGIE IQKGDYMGLW DDRIIAADKD KLTVTKRLLD ALIDEESEIV TILYGEDATE IDVETVVAYL ETKHDGVEVE VHNGKQPLYP FIISVE
|
| |