Gene Information |
Locus tag | GYMC61_1066 |
Symbol | |
ID | 8524890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | - |
Start bp | 1064403 |
End bp | 1066358 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003252213 |
Protein GI | 261418531 |
COG category | |
COG ID | |
TIGRFAM ID | |
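The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the original record: the function names are hypothetical, and it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes one terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for GYMC61_1066
length = gene_length_bp(1064403, 1066358)
print(length)                              # 1956
print(expected_protein_length_aa(length))  # 651
```

Under these assumptions the results agree with the Gene Length (1956 bp) and Protein Length (651 aa) fields listed above.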
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGAAGGCAA TTTGGGAAAA ATGGTTGAAA AAATGTCGTT CTCTCCCTAA CCAATACATT GGCTTTTTTA TTTTTGCCGT GCTCTTATTT TGGTTGAAGA CGTACGCCGC CTATTTAGCG GAATTTAACC TTGGCATCAG CAACTCCATG CAGGAGTTTT TATTGTTCAT CAACCCGATC AGCTCGGCGG TATTCTTCCT TGGGTTGGCG CTGTTGGCCA AGGAAACGCG TGTGTACAAA TGGATCATTA TTATTAATTT CATCTTATCG TTCATTCTAT ACGCCAATAT CGTTTATTAT CGTTTTTTCA GCGATTTTAT TACATTCCCG ACGTTGACGC AAACGAAAAA CTTCGGCGAT CTCGGCAGAA GCATTTGGGA GTTGCTCCGC TGGTATGACG TGTTCTATTT CTTGGATACG ATCATTTTGG CGGTGATCGT TTTCTCGAAG CGATTCTCGC TCCCGGAAGT GCAGGCCGGC CGATTCAAAA AAGGCGCCAT TTTCGCTTCG GCCATTCTTA TGTTCAGCAT CAACTTGGCG CTCGCTGAGA CCGACCGCCC GCAGCTCTTG ACAAGAACGT TCGACCGCAA CTATATCGTC AAATATTTAG GCGTGTACAA CTATTTGATT TACGATGCGT TCCAAAGCAT GAAATCATCG ACGCAGCGGG CGTTCGCAAA CAAAAGCGAC ATCACGACCG TGCTGAACTA TGTGCAGGCG ACGTATGCCA AACCGAACCC GAAATATTTC GGCGTGGCGA AAGGGAAAAA CGTCATTTAC ATTCATTTAG AGTCGCTGCA AAACTTTGTG ATTAACTATA AGTTGAACGG TGAAGAAGTC ACCCCGTTCT TAAACTCGCT CACCCGCGAT CCGAACACGT TCTATTTCGA TAACTTCTTC CATCAAACAG GACAAGGGAA AACGTCGGAT GCGGAGTTTA TGCTCGAAAA CTCGCTGTTT GGCTTGCCGC AAGGCGCTGT CTTTACAACG AAAGGACAAA ACACGTATCA GGCGGCTCCG GCCATTTTGC ACCAATACGG CTATACAAGC GCCGTCTTCC ACGGCAACTA CAAAACGTTC TGGAACCGCG ATGAAATTTA CAAGTCGTTC GGCTTTGACC ATTTCTTTGA CGCCAGCTAC TACGATATGA ACGACGAGGA CGTCTTGAAC TACGGCCTGA AAGACAAACC GTTCTTCCGG GAGTCGATCC CGCTATTAGA AACATTGAAA GAACCGTTCT ATGTGAAATT TATTACGCTG TCGAATCACT TCCCATACCC GATCAGCGAG GAAGATGCGA CGATCCCGCC GGCGGCGACC GGGGATGGGA CAGTCGACCG ATATTTCCAA ACGGCCCGCT ATTTGGACGA GGCGGTGAAG GAGTTCTTTG ACTACTTGAA AAAATCGGGC CTGTACGACC GCTCGGTCAT CATTTTGTAC GGCGACCATT ACGGCATTTC GGAAAATCAT AACAAAGCCA TGGCGCAAAT TTTAGGAAAA GAAATTACGC CGTATGAACA TGCGCAATTG CAGCGGGTGC CGCTGTTCAT CCACGTGCCG GGCATAAAAG GCGGCGTCAT TCACGAGTTT GGCGGCCAAA TCGATTTGTT GCCGACGGTC TTGCACCTGC TGGGCATTGA TACAAAAAAT TACGTCCATT TTGGAACGGA TTTGCTGTCA CCTGAACATC AAGAAATCGT TCCGTTCCGC AACGGCGACT TTGTCACGCC GAAGGTGACA GCGGTCAACG GCAAGTACTA TGACACGAAA ACAGGCGAAC CTCTTGAAAG CACGCCGGAA 
ATTCAGCGGC TCGAACAAAT CGTCCGTACG AAGCTTGACC TATCGGATAA AGTCGTCTAC GGCGATTTGC TCCGGTTCTA CACCCCGAAA GGCTTCAAGC CGGTCGATCC GTCAAAATAT GATTACAATA ACCGTGAAGA GGGAAGCGAT CAATGA
|
Protein sequence | MKAIWEKWLK KCRSLPNQYI GFFIFAVLLF WLKTYAAYLA EFNLGISNSM QEFLLFINPI SSAVFFLGLA LLAKETRVYK WIIIINFILS FILYANIVYY RFFSDFITFP TLTQTKNFGD LGRSIWELLR WYDVFYFLDT IILAVIVFSK RFSLPEVQAG RFKKGAIFAS AILMFSINLA LAETDRPQLL TRTFDRNYIV KYLGVYNYLI YDAFQSMKSS TQRAFANKSD ITTVLNYVQA TYAKPNPKYF GVAKGKNVIY IHLESLQNFV INYKLNGEEV TPFLNSLTRD PNTFYFDNFF HQTGQGKTSD AEFMLENSLF GLPQGAVFTT KGQNTYQAAP AILHQYGYTS AVFHGNYKTF WNRDEIYKSF GFDHFFDASY YDMNDEDVLN YGLKDKPFFR ESIPLLETLK EPFYVKFITL SNHFPYPISE EDATIPPAAT GDGTVDRYFQ TARYLDEAVK EFFDYLKKSG LYDRSVIILY GDHYGISENH NKAMAQILGK EITPYEHAQL QRVPLFIHVP GIKGGVIHEF GGQIDLLPTV LHLLGIDTKN YVHFGTDLLS PEHQEIVPFR NGDFVTPKVT AVNGKYYDTK TGEPLESTPE IQRLEQIVRT KLDLSDKVVY GDLLRFYTPK GFKPVDPSKY DYNNREEGSD Q
|
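The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is only an illustration under stated assumptions: `translate_cds` is a hypothetical helper, the codon table is built from the standard amino acid string used by NCBI translation table 11 (the table named in the record), and the listed gene sequence is assumed to be the coding strand, already reverse-complemented for this minus-strand gene. In table 11, alternative start codons such as GTG are read as methionine when they initiate translation, which is why the sequence begins with GTG while the protein begins with M.

```python
from itertools import product

BASES = "TCAG"
# Amino acid string for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by translation table 11
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, is_cds_start: bool = True) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Spaces (as used in the 10-base blocks of the record) are ignored.
    If is_cds_start is True, a permitted start codon in first position
    is translated as M.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon: end of translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in the record
print(translate_cds("GTGAAGGCAA TTTGGGAAAA ATGGTTGAAA"))  # MKAIWEKWLK
# Last 36 bases of the gene sequence, ending in the TGA stop codon
print(translate_cds("GATTACAATA ACCGTGAAGA GGGAAGCGAT CAATGA", is_cds_start=False))  # DYNNREEGSDQ
```

Both fragments reproduce the corresponding ends of the protein sequence shown in the record.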