Gene GYMC61_1066 details


Gene Information

Locus tag: GYMC61_1066
Symbol:
ID: 8524890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 1064403
End bp: 1066358
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 48%
IMG OID:
Product: sulfatase
Protein accession: YP_003252213
Protein GI: 261418531
COG category:
COG ID:
TIGRFAM ID:
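
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1064403
end_bp = 1066358

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an unspliced open reading frame whose last codon
# is a stop codon, which contributes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1956
print(protein_length)   # 651
```

Both results agree with the Gene Length (1956 bp) and Protein Length (651 aa) fields.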


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGGCAA TTTGGGAAAA ATGGTTGAAA AAATGTCGTT CTCTCCCTAA CCAATACATT 
GGCTTTTTTA TTTTTGCCGT GCTCTTATTT TGGTTGAAGA CGTACGCCGC CTATTTAGCG
GAATTTAACC TTGGCATCAG CAACTCCATG CAGGAGTTTT TATTGTTCAT CAACCCGATC
AGCTCGGCGG TATTCTTCCT TGGGTTGGCG CTGTTGGCCA AGGAAACGCG TGTGTACAAA
TGGATCATTA TTATTAATTT CATCTTATCG TTCATTCTAT ACGCCAATAT CGTTTATTAT
CGTTTTTTCA GCGATTTTAT TACATTCCCG ACGTTGACGC AAACGAAAAA CTTCGGCGAT
CTCGGCAGAA GCATTTGGGA GTTGCTCCGC TGGTATGACG TGTTCTATTT CTTGGATACG
ATCATTTTGG CGGTGATCGT TTTCTCGAAG CGATTCTCGC TCCCGGAAGT GCAGGCCGGC
CGATTCAAAA AAGGCGCCAT TTTCGCTTCG GCCATTCTTA TGTTCAGCAT CAACTTGGCG
CTCGCTGAGA CCGACCGCCC GCAGCTCTTG ACAAGAACGT TCGACCGCAA CTATATCGTC
AAATATTTAG GCGTGTACAA CTATTTGATT TACGATGCGT TCCAAAGCAT GAAATCATCG
ACGCAGCGGG CGTTCGCAAA CAAAAGCGAC ATCACGACCG TGCTGAACTA TGTGCAGGCG
ACGTATGCCA AACCGAACCC GAAATATTTC GGCGTGGCGA AAGGGAAAAA CGTCATTTAC
ATTCATTTAG AGTCGCTGCA AAACTTTGTG ATTAACTATA AGTTGAACGG TGAAGAAGTC
ACCCCGTTCT TAAACTCGCT CACCCGCGAT CCGAACACGT TCTATTTCGA TAACTTCTTC
CATCAAACAG GACAAGGGAA AACGTCGGAT GCGGAGTTTA TGCTCGAAAA CTCGCTGTTT
GGCTTGCCGC AAGGCGCTGT CTTTACAACG AAAGGACAAA ACACGTATCA GGCGGCTCCG
GCCATTTTGC ACCAATACGG CTATACAAGC GCCGTCTTCC ACGGCAACTA CAAAACGTTC
TGGAACCGCG ATGAAATTTA CAAGTCGTTC GGCTTTGACC ATTTCTTTGA CGCCAGCTAC
TACGATATGA ACGACGAGGA CGTCTTGAAC TACGGCCTGA AAGACAAACC GTTCTTCCGG
GAGTCGATCC CGCTATTAGA AACATTGAAA GAACCGTTCT ATGTGAAATT TATTACGCTG
TCGAATCACT TCCCATACCC GATCAGCGAG GAAGATGCGA CGATCCCGCC GGCGGCGACC
GGGGATGGGA CAGTCGACCG ATATTTCCAA ACGGCCCGCT ATTTGGACGA GGCGGTGAAG
GAGTTCTTTG ACTACTTGAA AAAATCGGGC CTGTACGACC GCTCGGTCAT CATTTTGTAC
GGCGACCATT ACGGCATTTC GGAAAATCAT AACAAAGCCA TGGCGCAAAT TTTAGGAAAA
GAAATTACGC CGTATGAACA TGCGCAATTG CAGCGGGTGC CGCTGTTCAT CCACGTGCCG
GGCATAAAAG GCGGCGTCAT TCACGAGTTT GGCGGCCAAA TCGATTTGTT GCCGACGGTC
TTGCACCTGC TGGGCATTGA TACAAAAAAT TACGTCCATT TTGGAACGGA TTTGCTGTCA
CCTGAACATC AAGAAATCGT TCCGTTCCGC AACGGCGACT TTGTCACGCC GAAGGTGACA
GCGGTCAACG GCAAGTACTA TGACACGAAA ACAGGCGAAC CTCTTGAAAG CACGCCGGAA
ATTCAGCGGC TCGAACAAAT CGTCCGTACG AAGCTTGACC TATCGGATAA AGTCGTCTAC
GGCGATTTGC TCCGGTTCTA CACCCCGAAA GGCTTCAAGC CGGTCGATCC GTCAAAATAT
GATTACAATA ACCGTGAAGA GGGAAGCGAT CAATGA
 
Protein sequence
MKAIWEKWLK KCRSLPNQYI GFFIFAVLLF WLKTYAAYLA EFNLGISNSM QEFLLFINPI 
SSAVFFLGLA LLAKETRVYK WIIIINFILS FILYANIVYY RFFSDFITFP TLTQTKNFGD
LGRSIWELLR WYDVFYFLDT IILAVIVFSK RFSLPEVQAG RFKKGAIFAS AILMFSINLA
LAETDRPQLL TRTFDRNYIV KYLGVYNYLI YDAFQSMKSS TQRAFANKSD ITTVLNYVQA
TYAKPNPKYF GVAKGKNVIY IHLESLQNFV INYKLNGEEV TPFLNSLTRD PNTFYFDNFF
HQTGQGKTSD AEFMLENSLF GLPQGAVFTT KGQNTYQAAP AILHQYGYTS AVFHGNYKTF
WNRDEIYKSF GFDHFFDASY YDMNDEDVLN YGLKDKPFFR ESIPLLETLK EPFYVKFITL
SNHFPYPISE EDATIPPAAT GDGTVDRYFQ TARYLDEAVK EFFDYLKKSG LYDRSVIILY
GDHYGISENH NKAMAQILGK EITPYEHAQL QRVPLFIHVP GIKGGVIHEF GGQIDLLPTV
LHLLGIDTKN YVHFGTDLLS PEHQEIVPFR NGDFVTPKVT AVNGKYYDTK TGEPLESTPE
IQRLEQIVRT KLDLSDKVVY GDLLRFYTPK GFKPVDPSKY DYNNREEGSD Q
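
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 60 bases of the gene. This is a minimal sketch, not the database's own pipeline: it assumes the standard amino acid assignments (which translation table 11 shares with the standard code) and treats an initiating GTG, TTG, or ATG as methionine, which is a subset of the start codons allowed by table 11. The names `translate`, `CODON_TABLE`, `START_CODONS`, and `FIRST_LINE` are hypothetical helpers introduced for illustration.

```python
# Build the codon table from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only these alternative initiators are handled here; an
# initiating codon is read as methionine regardless of its usual meaning.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above (six blocks of ten bases).
FIRST_LINE = "GTGAAGGCAA TTTGGGAAAA ATGGTTGAAA AAATGTCGTT CTCTCCCTAA CCAATACATT"

print(translate(FIRST_LINE))  # MKAIWEKWLKKCRSLPNQYI
```

The output matches the first two blocks of the protein sequence (MKAIWEKWLK KCRSLPNQYI), and the leading GTG is read as M because it is the initiating codon.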