Gene GYMC61_3533 details


Gene Information

Locus tag: GYMC61_3533
Symbol:
ID: 8527421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 3593003
End bp: 3594223
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: HtrA2 peptidase
Protein accession: YP_003254560
Protein GI: 261420878
COG category:
COG ID:
TIGRFAM ID:
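
The listed gene and protein lengths follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both assumptions are consistent with the values in this record but are not stated in it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3593003, 3594223)
print(length, protein_length(length))  # prints: 1221 406
```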


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGATACT ACGACGACCA TTATGAGCCG TACGAACAAA CGAGGAGGAA GCGGCGCAGC 
GGATCGTTTG TTTCCGCGCT TGTCGGCGCC GTGTTAGGAG GGTTGCTCGT CCTCATGTCC
ATTCCGGCGC TTTCCCGTTG GGATATCCTC CCGTATGATG TTGTGCCGAA TCAAAGAGCA
GAGGAAGAGC CAAAAACAGA GGAAAATGGA ACTCCACCGA TTCGGCAGAG TGTTTCCGTT
GATGTGACGA CGGCGGTGAC AAAGGCGATC GACCAAGTGT CGGATGCGGT TGTCGGCGTT
GTCAACATTC AAGAAGCCAG CTTCTGGTCG CAAGGAGGCG AGGCTGGCGT CGGATCGGGC
GTCATTTACA AGAAAGCGGG AGGCCGGGCG TTTATCGTCA CGAACCATCA TGTTGTTGAA
AACGCCAGTC AGCTGGAAGT GAGCCTCAAA GATGGGACGA GAGTGCCGGC GAAGTTGCTG
GGCAGCGATG TGCTTATGGA TTTAGCCGTC TTGGAAATTG ACGCGAAGCA TGTGAAAAAA
GTCGCCCAGT TCGGCAACTC CGATACGGTG AAGCCAGGGG AGCCGGTCAT TGCCATCGGC
AACCCGCTCG GCTTGCAGTT TGCCGGCTCG GTGACACAAG GCATTATTTC CGGAACGAAC
CGGACGGTCG AAGTCGATTT GGACCAAGAC GGCGCTCCAG ACTGGAATGC AGAAGTATTG
CAGACAGATG CGGCGATCAA CCCGGGCAAC AGCGGCGGCG CTCTCGTCAA TATCAAAGGG
CAAGTCATCG GCATCAACTC GATGAAAATC GCCCAAGAGG CGGTTGAAGG CATCGGGTTC
GCGATCCCGA TCAACACGGC CATTCCGATC ATTTCGGACT TGGAAAAATA CGGACAAGTG
CGCCGTCCGT ATATGGGCGT TGAACTTCGC TCGCTGAGCG ACATCCCATC GTACCATTTG
CAGGCGACGC TCCATTTGCC GCCGAACGTA ACGGAAGGAG CGGCGGTCAT TCAAGTCGTG
CCGATGTCGC CGGCCGCACA GGCGGGCTTA AAACAGTTTG ATGTCATCGT GGCGCTTGAC
GGCGAAAAAA TCCGCAACGT GCTTGATTTG CGCAAATATT TATATACGAA AAAATCGATC
GGTGACCGGA TGGAAGTTAC GTTTTATCGT GATGGGAAAA AACGCACGGT CACGATGAAG
CTGGCGCGCG AGTCGTATTA A
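
A GC content figure such as the one in the Gene Information section can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the full sequence length; the record does not say how its value was calculated or rounded. The helper name `gc_content` is hypothetical. Whitespace is stripped first because the sequence is displayed in space-separated blocks of 10 bases.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in seq, ignoring spaces and line breaks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# Example on the first displayed row only; pass the full gene sequence
# to compare against the value listed in the record.
first_row = "ATGGGATACT ACGACGACCA TTATGAGCCG TACGAACAAA CGAGGAGGAA GCGGCGCAGC"
print(f"{gc_content(first_row):.0%}")
```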
 
Protein sequence
MGYYDDHYEP YEQTRRKRRS GSFVSALVGA VLGGLLVLMS IPALSRWDIL PYDVVPNQRA 
EEEPKTEENG TPPIRQSVSV DVTTAVTKAI DQVSDAVVGV VNIQEASFWS QGGEAGVGSG
VIYKKAGGRA FIVTNHHVVE NASQLEVSLK DGTRVPAKLL GSDVLMDLAV LEIDAKHVKK
VAQFGNSDTV KPGEPVIAIG NPLGLQFAGS VTQGIISGTN RTVEVDLDQD GAPDWNAEVL
QTDAAINPGN SGGALVNIKG QVIGINSMKI AQEAVEGIGF AIPINTAIPI ISDLEKYGQV
RRPYMGVELR SLSDIPSYHL QATLHLPPNV TEGAAVIQVV PMSPAAQAGL KQFDVIVALD
GEKIRNVLDL RKYLYTKKSI GDRMEVTFYR DGKKRTVTMK LARESY
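
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below uses only the Python standard library. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; since this gene begins with ATG, the standard codon assignments are used directly. The names `CODON_TABLE` and `translate` are illustrative, not part of any database API.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence shown above.
print(translate("ATGGGATACT ACGACGACCA TTATGAGCCG"))  # prints: MGYYDDHYEP
print(translate("CTGGCGCGCG AGTCGTATTA A"))           # prints: LARESY
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing the spaces between blocks) gives a quick consistency check on the record.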