Gene GYMC61_1065 details


Gene Information

Locus tag: GYMC61_1065
Symbol:
ID: 8524889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 1063057
End bp: 1064385
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 59%
IMG OID:
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003252212
Protein GI: 261418530
COG category:
COG ID:
TIGRFAM ID:
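
The coordinate and length fields in this record agree with each other, which a short check can confirm. The sketch below assumes 1-based inclusive coordinates and a gene length that includes one stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1063057, 1064385)
print(length)                  # 1329
print(protein_length(length))  # 442
```

Both values match the Gene Length and Protein Length fields listed in the record.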


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGGCT ATTCCATCGT GCAGCTCGTC CGCCGCCATG CAGGGCGGCT TGACCGTCCG 
CTTCGTCATG CGCTCGTCAA CATGTACCAG CCGATGATGC GCATGCCGTG CATTTTTCAC
CGCTGGATGG AGCGGTGGTT GCGAAAATGG AGGACCGTTT CTGTTTTGAT TGAGGTTGAA
GGCACAGAGG GCGTTGAGGC GCTGGCCAAG GCGGAGCGGG ATCATTTCCG TATGAAACTG
CATCACCATT TCCGCCATGT CCCGATTTAT AGCGCACGGG TGACGCCGGC GGCGCTTGAG
CAGCTGCTCG AGCATCCGAA AGTGAAAAAA GTGTACCTCA ATCGGCAGGT CAAGGCACTG
TTAAACAATG CAGTGCCATC GGCCAATGCG AAACATGTGG CTGTCAACGG GACGGAGTTA
AGCGGAAAAG GAGTGACAAT CGCCATTGTC GACACAGGCA TTTACCCGCA CCCGGACTTG
GAGGGGAGGA TTGCCGCCTT TGTCGATTTT GTCAATGGGC GCACGGCGCC GTATGATGAC
AACGGCCACG GTACGCATTG CGCCGGCGAT GCGGCGGGAA ATGGGCGAAT GTCAGACGGG
CTGTACGCCG GGCCGGCGTA CGAGGCGAAC GTCGTGGGCG TGAAAGTGCT CGACCGTTCA
GGGAGCGGGA CGTTGGAAAC GATTATGCGC GGCATTGAAT GGTGCATTGA TTACAATGAA
CAAAACCCGG CGGGGCGGAT CAACATTATT TCATTATCGC TCGGCGGAGA ACCGCAGCCG
TTTCCGACCG AAAATGACGA CCCGCTCGTG CAGGCGGCGG AGCGGGCGTG GGAGCGCGGC
ATTGTCGTCT GTGCGGCGGC CGGGAATGAA GGGCCGAGTT ACGGCACGAT TGCCAGCCCC
GGCATCAGCG ATCGCATCAT TACAGTGGGA GCCCTTGATG ACCGCGATAC GGCGGCGACA
CGCACGGATG ATGAGGTGGC CCCGTTTTCG AGCCGGGGGC CGACCGAGTA CGGAGTGACG
AAGCCGGATC TCGTTGTCCC GGGGGTGAAT ATCGTTTCGC TCCGCGCTCC GCGTTCCATG
CTCGATAAAA TGAATAAACA AAGCCGGGTT GGCGACCATT ACATGGCCAT GTCCGGCACG
TCGATGGCGA CGCCGATTTG CGCCGGCATT GTCGCCTTGA TGCTCGAGGC GAGACCGGGG
GCAACGCCGG ATGAAGTTAA GCAGGCGTTA AAAGACGGCG CCGATTTATG GAAAGGGCGC
GATCCGAACG TCTACGGAGC CGGCTATGTC AACGCCAAAC GCGCTGTGGA GCTTCTGCTG
CAGCGCTAG
 
Protein sequence
MFGYSIVQLV RRHAGRLDRP LRHALVNMYQ PMMRMPCIFH RWMERWLRKW RTVSVLIEVE 
GTEGVEALAK AERDHFRMKL HHHFRHVPIY SARVTPAALE QLLEHPKVKK VYLNRQVKAL
LNNAVPSANA KHVAVNGTEL SGKGVTIAIV DTGIYPHPDL EGRIAAFVDF VNGRTAPYDD
NGHGTHCAGD AAGNGRMSDG LYAGPAYEAN VVGVKVLDRS GSGTLETIMR GIEWCIDYNE
QNPAGRINII SLSLGGEPQP FPTENDDPLV QAAERAWERG IVVCAAAGNE GPSYGTIASP
GISDRIITVG ALDDRDTAAT RTDDEVAPFS SRGPTEYGVT KPDLVVPGVN IVSLRAPRSM
LDKMNKQSRV GDHYMAMSGT SMATPICAGI VALMLEARPG ATPDEVKQAL KDGADLWKGR
DPNVYGAGYV NAKRAVELLL QR
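
The gene and protein sequences are printed in space-separated blocks of ten, so they need cleaning before any downstream use. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes a GC fraction, and translates with the codon assignments of translation table 11 (identical to the standard code for elongation). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the sketch assumes the sequence contains only A, C, G, and T.

```python
BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ... (NCBI layout).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence; uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased;
    this gene starts with ATG, so that does not matter here.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTTTGGCT ATTCCATCGT"))  # MFGYSI
```

Applied to the full gene sequence, `translate` should reproduce the listed protein sequence once its spaces are removed, and `gc_fraction` can be compared against the GC content field.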