Gene GYMC61_1938 details


Gene Information

Locus tag: GYMC61_1938
Symbol:
ID: 8525802
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 1954571
End bp: 1956607
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 46%
IMG OID:
Product: YD repeat protein
Protein accession: YP_003253037
Protein GI: 261419355
COG category:
COG ID:
TIGRFAM ID:
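The length fields in this record follow from the coordinates. The snippet below is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Both assumptions are consistent with the listed values, but the record itself does not state them.

```python
# Values copied from the record above.
start_bp = 1954571
end_bp = 1956607

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 2037
print(remainder)       # 0
print(protein_length)  # 678
```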


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAA AGGCTTGCCA GTTCCTAAAA TCCCCTCTAC TGTTTAAGGT TGTGAAGAAC 
AAAACAGTGG AGGTAAAGAA AGGAAACCAA AAAGAGTACA CGTACGATTT ATCGGATCGC
TTAAAACAGC TGTTGTTAGC GAACGGTACT TCCGTCAACT ACGATTATGA CGTAAATGGC
AATATGACTT CAAAAGTCAT TCGAACCAGC GACGGCCAGT CTCAAACATT TGCCTATTCT
TATGATGCGG CCGGCAAACT CATGAAAACC GTTGGTCCGC TCCATGATGT GACAACGAAT
GAATATGATG CCAATGGAAA TAAAATCAAA ACGGTTTTGC CGAAAGGAAA CACGATTCAA
TGGACGTATG ACGGAACGGA ACGGGTCAAA ACGATTTCTT ACAACAACGT TCCGTATTAC
GAATTTCGTT ATGATCAAAA TGGAAACGAA CTTTCTGTAC AATATGTAAA AGACGGTACG
ACCAAGACAA GAAAATTTGA TTCCGCGAAT CGTGTCATTG AACAATCCGA CCGCGGAGGA
TTACAAAAGT GGACCTATCC GACGACATCG GATAAATTAC AACAGTTTAT GTTCTCCCAT
GGATCGTTTA GCCAAACAGT CACATACCAA TACAATGCCT TGGATCAAAA TACAGTCGTC
CAAGACGGAA CGTATACGTA TCGCTTTGAC TATGACGAGA GAGGGAACGT TCGCACCTTC
ACGACAGGAA ACGGAGCCGG CTCGACCTTT ACGTATGATG ACCGCGGCCT CGTGGAAAGC
GTTTCGGTCG GTACGGCGGA CGGAACAGAA ATTGTATCGG AAACGTACCG TTACGATGAA
AACGGCAATC GGACAGAAAT CGCCACCCCG ACCGGCGCGA AGACGCTCTA TCGTTATGAC
GCCTTGGATC AACTAGTCGA AGAGCAATGG CCGGATGGAA CGACGATCGC GTATACGTAC
GACGGATTCG GCAATCGGAA GCAAATCGTG AAAACGAAAG ACGGGCAGTC GACGACGACC
ACGGCTGATT ATAATGCGGC GAATCAGTTG GTTCGTTTCG GTGACGAAAC CATCACGTAC
GATGCGAACG GCAACCGTGT CGAAGACGGA CAGTATCGAT ATGAATGGAA CGAAGCTGAC
CAGCTCGTCT CCATCACAAG AAAAGGCGAA AGCACGCCGT TTGTGACGTA CCAATATGAT
GAAGACGGCC GGCGCATCCA AAAGAATGTA AATGGCGTCA TTACGAACTA CCATTACCAA
GGCGACAGCC TCAACGTCTT GTATGAAACC GACGCAAGCG GAAATGTGGT AAGATCATAT
ATATACGGAG AAAACGGGCA ACTCCTTGCC ATGAAAAAAG GCAATGCGAC GTATTTTTAC
CACTATAACA CCCATGGCGA TGTCATCGCC CTGACGGATG AACAAGGAAA CATCGTTGCC
CGTTATCAAT ACGATGCGAG GGGGAATATT CTCTCTCAAT CCGGCGCCTT GGCGGACGAA
AACCCATACC GATATGCGGG GTACCAATAT GACCAAGAAA CGGGTCTTTA TTACCTGATT
GCCCGGTACT ACCACCCAGA GCATGGCGTG TTCCTGTCGC TCGACCCCGA TCCGGGTGAT
GCGGATGACC TCTTGACGCA AAATGGGTAT GCGTATGCGA ACAACAATCC GGTGATGTTC
GTGGATCCGG ATGGGAAGTA CAGAGTTGTT GTATCGCTTT TAGTCAAATT AGTAAACAGG
TTAAAAACAT TATTTAAAGG GAAATGCAAG TCCTGTAAAT ACAAAAATGT GACTAAAAGC
GGCAGCCGCT ATATTAACAT ACAGACGGAT GTTTCTAAAG CAGAGTTCGA AAAAAATCTG
CTTAGAAATG GATGGAGAAA ATCGAAATCA AAAGATGGAA AAACCACGAT ATTTACAAAG
GACGGAGCTA AATACGTTCT TCGTGATGAA GCGAAATCTA CAGGTGGGCC AACCGCTGAC
TATTATCCAA AGGGAAGTAA AAAAATGACT TTAAAAATCA GGTTAAAAAA TAGGTGA
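
A GC content figure like the one in the Gene Information section can be recomputed from a block-formatted sequence such as the one above. The snippet below is a minimal sketch; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how the database computes or rounds its value, so small differences are possible.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here only as sample input.
fragment = clean_sequence(
    "ATGGCAAAAA AGGCTTGCCA GTTCCTAAAA TCCCCTCTAC TGTTTAAGGT TGTGAAGAAC"
)
print(len(fragment), round(gc_percent(fragment), 1))
```

To check the whole gene, pass the full gene sequence text to `clean_sequence` instead of the single-line fragment.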
 
Protein sequence
MAKKACQFLK SPLLFKVVKN KTVEVKKGNQ KEYTYDLSDR LKQLLLANGT SVNYDYDVNG 
NMTSKVIRTS DGQSQTFAYS YDAAGKLMKT VGPLHDVTTN EYDANGNKIK TVLPKGNTIQ
WTYDGTERVK TISYNNVPYY EFRYDQNGNE LSVQYVKDGT TKTRKFDSAN RVIEQSDRGG
LQKWTYPTTS DKLQQFMFSH GSFSQTVTYQ YNALDQNTVV QDGTYTYRFD YDERGNVRTF
TTGNGAGSTF TYDDRGLVES VSVGTADGTE IVSETYRYDE NGNRTEIATP TGAKTLYRYD
ALDQLVEEQW PDGTTIAYTY DGFGNRKQIV KTKDGQSTTT TADYNAANQL VRFGDETITY
DANGNRVEDG QYRYEWNEAD QLVSITRKGE STPFVTYQYD EDGRRIQKNV NGVITNYHYQ
GDSLNVLYET DASGNVVRSY IYGENGQLLA MKKGNATYFY HYNTHGDVIA LTDEQGNIVA
RYQYDARGNI LSQSGALADE NPYRYAGYQY DQETGLYYLI ARYYHPEHGV FLSLDPDPGD
ADDLLTQNGY AYANNNPVMF VDPDGKYRVV VSLLVKLVNR LKTLFKGKCK SCKYKNVTKS
GSRYINIQTD VSKAEFEKNL LRNGWRKSKS KDGKTTIFTK DGAKYVLRDE AKSTGGPTAD
YYPKGSKKMT LKIRLKNR
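
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The snippet below is a minimal sketch, assuming the amino acid assignments of translation table 11, which match the standard genetic code. The `translate` helper is hypothetical and ignores table 11's alternative start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAAAAA AGGCTTGCCA GTTCCTAAAA"))  # MAKKACQFLK
```

The output matches the first ten residues of the protein sequence listed above.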