Gene GYMC61_1787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_1787 
Symbol 
ID8525651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp1810920 
End bp1812641 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content55% 
IMG OID 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_003252896 
Protein GI261419214 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAGA CGATTCATGG CATCGCCGCT TCAAGCGGCA TCGCCATCGC CAAGGCCTAC 
CGCTTAGAGA CGCCTCATTT GACGGCCGAA AAACGAACGG TCACCGATGT CGAGGCGGAA
ATTGCGCGGC TTGAGGCGGC GGTCGCGAAA GCGAAAGAAG AGCTGGAAGC CATCAAACAG
CATGCCTTGG AAAAGCTTGG TGAAGACAAA GCCGCCATTT TTGCCGCCCA CTTGCTTGTG
CTTGACGACC CAGAACTGCT GAACCCAATT AAGGAAAAAA TCAAAACAGA ACAAGTGAAT
GCGGAATATG CGCTCCATGA AACGGCATCG TTTTTCATTT CCATGTTTGA AGGCATGGAC
AATGAGTATA TGAAAGAGCG GGCCGCCGAT ATCCGCGATG TGACAAAGCG CGTCCTCGCC
CATCTGCTTG GCGTCACGAT CTCGAACCCG AGCCTCATTT CTGAGGAAGT CGTGATCATC
GCTGAAGACT TGACGCCATC CGATACGGCG CAGCTGAACC GCCAATATGT GAAAGGATTT
GCCACCGACA TCGGCGGGCG AACGTCGCAC TCGGCCATTA TGGCCCGCTC GCTCGAAATT
CCGGCTGTCG TCGGCACGAA GGCGGTAACG GCGGAAGTAA AAAACGGCGA CATGGTCATC
GTCGATGGGC TCGACGGTCA AGTCGTCGTC AATCCGTCCC CGGAGCTGCT TGCGCGTTAT
GAAGAGAAGC GGGCTCGCTA TGAGGAGCAA AAAGCGGAAT GGGCGAAGCT TGTCGATCAA
CCGACGGTCA CCGCTGATGG CGTGCACGTT GAGCTGGCGG CCAATATCGG CACGCCGGAC
GATGTGAAAG GAGCGTTGGC CAACGGGGCA GAAGGGATCG GATTGTATCG CACGGAATTT
TTATACATGG GACGATCGGA GCTGCCTACG GAAGACGAAC AGTTTGCGGC TTACAAAACG
GTGCTTGAAC AAATGGGCGG CAAGCCGGTC GTTGTGCGTA CGCTTGACAT TGGCGGCGAC
AAAGAGCTCC CGTATTTACA CTTGCCAAAA GAGATGAACC CGTTTTTAGG GTTTCGAGCC
ATTCGGCTTT GTTTGGAAAT GCAAGACATG TTCCGCACCC AGCTGCGCGC CTTGCTGCGG
GCGAGCGTGC ACGGCAATTT GAAAATCATG TTCCCGATGA TTGCGACGCT CGATGAATTC
CGCCAAGCGA AAGCGATTTT GCTCGAAGAA AAAGAAGCGC TCCTCCGCCA AGGCGTCCCG
GTCGCCGATG ACATTGAAGT CGGCATGATG GTGGAGATCC CGGCTGCCGC CGTCATGGCC
GATCAGTTTG CCAGGGAAGT CGATTTCTTC AGCATCGGAA CGAACGACCT GATCCAATAT
ACGATGGCGG CCGACCGGAT GAATGAGAGG GTGGCGTATC TATATCAACC GTACAACCCG
GCTATTTTGC GGCTCATCAG CTATGTCATT GACGCCGCTC ACCGCGAAGG GAAATGGGTT
GGGATGTGCG GGGAAATGGC CGGCGACCCG ATCGCCATTC CGATTTTGCT TGCTCTTGGC
CTTGATGAGT TCAGCATGAG CGCCACCTCG ATTTTGCCGG CGCGCGCCCA GCTGAAGCGG
CTGTCAAAAG AGGATGCGGT CCGCGTGAAA GAGACAGTGC TGTCGCTTGG TACGGCTGAG
GAAGTAGTGT CGTTTGTCAA ACGAACGTTC CATATGGCTT GA
 
Protein sequence
MEKTIHGIAA SSGIAIAKAY RLETPHLTAE KRTVTDVEAE IARLEAAVAK AKEELEAIKQ 
HALEKLGEDK AAIFAAHLLV LDDPELLNPI KEKIKTEQVN AEYALHETAS FFISMFEGMD
NEYMKERAAD IRDVTKRVLA HLLGVTISNP SLISEEVVII AEDLTPSDTA QLNRQYVKGF
ATDIGGRTSH SAIMARSLEI PAVVGTKAVT AEVKNGDMVI VDGLDGQVVV NPSPELLARY
EEKRARYEEQ KAEWAKLVDQ PTVTADGVHV ELAANIGTPD DVKGALANGA EGIGLYRTEF
LYMGRSELPT EDEQFAAYKT VLEQMGGKPV VVRTLDIGGD KELPYLHLPK EMNPFLGFRA
IRLCLEMQDM FRTQLRALLR ASVHGNLKIM FPMIATLDEF RQAKAILLEE KEALLRQGVP
VADDIEVGMM VEIPAAAVMA DQFAREVDFF SIGTNDLIQY TMAADRMNER VAYLYQPYNP
AILRLISYVI DAAHREGKWV GMCGEMAGDP IAIPILLALG LDEFSMSATS ILPARAQLKR
LSKEDAVRVK ETVLSLGTAE EVVSFVKRTF HMA