Gene GYMC61_2668 details

Gene Information

Locus tag: GYMC61_2668
Symbol:
ID: 8526545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 2707208
End bp: 2708524
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 49%
IMG OID:
Product: PTS system, cellobiose-specific IIC subunit
Protein accession: YP_003253738
Protein GI: 261420056
COG category:
COG ID:
TIGRFAM ID:
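
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete with a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2707208
END_BP = 2708524
GENE_LENGTH_BP = 1317
PROTEIN_LENGTH_AA = 438


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a complete, unspliced CDS: codons minus one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2708524 - 2707208 + 1 gives 1317 bp, and 1317 / 3 - 1 gives 438 residues, which matches the listed gene and protein lengths.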


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCAAG CATTATTCGA AAAACTAAGC AAAGTTCTCA TTCCGATCGC GGGAAAGTTG 
AACAATAGCC GCTATTTGCA AGTGTTGCGC GATGCCTTTA TGTTGGCGTT TCCGTTGACG
ATTTTCGGTT CAATCGCGGT CGTCATCGCC AACTTGCCGT TTTTGGACAA GGTGATGAGT
GAAAACAGCC TCAATACACT CAAGGGGATG CTCGGTGTTG CACCAAATGC AACAATGGGC
GTGATGACGA TTTTCGTCGT GTTTGGCATT GGCTACTATT TATCGAAAAG TTATGAAGTC
GAAGGGATTT TCGGCGGTGC GATCGCGTTA GCTTCTTTTC TCTTGTTGAC GCCATTCGCT
TTGCAGGTTG AAGGAGGTGA GGTTGTGCAA GGAGTCATCC CGCTCGACCG CCTAGGAGCG
AAAGGGATGT TTCTTGGCAT GATTACCGCA TTTGTCGCTG GGGAAATTTA CCGGAAGGTT
GTGCAAAAAA ATATTACCAT CAAAATGCCG GCCGGGGTGC CGCCGGCGGT TGCGAAGTCG
TTTGCGGCGT TGATTCCGGC GGTGGTCACC CTGACCTTCT TCCTCGTCGT CAATATCATT
GTAACACAAA TTTTTAAAAC AAACATGCAT GATGTCATTT ACAACGCAGT GCAAGCACCG
CTTGTTGGGC TCGGAAGCGG CATTGTTCCA ACGCTCATCG CGATTTTTGT CACGCAAATT
TTATGGTTTT TTGGCCTCCA CGGGCAAATC ATCATCAACT CGGTGATGGA TCCGATTTGG
AACACACTGT CGCTCGAAAA CTTAAATGCG TACACGCAAA CAGGGGAAGT TCCGCATGTC
GTCAGCAAAC AGTTTATTGA AATTTACACG GTCGGCATGG GCGGAACGGG TATGACACTC
GCTGTCATTT TCGCCATCTT GCTCTTTATG AAGAGCAAGC AAATGAAGCA GGTGGCCAAG
CTCGGGCTTG GACCGGGAAT CTTTAACGTC AATGAACCGA TTATTTTCGG CTTGCCGGTC
GTGATGAATC CGCTCGTCAT CGTCCCGTGG ATTTTGGCGC CGATGGTTGT CACGTTGGTG
ACGTATTTGG CGATGTCCTC AGGCCTTGTC CCGCCGCCTA ACGGCGTAGC GGTACCATGG
ACGGTGCCGA TTTTCATCAA CGGCATTATG GCGACAAACT CGCTGGCCGG CGGACTGTTG
CAAGTGGTCA ATTTCTTGAT CGTGCTCGTC ATTTGGTTCC CGTTCTTAAA ATTCATTGAC
CGCATGAATT TACAAAAGGA AAAAGAAGAG CAAGCCGCAT CGAAAAGTGC ATCATAA
 
Protein sequence
MNQALFEKLS KVLIPIAGKL NNSRYLQVLR DAFMLAFPLT IFGSIAVVIA NLPFLDKVMS 
ENSLNTLKGM LGVAPNATMG VMTIFVVFGI GYYLSKSYEV EGIFGGAIAL ASFLLLTPFA
LQVEGGEVVQ GVIPLDRLGA KGMFLGMITA FVAGEIYRKV VQKNITIKMP AGVPPAVAKS
FAALIPAVVT LTFFLVVNII VTQIFKTNMH DVIYNAVQAP LVGLGSGIVP TLIAIFVTQI
LWFFGLHGQI IINSVMDPIW NTLSLENLNA YTQTGEVPHV VSKQFIEIYT VGMGGTGMTL
AVIFAILLFM KSKQMKQVAK LGLGPGIFNV NEPIIFGLPV VMNPLVIVPW ILAPMVVTLV
TYLAMSSGLV PPPNGVAVPW TVPIFINGIM ATNSLAGGLL QVVNFLIVLV IWFPFLKFID
RMNLQKEKEE QAASKSAS
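
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, translates DNA using the amino acid assignments of NCBI translation table 11 (which are the same as the standard code and differ only in permitted start codons), and computes a GC fraction. All function names are hypothetical.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence as printed in this record.
    first_blocks = "ATGAACCAAG CATTATTCGA AAAACTAAGC"
    print(translate(first_blocks))
```

Translating the first three blocks of the gene sequence yields MNQALFEKLS, the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.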