Gene GYMC61_2135 details

Gene Information

Locus tag: GYMC61_2135
Symbol:
ID: 8525999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 2156960
End bp: 2158315
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: PTS system, cellobiose-specific IIC subunit
Protein accession: YP_003253233
Protein GI: 261419551
COG category:
COG ID:
TIGRFAM ID:
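
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codons in the CDS minus one, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2156960, 2158315)
print(length)                           # 1356
print(expected_protein_length(length))  # 451
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.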


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCGGT TTATTCGTGT GTTGGAAGAG CGTGTGATGC CTGTCGCCGG CAGGATTGCC 
GAACAGCGCC ATTTGCAAGC CATTCGTGAC GGAATCATTT TGTCGATGCC GCTCTTGATT
ATCGGGTCTT TATTTTTAAT CATTGGTTTT TTGCCGATCC CCGGTTATAA CGAATGGATG
GCGAAATGGT TTGGCGAACA TTGGCTCGAT AAGCTGTTGT ATCCGGTCGG GGCGACATTC
GACATTATGG CGCTTGTCGT CAGCTTCGGC GTCGCTTACC GGTTGGCGGA AAAGTACAAA
GTTGATGCGC TTTCGGCCGG GGCGATTTCA CTTGCCGCTT TTTTGCTCGC AACTCCGTAT
AAAGTGCCGT TCACGCCGGA AGGAGCGAAA GAAGCCATTA TGGTCAGCGG CGGCATTCCG
GTGCAATGGG TCGGCAGCAA AGGTTTGTTT GTCGCCATGA TTTTGGCGAT CGCATCGACG
GAAATTTACC GAAAAATCAT CCAAAAAAAT ATTGTCATTC GACTTCCGGA TGGGGTGCCG
CCTGCAGTGG CCCGCTCTTT TGTTGCTTTG ATTCCGGGGG CCGCTGTTCT CGTCGTTGTC
TGGGTGGCCC GCCTTATTTT GGAAATGACG CCGTTTGAAA GTTTCCATAA TATTGTATCT
GTGCTTCTAA ACAAACCGCT CAGTGTGCTC GGCGGCAGTT TATTTGGCGC CATTGTCGCT
GTACTGCTTG TGCAGCTGCT ATGGTCGACC GGTTTGCACG GGGCGGCGAT CGTAGGAGGA
GTAATGGGGC CGATTTGGCT GTCGCTGATG GACGAAAACC GGATGGTGTT CCAGCAAAAT
CCGAATGCCG AACTGCCCAA CGTCATTACG CAGCAGTTTT TTGATCTTTG GATTTACATC
GGCGGTTCAG GAGCGACATT GGCGTTGGCG TTGACCATGA TGCTTCGGGC GCGCAGCCGG
CAGTTGAAAA GCTTAGGGCG GCTCGCGATC GCACCTGGCA TTTTCAATAT TAATGAGCCG
ATCACGTTCG GTATGCCGAT CGTCATGAAT CCATTGCTTA TCATTCCATT CATTCTCGTG
CCTGTCGTGC TTGTTGTTGT CTCCTACGCG GCGATGGCGA CTGGGCTTGT CGCCAAACCA
AGCGGGGTGG CCGTGCCATG GACGACACCG ATCGTGATCA GTGGCTATTT AGCGACGGGG
GGCAAAATTT CCGGGAGCAT TTTGCAAATC GTCAACTTCT TCATCGCGTT TGCCATCTAC
TATCCATTTT TCTCGATTTG GGACAAACAA AAAGCGGCCG AAGAGCAGAC CGATCCAACA
ATCTCAAGCG GAGCGGGAAC AACGCACTCG CTGTAA
 
Protein sequence
MDRFIRVLEE RVMPVAGRIA EQRHLQAIRD GIILSMPLLI IGSLFLIIGF LPIPGYNEWM 
AKWFGEHWLD KLLYPVGATF DIMALVVSFG VAYRLAEKYK VDALSAGAIS LAAFLLATPY
KVPFTPEGAK EAIMVSGGIP VQWVGSKGLF VAMILAIAST EIYRKIIQKN IVIRLPDGVP
PAVARSFVAL IPGAAVLVVV WVARLILEMT PFESFHNIVS VLLNKPLSVL GGSLFGAIVA
VLLVQLLWST GLHGAAIVGG VMGPIWLSLM DENRMVFQQN PNAELPNVIT QQFFDLWIYI
GGSGATLALA LTMMLRARSR QLKSLGRLAI APGIFNINEP ITFGMPIVMN PLLIIPFILV
PVVLVVVSYA AMATGLVAKP SGVAVPWTTP IVISGYLATG GKISGSILQI VNFFIAFAIY
YPFFSIWDKQ KAAEEQTDPT ISSGAGTTHS L
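
The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a minimal sketch, not the database's own pipeline. It builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with the standard code for elongation codons. Table 11's alternative start codons are not handled, which is acceptable here only because this gene begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; the third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGATCGGT TTATTCGTGT GTTGGAAGAG"))  # MDRFIRVLEE
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence block and the GC content field.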