Gene GYMC61_0050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_0050 
Symbol 
ID8523832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp62180 
End bp63730 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content58% 
IMG OID 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_003251232 
Protein GI261417550 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAACG TATGGAAAGG GGCGGCGATC TTAACGGCCG CCGCTTTGGC CGCGAAACTA 
TTAAGCGCTT TGTACCGTGT TCCGTATCAA AATATGGTCG GGGACATTGG TTTTTATATT
TACCAACAAG TGTACCCGAT TTATGGCATC GTCGTCGCAC TCTCGCTGAC CGGCTACCCG
GTTGCCGTCT CGAAGCTTGT CGCCGAGCGG TTGGCGGGAC AGGATGAAGC GGGCGCTGCC
GCTAGTGTGC GCGTCGCCTT GTTGGTGTTA AGCGTCCTTG GCGTCATCCT GTTTGCTTCG
CTGTATCTAG GGGCGGGGGT GATTGCCTCG GCGATGGGCG ATGGACGGCT TACGCCGCTC
GTGCGTTTGC TTTCATTTTC GTTTTTGCTG TTTCCGCCCA TCGCGTTGTT GCGTGGCTAT
TTCCAAGGGC GGCATGATAT GACGCCGACG GCGGCGTCCC AAGTTGGCGA GCAATTCGTC
CGGGTGACGG CGATTTTAGG GCTGTCGTAT GGAGCGGTGC AGCGCGGCGC CGACGTCTAT
GCTTGCGGCA TGGCAGCGGT CGCAGGGACG CTAGTAGGCA TGGCGGCGGC GCTTTTCATT
TTGCTTTTCT TCCTGTCCCG GCGTCGGCGG CTAAAAACGT CGGGCCGCAC GCCGCCAGCT
TGGGATCGAC AGGTGGGCCG GCGTTTATTG ACGGAGGGGA CGGTCATTTG CTTGACGAAT
ATGGCGTTGA CGCTGATTCC ACTCGTCGAT TCATTTTTAT TCGTTCCGCT TCTACAGGAA
GCGGGGGCAA GGCTCGATGA GGTGCAGCGG CTAAAAGGAG TGTACGACCG CGGTCAGCCG
CTCATTCAGC TCGGCACGGT CGTCGGCACG TCGTTTTCAT TGGCGCTTGT TCCACTTCTT
TCCGGAGCGC GCCGCCAAGG TGCCGTTTTC GCCTATGGAG CGCTGTCCAT CCGGCTTGCC
GTTGTCATTG GGCTTGGTGC TTCGTTAGGG CTCATTTGCC TCATTCGACC GATCAATGCG
ATGTTGTTCG AGAATGACTA CGGTTCGTCG GTTCTCGCCG TCTTGTCCTC CTCTGTCTTT
TTTACGACGA TCGCGTTGAC CGCCTCTGCA TTATTGCAAG GAATGGGGAG GGAATGGACG
GCCGCTGCCG GCGTGGCGTT GGCAGTGGCG GGGAAGGCCG CGCTTATGCA TTGGCTTGCT
CCGCGGTTTG GAGCGCTTGG CGCCGCCGCG GCGACGACGG GTGCTTATGC GCTCATGGCA
GGCTTTTTAT GCGCCTTTTT GCCGCGTGAA TATCGGACGG CGGGCCGGAA ATACATGTAC
CCAACCGTGA AAGCGGCCGC TATGATGGCC GTCGTCTTGC ATGGGTATAG GTGGCTGATG
GACAGCTCGA GCGAGGGGCG GCTATGGGCG GCTGCCGAGG CGCTTGGCGG CGTTGCCATC
GGTGCTGTCG TTTACCTTGC GTGTATTGTG AAAGGACATG TTTTTTCTGA ACAGGAGTTG
GCAGCTCTCC CATTGGCTAA TAAATTCCGT CTACGATTAG GAGGCAGGTG A
 
Protein sequence
MGNVWKGAAI LTAAALAAKL LSALYRVPYQ NMVGDIGFYI YQQVYPIYGI VVALSLTGYP 
VAVSKLVAER LAGQDEAGAA ASVRVALLVL SVLGVILFAS LYLGAGVIAS AMGDGRLTPL
VRLLSFSFLL FPPIALLRGY FQGRHDMTPT AASQVGEQFV RVTAILGLSY GAVQRGADVY
ACGMAAVAGT LVGMAAALFI LLFFLSRRRR LKTSGRTPPA WDRQVGRRLL TEGTVICLTN
MALTLIPLVD SFLFVPLLQE AGARLDEVQR LKGVYDRGQP LIQLGTVVGT SFSLALVPLL
SGARRQGAVF AYGALSIRLA VVIGLGASLG LICLIRPINA MLFENDYGSS VLAVLSSSVF
FTTIALTASA LLQGMGREWT AAAGVALAVA GKAALMHWLA PRFGALGAAA ATTGAYALMA
GFLCAFLPRE YRTAGRKYMY PTVKAAAMMA VVLHGYRWLM DSSSEGRLWA AAEALGGVAI
GAVVYLACIV KGHVFSEQEL AALPLANKFR LRLGGR