Gene GYMC61_2540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_2540 
Symbol 
ID8526408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp2572209 
End bp2574113 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content58% 
IMG OID 
Productvon Willebrand factor type A 
Protein accessionYP_003253615 
Protein GI261419933 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCGGT TTATGATGTT CAATGACCGA ACGGTCGACT CGAGTTTGGT CATGCAGCTT 
TCTGACCTCG CCCAAACGCT CATGCGCCGC CGCGACGTTG CCATCCAGTT CGCCGCCCAT
TCCGGCGTGC ATTGTGTCAA GCCGGTCGTT TACGTCAGCC ATTTTTGGGA AGGGTATCCA
CCATTGGAGC GGGAAACGGC AATGAAAAGC GACGTCGGCT TGCGCATCAT CGGCACGTAT
CGGCACACCG ACCGATTGGC GGTTCGCGCC TTCCGACATG CGGTGGAATC CCGACCGCTC
TCCAAGCTTT CCAAACAGCT GTTTACATTC GCTGAAGATC TTCGCTTGGA AGCCGTCTGC
GAGCGGGAAC GTCCCGGGAT GAAGCGATGG TTCCGCACGC GCCGCCGCAT GTACCGCCGC
TACTTCACTC AGCAATGGCA AGCAAACAGG ACGCGCGGCG CTTGGGCTGA TCAGTTTTTG
GCGGGGATGT ATTTGCGGCT CACGGCCGAT TCGCCGCTTG ATGATGCGCC GCTGGCTCCG
ATGGCGGATG AAGCGGTGCA GGCGCGCCTC GAAACGTTAT GGCCGCAATT TTTTGATGCT
TCCTCGACAG CCGAGGTGGC GGACTGGGCG TTCGCCGTTG TGGAACTGAT GGAGGACGTG
CTGCCTCATG ATATGGTCAA TGCCTATACA TCGCTGCCAA TCGTTGAGGA TGGTGCGGAC
GAGAAAACGA TGACGCTTCA AGATCTCAAA CGGACCAACC CATTGGAAAA CCGCGATGCC
CTTCAAGAGG CGGATGGAGA GGCCAAGCGG CAAGTGCTGC CAACATGGCA TCGGGAAACG
AGCCGGGCGG GTGGAAGCTT TCTCCGCTTT GAGCTCGAAC GAGGCAGCCG CACCGAGATC
ATCAGCGATG AGGCGAGGCC GGGTGAAGAC GGCGATCAGG CTCTGGCCGT CGTGAAAGGA
ACGTCTCGGC CGACGGCGCG AAACGAGTAT GGCCTGGAAG CGCAAGCATC CTTCAGCGAA
CAGCCGCCGG CTGGAAACAG TGCGCCGTAT GGCGAGGCCA ACCGCCAAGC CGATCTTGTG
CTTCTTCCTT CGTCGCCAAA CCTCGCGCAT CTTGAGCAAT ACCGCGCCAA ACAGGCGGCG
GTGGCGCCGT ACCGAAAACG GCTTGTGCGC ATCATGGAAC AATGGCTTGA ACACAAGCGC
TCCGCTTGGC GCACGAACTT GCCCGTCGGT CGGCTGCGCA AACAATTGGT GTCGTTTTTC
ATTGATGAGC GGCCGCGCTT GTTTTGCAAA AAAGGCGAGC CGACGCGGCG GTTTGATGCG
GTCTTCGGCC TGCTTGTCGA TTGCTCGGCT TCGATGCATG ACAAAATGGA AGAGACGAAA
ACCGGGCTCG TTCTTTGCCA TGAGGTGCTG AAAACGTTGC GCGTGCCGCA CCAAATTGTC
GGATTTTGGG AAGATGCGAA CGAAGCAACC GCTTCGCGTC AGCCGAACTA TTTGCAGATG
GCGGTCTCAT TCCATCGCTC GCTTGAGCCG TCGAGCGGCC CGGCGATCCT GCAGCTCGAG
CCGCATGAAG ATAATCGCGA TGGATTGGCG ATTCGCTGGA TGACCGAACA GCTCCTCAAG
CGCCCGGAAG CACAAAAAGT GCTGCTCGTC TTCTCTGACG GCGAACCCGC CGCGTACGGA
TATGAACAAA ACGGCATCAT CGATACGCAC GAAGCAGTCG CCGAAGCGCG CCGCCGCGGC
ATTGAGGTCG TCAATCTCTT TTTAGGCCAC GGGGCCGACG ATGAGTCAAC GCGGCGGACG
ATTGAAAACA TCTATGGTCG CTTCCGCGTC TTTGTTCCGC ATGTGAGCGA GTTGCCGGAT
CGGCTGTTGC CGCTCTTGAA AACGTGGTTG CAAAAAAGTT TGTGA
 
Protein sequence
MERFMMFNDR TVDSSLVMQL SDLAQTLMRR RDVAIQFAAH SGVHCVKPVV YVSHFWEGYP 
PLERETAMKS DVGLRIIGTY RHTDRLAVRA FRHAVESRPL SKLSKQLFTF AEDLRLEAVC
ERERPGMKRW FRTRRRMYRR YFTQQWQANR TRGAWADQFL AGMYLRLTAD SPLDDAPLAP
MADEAVQARL ETLWPQFFDA SSTAEVADWA FAVVELMEDV LPHDMVNAYT SLPIVEDGAD
EKTMTLQDLK RTNPLENRDA LQEADGEAKR QVLPTWHRET SRAGGSFLRF ELERGSRTEI
ISDEARPGED GDQALAVVKG TSRPTARNEY GLEAQASFSE QPPAGNSAPY GEANRQADLV
LLPSSPNLAH LEQYRAKQAA VAPYRKRLVR IMEQWLEHKR SAWRTNLPVG RLRKQLVSFF
IDERPRLFCK KGEPTRRFDA VFGLLVDCSA SMHDKMEETK TGLVLCHEVL KTLRVPHQIV
GFWEDANEAT ASRQPNYLQM AVSFHRSLEP SSGPAILQLE PHEDNRDGLA IRWMTEQLLK
RPEAQKVLLV FSDGEPAAYG YEQNGIIDTH EAVAEARRRG IEVVNLFLGH GADDESTRRT
IENIYGRFRV FVPHVSELPD RLLPLLKTWL QKSL