Gene GYMC61_2843 details


Gene Information

Locus tag: GYMC61_2843
Symbol:
ID: 8526720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 2902060
End bp: 2903223
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: Capsule synthesis protein, CapA
Protein accession: YP_003253901
Protein GI: 261420219
COG category:
COG ID:
TIGRFAM ID:
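
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2902060
end_bp = 2903223

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1164
print(protein_length)  # 387
```

Under these assumptions the computed values agree with the listed Gene Length (1164 bp) and Protein Length (387 aa).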


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCAT GGATCAAGGG GAGAAATGCG GCGCTTATGT GTATGTGTAT GGCGTCCATC 
TTTTTATTTT CCTGCAGCAA TGCGGCTGAT CGCGGCGTCG AGGAAGTGCT GGCGGCGCCA
AACGAACTGG CGGGGCAAAC AATCGAGCAT CCACCGCTGC CGAGTCCGAT CGATAAGATG
GTGATAGCGT CAGCCTCTGC CAAACAAGCG GCGCTGAACC CGAACGAAGT GCGCATCACC
ATCAGCGCGG CGGGCGATGT GACGCTCGGG CGCGATGAAA ATTATGGCTA TGCGTATTCG
TTTGATGAAG AAGCGAAGAA GCACGGCTTG CGCTATTTTA CGAAATACAT TGAGCCGATC
TTCAAAAAGG ACGATTTTAC GACCGTCAAT TTGGAAACGA CGCTGACGAC CTCGACGCGG
AAGGCAAGCA AGAAATTTCG TTTTCGCGGC CACCCGAGTT ATGCGAAAAT CTTAACTTAT
GGCGGCATTG ATGCTGTGAA TCTGGCCAAT AATCATACAT ACGATTACTT GCAACGAGGA
TACAATGACA CGATTGCCAG TTTAAAAAAG GAAAACATCG GTTATTTCGG CCGCACGCTC
CGGCTGTTGA AAACGGTCAA AGGCATTCAA GTTGGAGCGC TTGGCTACGA AGGCTGGAGC
AACACCAGCA CGTTGCGCAA GCAAATCGCC AACGATATTC GCACGCTTCG GAAACAAGGA
GCCGACATCG TTTTCGTCCA TTTCCATTGG GGAGTGGAAC GAAGCTATGT GCCAAACAGC
ACGCAAAAGG CGCTCGGCCG CTTTGCGATT GACAGCGGCG CCGATTTGGT CGTCGGTCAT
CATCCGCATG TCATTCAAGG GATTGAGGAG TATAAAGGGA AGTTCATCGT CTACAGCCTC
GGCAATTTTA TGTTCGGCGG AAACAAAAAT CCGAGCGACA AAGATACGTT TGTCTTCCAG
CAAGTGTTCT CTTTTCAAAA CGGCAAGCGG ACGGCCAAAA AAGAAATCCG CGTCATCCCA
TTTCGCATTT CATCGGTGAC GACAAGAAAC AATTATCAGC CGATGCCGTT AGCGGGAGGG
GAAGCAGCCC GAGTGAAACG GAAAATTGTC TCGCTGTCGG CGAAAATCAA AAAGCCGACT
TGGACCGCGT ACGAAGTGAA GTAA
 
Protein sequence
MNAWIKGRNA ALMCMCMASI FLFSCSNAAD RGVEEVLAAP NELAGQTIEH PPLPSPIDKM 
VIASASAKQA ALNPNEVRIT ISAAGDVTLG RDENYGYAYS FDEEAKKHGL RYFTKYIEPI
FKKDDFTTVN LETTLTTSTR KASKKFRFRG HPSYAKILTY GGIDAVNLAN NHTYDYLQRG
YNDTIASLKK ENIGYFGRTL RLLKTVKGIQ VGALGYEGWS NTSTLRKQIA NDIRTLRKQG
ADIVFVHFHW GVERSYVPNS TQKALGRFAI DSGADLVVGH HPHVIQGIEE YKGKFIVYSL
GNFMFGGNKN PSDKDTFVFQ QVFSFQNGKR TAKKEIRVIP FRISSVTTRN NYQPMPLAGG
EAARVKRKIV SLSAKIKKPT WTAYEVK
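
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch in plain Python, not the database's own method. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares for all sense codons. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence in this record.
first_line = "ATGAACGCAT GGATCAAGGG GAGAAATGCG GCGCTTATGT GTATGTGTAT GGCGTCCATC"
last_line = "TGGACCGCGT ACGAAGTGAA GTAA"

print(translate(first_line))  # MNAWIKGRNAALMCMCMASI
print(translate(last_line))   # WTAYEVK
```

The two outputs match the first 20 and last 7 residues of the protein sequence listed above. The last line is in frame because the 19 preceding lines hold 1140 bases, a multiple of three. Passing the full gene sequence to `gc_percent` would give a value to compare with the listed GC content.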