Gene GYMC61_3454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_3454 
Symbol 
ID8527342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp3514977 
End bp3517073 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content55% 
IMG OID 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003254484 
Protein GI261420802 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCCT TGTTGGTGGT GAACTGGCTC GCGTTTTTGC TTGTAACCGC TTACGCAATT 
TACTTGTTTG CATATGTCGT AAAAACGCGG GCGATGTACA TCAAACTCGG CAAAAAAGTC
GAGTTTGACC GCAAAGTAAA AGAGCGGCTG CGAAACATCT GGGTCAACGT GTTCGGCCAG
AAAAAGCTGC TCAAGGACAA AAAAAGCGGA CTGATCCACG TCGTCTTTTT CTATGGCTTT
ATTCTCGTCC AATTTGGCGC GATTGATTTC ATCATCAAAG GGCTTGCGCC GGGGGCGCAT
TTGCCGCTTG GGCCGCTGTA TGCGGGATTT ACGTTTTTCC AAGAAATCGT CACTTTGCTT
ATTTTGATCG CGGTGCTCGC CGCCTTTTAC CGCCGTTACA TTGAAAAGCT CGTCCGTTTA
AAGCGCGACT TGAAAGCTGG GCTTGTCCTC ATCTTCATCG CCGGGCTCAT GCTGTCGGTG
TTGTTCGGCA ACGGGATGAG CATCATTTGG CACGGCGAAG AAGCGACATG GAGCGAGCCG
GTCGCCTCGC TCATTGCTGC CGCGTTTTCC TGGGTCGGCG AAACGGGGGC TGCGGTGCTC
TTCTTTATCG CCTGGTGGGT GCATCTGTTG ATTTTGCTCA CGTTCCTCGT GTACGTGCCG
CAATCAAAGC ACGCCCATTT GATCGCTGCG CCGATCAACG TCTTTTTCAG CCGGCTGACG
CGGCCGAAGC TTGCGCCGAT CCATTTTGAA GACGAAAGCC AAGAATCGTT TGGCGTCGGC
AAAATTGAGG ATTTCACGCA AAAGCAGTTG ATCGACTTGT ACGCCTGTGT CGAGTGCGGC
CGCTGCACAA GCATGTGTCC GGCGACCGGC ACGGGGAAAA TGTTGTCGCC GATGGATTTG
ATTTTAAAAT TGCGCGACCA TTTGACGGAA AAAGGGGCGG TCGTCACGTC GCGCGCGCCG
TGGGTGCCCG CGTTCGCTTT CAAACATACA AGGGGCAATC AGCTCGCGTT CGCCGCGGCG
TCGGAGCAGG CGGCGACGAT CGAAATGCCA AGCTTAATCG GCGATGTCAT CACCGAAGAA
GAGATTTGGG CCTGTACGAC GTGCCGCAAC TGTGAGGACC AATGCCCGGT CATGAATGAG
CATGTCGATA AAATCATCGA CTTGCGCCGC TATCTTGTCC TGACGGAAGG ACGGATGAAC
CCGGACGCGC AGCGGGCGAT GACGAACATC GAACGCCAAG GCAATCCGTG GGGCTTGAAC
CGAAAAGAGC GGGAGAACTG GCGCGAGCTG CGCGATGATG TGCATGTGCC GACCGTCAAA
GAGGCGGCGA AAGCGGGAGA GGAAATCGAG TACTTGTTCT GGGTCGGCTC GATGGGGTCG
TATGACAGCC GGAGCCAAAA AATCGCCCTT GCGTTTGCCA AGCTGTTGAA TGAAGCAGGC
GTCAAGTTCG CGATTTTAGG CAACAAGGAG AAAAACTCGG GCGATACGCC GCGCCGGTTA
GGAAATGAGT TTTTGTTCCA AGAATTAGCG ACGAACAACA TCGCGGAATT TGAAAAAGCG
GGCGTCAAGA AAATCGTGAC GATCGACCCG CACGCTTACA ATACGTTCAA AAATGAGTAC
CCGGATTTTG GGTTCGAGGC CGAAGTGTAT CATCATACCG AGCTGCTCGC CAAGCTCATC
GAAGAAGGGC GGCTCGTGCC GAAACATCCG GTGAATGAAC GCATTACGTT CCATGACTCG
TGCTATTTAG GACGCTACAA TGACGTCTAT GACGCGCCGC GGAAAATTTT GCGCGCTATC
CCGGGCGTCG AGCTTGTCGA AATGGAGCGC AACCGCGAAC GCGCCATGTG TTGCGGCGCC
GGCGGCGGCC TCATGTGGAT GGAGGAGACG ACCGGCAACC GGATCAACGT CGCCCGCACT
GAGCAAGCGC TCGCTGTCAA TCCGACGGTC ATCAGCTCCG GCTGTCCGTA CTGTTTGACG
ATGTTGTCAG ACGGCACGAA GGCCAAGGAA GTGGAAGACC GCGTCTTTAC GTACGATGTC
GCTGAATTGT TGGCGAAATC GGTGTTTGGC GAGGAAAAAG AAGAAGCGGC ATCATAA
 
Protein sequence
MNALLVVNWL AFLLVTAYAI YLFAYVVKTR AMYIKLGKKV EFDRKVKERL RNIWVNVFGQ 
KKLLKDKKSG LIHVVFFYGF ILVQFGAIDF IIKGLAPGAH LPLGPLYAGF TFFQEIVTLL
ILIAVLAAFY RRYIEKLVRL KRDLKAGLVL IFIAGLMLSV LFGNGMSIIW HGEEATWSEP
VASLIAAAFS WVGETGAAVL FFIAWWVHLL ILLTFLVYVP QSKHAHLIAA PINVFFSRLT
RPKLAPIHFE DESQESFGVG KIEDFTQKQL IDLYACVECG RCTSMCPATG TGKMLSPMDL
ILKLRDHLTE KGAVVTSRAP WVPAFAFKHT RGNQLAFAAA SEQAATIEMP SLIGDVITEE
EIWACTTCRN CEDQCPVMNE HVDKIIDLRR YLVLTEGRMN PDAQRAMTNI ERQGNPWGLN
RKERENWREL RDDVHVPTVK EAAKAGEEIE YLFWVGSMGS YDSRSQKIAL AFAKLLNEAG
VKFAILGNKE KNSGDTPRRL GNEFLFQELA TNNIAEFEKA GVKKIVTIDP HAYNTFKNEY
PDFGFEAEVY HHTELLAKLI EEGRLVPKHP VNERITFHDS CYLGRYNDVY DAPRKILRAI
PGVELVEMER NRERAMCCGA GGGLMWMEET TGNRINVART EQALAVNPTV ISSGCPYCLT
MLSDGTKAKE VEDRVFTYDV AELLAKSVFG EEKEEAAS