Gene GYMC61_3072 details

Gene Information

Locus tag: GYMC61_3072
Symbol:
ID: 8526957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 3126489
End bp: 3127709
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 58%
IMG OID:
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_003254113
Protein GI: 261420431
COG category:
COG ID:
TIGRFAM ID:
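
The start, end, gene length, and protein length fields can be checked against one another. The sketch below assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon not counted in the protein length; the record states neither convention explicitly, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields.
START_BP = 3126489
END_BP = 3127709


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1221 406
```

Under these assumptions the computed values agree with the listed 1221 bp and 406 aa.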


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGTGA ACGAAATTCG CGCGTTGTTT CCGATTTTGC ATCAGGACGT CAACGGCCAT 
CCGCTCGTCT ATTTTGACAG CGCGGCGACG TCGCAAAAGC CGCTGCCGGT GATTGAGGCG
CTTGACCGCT ACTACCGCGA GTACAACTCG AACGTCCACC GCGGCGTCCA TACGCTCGGG
ACGAAGGCGA CCGACGCGTA CGAAGGCGCG CGCGAAAAAG TGCGGCGGTT TTTAAACGCC
CAATCGGCGC AGGAAATCAT CTTTACGCGC GGCACAACCG CTGCGCTCAA CTTGGTCGCT
GCAAGCTACG GGCGCGCCAA TGTCAAAGAA GGCGACGAGA TCGTCATCAC GTACATGGAG
CATCACAGCA ACTTAATCCC ATGGCAGCAG CTGGCGAAAC AAACGGGCGC AACGCTGAAA
TACATTCCGC TGCAGGAAGA CGGCACGATC GATTTGCGCG ACGTTGAGGC GACCATCACC
AAAGCGGCGA AGATCGTCGC CATCGCCCAT GTGTCCAACG TGCTCGGGAC GATCAACCCG
GTGCGGGAGA TCGCCCGCAT CGCCCATGAG CGCGGGGCGG TCGTCGTCGT CGATGCGGCG
CAAAGCGCTC CGCATATGAA GGTCGATGTT CAGGAACTTG ATTGCGATTT TCTCGCCCTT
TCCGGCCATA AAATGTGCGG GCCGACGGGA ATCGGCGTAT TATATGGCAA AAAGAAATGG
CTTGAGCAGA TGGAGCCGAT CGAGTTCGGC GGCGAAATGA TCGATTTTGT CGAGCTGTAC
GACTCGACGT GGAAAGAGCT GCCGTGGAAG TTTGAAGGCG GCACGCCGAT CATTGCCGGG
GCGATTGGCC TTGGCGCAGC GATCGATTTC CTTGAACAAG TGGGCTTGGA CGCCATCGCC
GCCCATGAGC ATGAGCTGGC GCAATATGCG CTTAGCCGAA TGGCGGACAT CGAAGGCGTC
ACCGTCTATG GCCCGAAAGA GCGGGCGGGG CTTGTCACGT TCAACATCGA CGGGGTGCAT
CCGCACGATG TGGCGACGGT CCTTGACGCC GAAGGAATCG CCATCCGCGC CGGCCACCAT
TGCGCCCAGC CGCTCATGAA ATGGCTCGGC GTGACGGCGA CCGCCCGGGC GAGCTTTTAC
CTTTACAATA CCAAAGAGGA AATCGACGCA TTCATCGCCG CATTACAGAA AGCGAAGGAG
TACTTCAGCC ATGTCTTCTA A
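
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any calculation. The following is a minimal sketch of how the GC content field could be recomputed from such a block; the helper names are hypothetical and the demonstration uses only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, as a small demonstration.
example = clean_sequence("ATGAATGTGA ACGAAATTCG")
print(len(example), gc_percent(example))
```

Applying the same two functions to the full 1221-base sequence gives a value that can be compared with the 58% listed under GC content.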
 
Protein sequence
MNVNEIRALF PILHQDVNGH PLVYFDSAAT SQKPLPVIEA LDRYYREYNS NVHRGVHTLG 
TKATDAYEGA REKVRRFLNA QSAQEIIFTR GTTAALNLVA ASYGRANVKE GDEIVITYME
HHSNLIPWQQ LAKQTGATLK YIPLQEDGTI DLRDVEATIT KAAKIVAIAH VSNVLGTINP
VREIARIAHE RGAVVVVDAA QSAPHMKVDV QELDCDFLAL SGHKMCGPTG IGVLYGKKKW
LEQMEPIEFG GEMIDFVELY DSTWKELPWK FEGGTPIIAG AIGLGAAIDF LEQVGLDAIA
AHEHELAQYA LSRMADIEGV TVYGPKERAG LVTFNIDGVH PHDVATVLDA EGIAIRAGHH
CAQPLMKWLG VTATARASFY LYNTKEEIDA FIAALQKAKE YFSHVF
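
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do this with a hand-built codon table; it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table, and it assumes translation starts at the first base of the CDS. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG. The names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence.
print(translate("ATGAATGTGA ACGAAATTCG C"))  # prints: MNVNEIR
```

The output matches the first seven residues of the protein sequence, and translating the last codons of the gene (TACTTCAGCC ATGTCTTCTA A) in the same way yields the listed C-terminus YFSHVF.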