Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2921 |
Symbol | |
ID | 7977226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2946685 |
End bp | 2947905 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644799725 |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_002950865 |
Protein GI | 239828241 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01976] cysteine desulfurase family protein, VC1184 subfamily [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACGA AAGAGGTTCG GCAGCTGTTT CCGATTTTGG ACCAAGAAGT AAACGGGAAG CCGCTTATAT ATCTTGACAA TGCTGCGACA TCACAGAAAC CTCTTCCCGT CATTGAAGCG ATTGATCACT ATTATCGCCA GTACAATTCA AATGTCCATC GCGGCGTGCA TACGCTTGGT ACGAAAGCGA CGGACGCATA TGAAGGAGCA CGTGAAAAGG TACGACGGTT TATTAATGCG AAGTCCACAC AAGAAATCAT TTTTACAAGA GGAACGACAA CGGCGTTAAA TATGGTTGCA GCGAGCTATG GCCGCGCTAA TTTACAAGAA GGCGATGAAA TTGTCATCAC GTACATGGAG CACCATAGCA ATATTATTCC ATGGCAGCAA GTCGCTAAAC ATACGGGAGC CACATTAAAG TATATTCCGC TTCAGCCAGA TGGAACGATC GACCTAAAAG ATGTAGAAGC GGCGGTCACA TCCAATACAA AAATTGTCGC TATTGTACAT GTTTCCAACG TTTTAGGAAC GATTAACCCA GTAAAAGAAA TTGCAAAAAT CGCCCACAAA CATGGCGCTG TCATCGTAGT GGACGCCGCG CAGAGCGCTC CTCATATGAA AATCGATGTT CAAGATCTAG ATTGTGATTT TTTGGCGTTT TCTGGGCATA AAATGTGCGG TCCGACAGGA ATTGGCGTAT TATATGGTAA AAGGGAATTA TTAGAGAAGA TGGAACCGGT AGAGTTTGGC GGCGAAATGA TTGATTTCGT CGGCCTGTAC GAATCGACAT GGAAAGAGCT TCCGTGGAAA TTCGAGGGCG GTACTCCAAT TATTGCGGGA GCCATCGGTT TAGGAGCGGC AATCGACTTC CTTGAAGAAA TCGGATTAGA TAATATCGCT GCACATGAGC AGGAACTTGC CCAGTACGCT TTAGAACAGT TGTCCGCGAT AGAAGGGATC ACGATTTATG GTCCAAAACA TCGCGGAGGA TTAGTGACGT TTAACATAGA AGGAGTTCAT CCGCATGATG TGGCAACCGT CCTTGACGCG GAAGGGATCG CCGTCCGCGC CGGTCACCAT TGCGCCCAAC CGTTAATGAA ATGGTTAAAC GTAACAGCTA CAGCGCGTGC AAGTTTTTAC CTCTATAATA CAAAGGAAGA AATCGATCAA CTCGTAGTCG CATTACAGAA AACAAAGGAG TATTTTGGTC ATGTCTTCTA A
|
Protein sequence | MNTKEVRQLF PILDQEVNGK PLIYLDNAAT SQKPLPVIEA IDHYYRQYNS NVHRGVHTLG TKATDAYEGA REKVRRFINA KSTQEIIFTR GTTTALNMVA ASYGRANLQE GDEIVITYME HHSNIIPWQQ VAKHTGATLK YIPLQPDGTI DLKDVEAAVT SNTKIVAIVH VSNVLGTINP VKEIAKIAHK HGAVIVVDAA QSAPHMKIDV QDLDCDFLAF SGHKMCGPTG IGVLYGKREL LEKMEPVEFG GEMIDFVGLY ESTWKELPWK FEGGTPIIAG AIGLGAAIDF LEEIGLDNIA AHEQELAQYA LEQLSAIEGI TIYGPKHRGG LVTFNIEGVH PHDVATVLDA EGIAVRAGHH CAQPLMKWLN VTATARASFY LYNTKEEIDQ LVVALQKTKE YFGHVF
|
| |