Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2532 |
Symbol | |
ID | 7979111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 2561823 |
End bp | 2562962 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644799333 |
Product | cysteine desulfurase |
Protein accession | YP_002950493 |
Protein GI | 239827869 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTATC TTGACTATGC CGCTACAACA CCGATGAGCA AAGAAGCGCT AAATGCCTAT GTAGAAGCAG CTGACGCTTA CTTTGGAAAT GCAAGCAGTT TACACGATAT CGGAAGCAAT GCCGAACGAC TGTTAACTAT TTGCCGAAAA GAACTTGCCA CGCTTATTGG CGGCGAGGAA CGGGGTATTT ATTTTACAAG CGGGGGAACA GAATCCAATA TTCTTGCTAT TCGTTCGCTT ATTAACGCAC ATCGCCACCG CGGAAATCAT CTCATAACAA CCGAGATTGA ACATGCTTCT CTTTATCATC TATTCCAACA ATTAGAAAAA GAAGGATTTG AGGTCACTTA TTTACCGGTT AACCGCTTTG GACAAATTGA TATCAGTGAT TTACAACGGG CGATTACACC AAAAACGATT CTCGCCTCCA TTCAACACGC TAATTCAGAA ATCGGAACAA TACAGCCAAT TGCCGAAATT GGACGGCTGC TTCGGCGTCA TGGGGTAATC TTTCATAGCG ATTGTGTCCA AACTTTTGGT AAAATACGAA TTGATGTGAA AAAGATGTTT ATCGACAGCC TTTCGATTTC TGCCCATAAA ATTTACGGAC CAAAAGGGGT CGGTGCGGTA TATATTGACC CACGCATCCA TTGGCAGCCA TGTTTTCCTA ACGCCACCCA TGAAGATGGA TTTCGTCCAG GGACAGTCAA TGTGCCTGGA ATCGCTTCTT TTATCACCGC AGCACAGCAC ATTTGCGAAA ATATAGATGC CGAACAAACT CGTTTCGAGC AGCTGCGCCG TTACTTGCTG GCGCTTATTC GTGAAAAAAG CTTGCCTGTC ACGGCAGAGG GACATCCTGA CGTTCACCTG CCAAATATTA TCGGTCTTTC TGTTCATGGA ATCGAAGGCC AATATGTCAT GCTGGAATGT AACCGTGATA ACATCGCCAT TTCCACTGGA AGTGCGTGTC AAATTGGAAA ACAAGCTCCT TCGCGGACGA TGTTGGCAGT TGGAAAATCT GTAGAAGAAG CAAAACAGTT TATACGTGTT TCATTCGGGA AATGGACGAC GGAAAAAGAC ATTGACCAAC TTGTGTCTTC CCTTGAGCGA ATCAGTCAGC AAAGAAAGGA GTTTGAATGA
|
Protein sequence | MIYLDYAATT PMSKEALNAY VEAADAYFGN ASSLHDIGSN AERLLTICRK ELATLIGGEE RGIYFTSGGT ESNILAIRSL INAHRHRGNH LITTEIEHAS LYHLFQQLEK EGFEVTYLPV NRFGQIDISD LQRAITPKTI LASIQHANSE IGTIQPIAEI GRLLRRHGVI FHSDCVQTFG KIRIDVKKMF IDSLSISAHK IYGPKGVGAV YIDPRIHWQP CFPNATHEDG FRPGTVNVPG IASFITAAQH ICENIDAEQT RFEQLRRYLL ALIREKSLPV TAEGHPDVHL PNIIGLSVHG IEGQYVMLEC NRDNIAISTG SACQIGKQAP SRTMLAVGKS VEEAKQFIRV SFGKWTTEKD IDQLVSSLER ISQQRKEFE
|
| |