Gene GWCH70_2921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2921 
Symbol 
ID7977226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2946685 
End bp2947905 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content45% 
IMG OID644799725 
Productcysteine desulfurase, SufS subfamily 
Protein accessionYP_002950865 
Protein GI239828241 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01976] cysteine desulfurase family protein, VC1184 subfamily
[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACGA AAGAGGTTCG GCAGCTGTTT CCGATTTTGG ACCAAGAAGT AAACGGGAAG 
CCGCTTATAT ATCTTGACAA TGCTGCGACA TCACAGAAAC CTCTTCCCGT CATTGAAGCG
ATTGATCACT ATTATCGCCA GTACAATTCA AATGTCCATC GCGGCGTGCA TACGCTTGGT
ACGAAAGCGA CGGACGCATA TGAAGGAGCA CGTGAAAAGG TACGACGGTT TATTAATGCG
AAGTCCACAC AAGAAATCAT TTTTACAAGA GGAACGACAA CGGCGTTAAA TATGGTTGCA
GCGAGCTATG GCCGCGCTAA TTTACAAGAA GGCGATGAAA TTGTCATCAC GTACATGGAG
CACCATAGCA ATATTATTCC ATGGCAGCAA GTCGCTAAAC ATACGGGAGC CACATTAAAG
TATATTCCGC TTCAGCCAGA TGGAACGATC GACCTAAAAG ATGTAGAAGC GGCGGTCACA
TCCAATACAA AAATTGTCGC TATTGTACAT GTTTCCAACG TTTTAGGAAC GATTAACCCA
GTAAAAGAAA TTGCAAAAAT CGCCCACAAA CATGGCGCTG TCATCGTAGT GGACGCCGCG
CAGAGCGCTC CTCATATGAA AATCGATGTT CAAGATCTAG ATTGTGATTT TTTGGCGTTT
TCTGGGCATA AAATGTGCGG TCCGACAGGA ATTGGCGTAT TATATGGTAA AAGGGAATTA
TTAGAGAAGA TGGAACCGGT AGAGTTTGGC GGCGAAATGA TTGATTTCGT CGGCCTGTAC
GAATCGACAT GGAAAGAGCT TCCGTGGAAA TTCGAGGGCG GTACTCCAAT TATTGCGGGA
GCCATCGGTT TAGGAGCGGC AATCGACTTC CTTGAAGAAA TCGGATTAGA TAATATCGCT
GCACATGAGC AGGAACTTGC CCAGTACGCT TTAGAACAGT TGTCCGCGAT AGAAGGGATC
ACGATTTATG GTCCAAAACA TCGCGGAGGA TTAGTGACGT TTAACATAGA AGGAGTTCAT
CCGCATGATG TGGCAACCGT CCTTGACGCG GAAGGGATCG CCGTCCGCGC CGGTCACCAT
TGCGCCCAAC CGTTAATGAA ATGGTTAAAC GTAACAGCTA CAGCGCGTGC AAGTTTTTAC
CTCTATAATA CAAAGGAAGA AATCGATCAA CTCGTAGTCG CATTACAGAA AACAAAGGAG
TATTTTGGTC ATGTCTTCTA A
 
Protein sequence
MNTKEVRQLF PILDQEVNGK PLIYLDNAAT SQKPLPVIEA IDHYYRQYNS NVHRGVHTLG 
TKATDAYEGA REKVRRFINA KSTQEIIFTR GTTTALNMVA ASYGRANLQE GDEIVITYME
HHSNIIPWQQ VAKHTGATLK YIPLQPDGTI DLKDVEAAVT SNTKIVAIVH VSNVLGTINP
VKEIAKIAHK HGAVIVVDAA QSAPHMKIDV QDLDCDFLAF SGHKMCGPTG IGVLYGKREL
LEKMEPVEFG GEMIDFVGLY ESTWKELPWK FEGGTPIIAG AIGLGAAIDF LEEIGLDNIA
AHEQELAQYA LEQLSAIEGI TIYGPKHRGG LVTFNIEGVH PHDVATVLDA EGIAVRAGHH
CAQPLMKWLN VTATARASFY LYNTKEEIDQ LVVALQKTKE YFGHVF