Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1236 |
Symbol | |
ID | 4027614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1415260 |
End bp | 1416501 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637966414 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_573290 |
Protein GI | 92113362 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.321654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCCC TTTCCGAGCC GATGCTGGAC GTGGCCCGTG TCCGCCGTGA CTTTCCGATT CTCGAGCGGG AAGTGCATGG CAAGCCGCTG ATCTATCTGG ATAATGCTGC GACCACCCAG ACGCCGTCAG CGGTCATCGA GACGCTGGAT GATTATTACC GGCGTTACAA CGCCAATATT CATCGCGGGC TGCATACCCT GGCCGACGAG GCCACGGCAG CCTATGAAGG TACGCGCGAC AAGGTTCGCG CGTGGCTGGG GGCGGCATCG AGTCGCGAAA TCATCTTCAC CCGGGGCACC ACCGAGGCCA TCAACCTGGT CGCCAATAGC TGGGGGCGGG CCAACCTGCG TCCGGGGGAC GAGATTCTGG TCTCCTTGAT GGAGCACCAC TCCAATATCG TGCCCTGGCA GATGCTGGCC GAGGCACTGG ATGTGACGCT CAAGGTGATC CCCGTCGATG AGCGCGGCGT GCTCGACCAG GCGGCCTATC GTGAATTGCT GGGCGAGCGT ACGCGGCTGG TGTGCGTCAA TCACGTCTCC AATGCCCTGG GGACCGTCAA TCCAGTGGCC GAGATGGCGC GTGAGGCCCA TGCCCACGAT GCTCTGATTC TCGTCGACGG TGCCCAGGCG GTACCGCATC AGCGTGTCGA CGTGCATGCG CTGGGTGTCG ATTTCTACGC GTTCTCGGGG CACAAGATGT ACGGGCCCAC CGGGGTGGGC GTGCTCTACG GCCGCGAAGC GCTGCTCGAA GCGATGCCGC CCTGGCAGGG GGGCGGCGAG ATGATCAGCC GTGTGTCCTT CGATCAGGGC ACGGTATACA GCGACATACC GCACAAGTTC GAGGCGGGCA CGCCGGCCAT CGCCGAAGTG ATCGCCCTCG GCGCGGCACT GGACTGGGTC GCCGCCACCG GCATCGACAT GATGGCGGCC TGGGAAGGGC AGCTTCTCGA GCGCGCCACG GCCAAGTTGC GCGATGTCGA GGGGCTGCGT CTGATCGGGA CGGCGCCCGA CAAGGTCAGC GTGCTGTCGT TCGTGGTCGA TGGCGTGCAC GCCCAGGATA TCGGCTTGCT GATCGATCAA CTGGGCGTGG CCATACGTAC CGGGCATCAC TGCGCGCAAC CCGTCCTGGC CAGCATGGGG CTCGAGGCAA CCTGTCGCGC GTCCCTGGCG GCGTACAATA CGCCCGAAGA GGTCGACGCG TTCGTCGAGG CCCTGGAGAA AGTCATCGCC ATGGTGCGCT AA
|
Protein sequence | MNALSEPMLD VARVRRDFPI LEREVHGKPL IYLDNAATTQ TPSAVIETLD DYYRRYNANI HRGLHTLADE ATAAYEGTRD KVRAWLGAAS SREIIFTRGT TEAINLVANS WGRANLRPGD EILVSLMEHH SNIVPWQMLA EALDVTLKVI PVDERGVLDQ AAYRELLGER TRLVCVNHVS NALGTVNPVA EMAREAHAHD ALILVDGAQA VPHQRVDVHA LGVDFYAFSG HKMYGPTGVG VLYGREALLE AMPPWQGGGE MISRVSFDQG TVYSDIPHKF EAGTPAIAEV IALGAALDWV AATGIDMMAA WEGQLLERAT AKLRDVEGLR LIGTAPDKVS VLSFVVDGVH AQDIGLLIDQ LGVAIRTGHH CAQPVLASMG LEATCRASLA AYNTPEEVDA FVEALEKVIA MVR
|
| |