Gene Information |
Locus tag | Mlg_0728 |
Symbol | |
ID | 4268695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 810355 |
End bp | 811620 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638125477 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_741572 |
Protein GI | 114319889 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
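The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the record states neither convention explicitly, although the numbers are consistent with both. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 810355
END_BP = 811620
GENE_LENGTH_BP = 1266
PROTEIN_LENGTH_AA = 421


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record: 811620 - 810355 + 1 = 1266,
    # and 1266 / 3 - 1 = 421.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

The same two checks apply to any unspliced, non-pseudo CDS record laid out this way; a spliced gene would need its exon spans summed instead.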
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.88575 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.887973 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGG TTGAACCGGT TCGCGACCGG GAACGGGCCG GGCCGCCGCT GGACGTGGCG GCCCTGCGCG CCCAGTTCCC CGTCCTGCAC CAACAGGTGA ACGGCTACCC GCTGGTCTAC CTGGACAATG CCGCCAGCTG CCAGAAGCCG GAGGCGGTGA TCGAGGCCGA GGCGGCGTGT TATCGCGAGT ACTACGCCAA CATCCACCGT GGGGTGCACG CCCTCTCCCA GCGCTGCACC ACCGCCTTCG AGGGGGCCCG CGAGAAGGTG CAGCGGTTCC TGAACGCCGA GCGCGACGGG GAGATCGTCT TCCTGCGCGG CACCACCGAG GCCATCAATT TGGTGGCCCA CAGCTACGTG GAGCCCCTGT TGCAGCCAGG CGATGAGATC CTCATCAGCT ACCTGGAGCA CCACTCCAAC ATCGTCCCCT GGCAGATGGT CTGCGAGCGC ACCGGGGCGG AGCTGCGGGT CATCCCGGTA CAGGATAACG GCGAACTGGA CCTGGAGGCC TTCCAGGCCC TGCTCAGTGA TCGGACCCGC TTCCTCTCCG TGGGGCACGT CTCCAATGCC CTGGGCACGG TGAACCCGGT GCGCTGGATG ATTGAGCAGG CCCACGCCCG GGACATCCCC GTGCTGCTGG ACGGCGCCCA GGCGGTGCCT CATGGCCCGG TGGATGTGCG CGAGCTGGAC TGCGACTTCT ACGCCTTCTC CGGGCACAAG CTCTATGGGC CGACCGGGGT GGGGGTGCTC TACGGGCGCC ACGACCTGCT CAAGGGCATG CGGCCCTGGC AGGGCGGTGG CGACATGATC CGCACCGTCA GCTTCGAGAA GACCCTCTAC GCCGAACCGC CTGCCCGCTT CGAGGCAGGT ACGCCCAACA TCGCCGGCGC CATCGCCCTG GGCGCGGCGG TGGACTGGGT GCAGGCGGTG GGCCTGGAGG CCATCGCCGC CCATGAGGCC CGCCTGTTGG ATTACGCCAC CGAGCGGCTG GGTGCGTTGG AGGGGGTGCG GCTACTGGGC ACGGCCCCGG ACAAGGCGGC GGTGCTCTCC TTCGTGATGG ACGAGGCCCA CCCCCACGAT ATCGGCACCA TCCTCGACCA ACAGGGGGTC GCCATCCGCA CCGGGCACCA CTGTGCCGAG CCGGTGATGA AACGCTTCAA CGTGCCGGCC ACCGCCCGCG CCTCCTTCGC GGCCTACAAC ACCGAGGCCG AGGTGGATGC GCTGGTGGAG GGCGTTGAGA AGGTGCGCGA ACTGTTCGGC GGCTGA
|
Protein sequence | MSTVEPVRDR ERAGPPLDVA ALRAQFPVLH QQVNGYPLVY LDNAASCQKP EAVIEAEAAC YREYYANIHR GVHALSQRCT TAFEGAREKV QRFLNAERDG EIVFLRGTTE AINLVAHSYV EPLLQPGDEI LISYLEHHSN IVPWQMVCER TGAELRVIPV QDNGELDLEA FQALLSDRTR FLSVGHVSNA LGTVNPVRWM IEQAHARDIP VLLDGAQAVP HGPVDVRELD CDFYAFSGHK LYGPTGVGVL YGRHDLLKGM RPWQGGGDMI RTVSFEKTLY AEPPARFEAG TPNIAGAIAL GAAVDWVQAV GLEAIAAHEA RLLDYATERL GALEGVRLLG TAPDKAAVLS FVMDEAHPHD IGTILDQQGV AIRTGHHCAE PVMKRFNVPA TARASFAAYN TEAEVDALVE GVEKVRELFG G
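The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch, with hypothetical helper names, of how the gene sequence might be cleaned and then checked against the GC content and translation table fields of this record. It assumes a complete CDS starts with ATG (translation table 11 also permits alternative start codons such as GTG and TTG, which this sketch does not test for) and ends with one of the table 11 stop codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Join the ten-character blocks by dropping all whitespace; uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3, starts with ATG, ends with a stop."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # Only the first two blocks of the gene sequence, for illustration.
    prefix = clean_sequence("ATGAGCACGG TTGAACCGGT")
    print(len(prefix), gc_fraction(prefix))
```

Passing the full gene sequence string through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field, which the record reports rounded to a whole percent.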