Gene Mlg_0728 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0728 
Symbol 
ID4268695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp810355 
End bp811620 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content69% 
IMG OID638125477 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_741572 
Protein GI114319889 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.88575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.887973 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGG TTGAACCGGT TCGCGACCGG GAACGGGCCG GGCCGCCGCT GGACGTGGCG 
GCCCTGCGCG CCCAGTTCCC CGTCCTGCAC CAACAGGTGA ACGGCTACCC GCTGGTCTAC
CTGGACAATG CCGCCAGCTG CCAGAAGCCG GAGGCGGTGA TCGAGGCCGA GGCGGCGTGT
TATCGCGAGT ACTACGCCAA CATCCACCGT GGGGTGCACG CCCTCTCCCA GCGCTGCACC
ACCGCCTTCG AGGGGGCCCG CGAGAAGGTG CAGCGGTTCC TGAACGCCGA GCGCGACGGG
GAGATCGTCT TCCTGCGCGG CACCACCGAG GCCATCAATT TGGTGGCCCA CAGCTACGTG
GAGCCCCTGT TGCAGCCAGG CGATGAGATC CTCATCAGCT ACCTGGAGCA CCACTCCAAC
ATCGTCCCCT GGCAGATGGT CTGCGAGCGC ACCGGGGCGG AGCTGCGGGT CATCCCGGTA
CAGGATAACG GCGAACTGGA CCTGGAGGCC TTCCAGGCCC TGCTCAGTGA TCGGACCCGC
TTCCTCTCCG TGGGGCACGT CTCCAATGCC CTGGGCACGG TGAACCCGGT GCGCTGGATG
ATTGAGCAGG CCCACGCCCG GGACATCCCC GTGCTGCTGG ACGGCGCCCA GGCGGTGCCT
CATGGCCCGG TGGATGTGCG CGAGCTGGAC TGCGACTTCT ACGCCTTCTC CGGGCACAAG
CTCTATGGGC CGACCGGGGT GGGGGTGCTC TACGGGCGCC ACGACCTGCT CAAGGGCATG
CGGCCCTGGC AGGGCGGTGG CGACATGATC CGCACCGTCA GCTTCGAGAA GACCCTCTAC
GCCGAACCGC CTGCCCGCTT CGAGGCAGGT ACGCCCAACA TCGCCGGCGC CATCGCCCTG
GGCGCGGCGG TGGACTGGGT GCAGGCGGTG GGCCTGGAGG CCATCGCCGC CCATGAGGCC
CGCCTGTTGG ATTACGCCAC CGAGCGGCTG GGTGCGTTGG AGGGGGTGCG GCTACTGGGC
ACGGCCCCGG ACAAGGCGGC GGTGCTCTCC TTCGTGATGG ACGAGGCCCA CCCCCACGAT
ATCGGCACCA TCCTCGACCA ACAGGGGGTC GCCATCCGCA CCGGGCACCA CTGTGCCGAG
CCGGTGATGA AACGCTTCAA CGTGCCGGCC ACCGCCCGCG CCTCCTTCGC GGCCTACAAC
ACCGAGGCCG AGGTGGATGC GCTGGTGGAG GGCGTTGAGA AGGTGCGCGA ACTGTTCGGC
GGCTGA
 
Protein sequence
MSTVEPVRDR ERAGPPLDVA ALRAQFPVLH QQVNGYPLVY LDNAASCQKP EAVIEAEAAC 
YREYYANIHR GVHALSQRCT TAFEGAREKV QRFLNAERDG EIVFLRGTTE AINLVAHSYV
EPLLQPGDEI LISYLEHHSN IVPWQMVCER TGAELRVIPV QDNGELDLEA FQALLSDRTR
FLSVGHVSNA LGTVNPVRWM IEQAHARDIP VLLDGAQAVP HGPVDVRELD CDFYAFSGHK
LYGPTGVGVL YGRHDLLKGM RPWQGGGDMI RTVSFEKTLY AEPPARFEAG TPNIAGAIAL
GAAVDWVQAV GLEAIAAHEA RLLDYATERL GALEGVRLLG TAPDKAAVLS FVMDEAHPHD
IGTILDQQGV AIRTGHHCAE PVMKRFNVPA TARASFAAYN TEAEVDALVE GVEKVRELFG
G