Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0624 |
Symbol | |
ID | 4270606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 673717 |
End bp | 674874 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638125371 |
Product | aminotransferase, class V |
Protein accession | YP_741468 |
Protein GI | 114319785 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.00292698 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGCAC CAACCCCGAT CTACCTGGAC TACAACGCCA GCACGCCCAT CGCCCCCGAG GTAGCCAAGG CCATGGCCCC CTACCTGCAG GAGGCCTATG GCAACCCCTC CGCCGGTCAC TGGGCCGGCG GTCCGGCCCG TGAGGCGGTG GAGCAGGCGC GCCGGCAGGT GGCCCGGCTG ATCGGGGCGG CGCCGGACGA GATCGTCTTC ACCAGCGGCG GCAGCGAGGC GAACAACCAC GCCATCAAGG GCACCTGGTA CGCCACAGAG GGGCCATTTC ACATCATCAC CACGGCAGTG GAACACCCGG CCACCCTGGT CCCCTGCCGT TTCCTTGAAT CCCTCGGTGC TAGCCTGACC GTACTCCCCG TGGACCGATA TGGGCAGGTG AACCCGGACG CGGTCCAGGC GGCCATCACC CCCGAGACCC GCCTGATCAG CGTCATGCAC GCCAACAACG AGGTGGGCAC CCTGCAGCCG GTGGAGGCCA TCGGCCGCAT CGCCCGCGAC CACGGTGTCC GCTTCCACGT GGACGCGGCG CAATCGGCCG GCAAGGTGCC CATCAACGTC CAGGCCATGG GCGTGGATCT GCTCTCGCTC GCCGGCCACA AGTTCTACGG CCCCAAGGGC ATCGGCGCGC TCTACGTCCG TCGCGGCATC GACCTGACGC CGCTGATCCA CGGCGCCGGT CACGAGGGGG GGCGCCGCGC CGGTACCGAG AGCGCGCTGC TCGCCACGGG GCTGGGCACC GCCGCCGAGA AGGCGCGTGA CCTCAGCCCC ATGGCTCGGG TTCAGGCGCT GCGCGACCGG CTCTGGACGG GCCTGAAGGG CCATTTCGGT GATACCCTTT GCCTGAACGG TCACCCTCAG GCCCGTCTGC CCAACACCCT GAACGTCGCC TTCGCCGACT GTGTAGGCGC GGCCATCCTG GACCGGCTCG ACGGTGTCGC CGCCTCCACG GGCTCCGCCT GCCACGCCGG CTCTGTCACG CTCTCACCGG TGCTGGCCGC CATGGGGGTC CCGGAGCGGG TGGGCATGGG CGCACTCCGG TTCAGCCTGG GCCGCTGGAC CACGGAACAG GAGATCGACG AGGTTATCGC AAGGCTCGCC CGGGCGGTCC CCCAGGCCCG GGCCGCCACC ACACAGGAAC CATCATGA
|
Protein sequence | MSAPTPIYLD YNASTPIAPE VAKAMAPYLQ EAYGNPSAGH WAGGPAREAV EQARRQVARL IGAAPDEIVF TSGGSEANNH AIKGTWYATE GPFHIITTAV EHPATLVPCR FLESLGASLT VLPVDRYGQV NPDAVQAAIT PETRLISVMH ANNEVGTLQP VEAIGRIARD HGVRFHVDAA QSAGKVPINV QAMGVDLLSL AGHKFYGPKG IGALYVRRGI DLTPLIHGAG HEGGRRAGTE SALLATGLGT AAEKARDLSP MARVQALRDR LWTGLKGHFG DTLCLNGHPQ ARLPNTLNVA FADCVGAAIL DRLDGVAAST GSACHAGSVT LSPVLAAMGV PERVGMGALR FSLGRWTTEQ EIDEVIARLA RAVPQARAAT TQEPS
|
| |