Gene Information |
Locus tag | Mlg_1243 |
Symbol | |
ID | 4269027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1447898 |
End bp | 1449118 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638125993 |
Product | O-succinylhomoserine sulfhydrylase |
Protein accession | YP_742082 |
Protein GI | 114320399 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases |
TIGRFAM ID | [TIGR01325] O-succinylhomoserine sulfhydrylase |
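
The coordinates and lengths reported above are mutually consistent, which can be checked with a short sketch. It assumes 1-based, inclusive coordinates (as the reported 1221 bp implies) and a complete CDS ending in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Mlg_1243
length = gene_length(1447898, 1449118)
print(length, protein_length(length))  # 1221 406
```

Both results match the Gene Length and Protein Length fields of the record.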
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGATC AGGGCGATTC CCACCACGAC GATCCGCGCC ACCCGAGCCA CTGGGCGCCC GCCACCCGGG CGGTGCGGGC GGGACAGACC CGCGGCCTGG AGCAGGAGCA GAGCGAGCCC ATCTACGCCA GTTCCAGCTT CACCTACCGT AGTGCTGCCG AGGCGGCGGC ACGTTTCTCC GGCGAGAGCC CGGGGAATAT CTACTCGCGC TTTACCAACC CCACGGTCCG GACCTTCGAG CAGCGGCTCG CCGCCCTGGA GGGGGCCGAG GCCTGCGTGG CGACCGCCTC CGGGATGTCG GCGGTGCTGG CCGCTACCCT GGGGCTCCTG CGGGCCGGCG ACCATATTGT CGCCTCCAGC GGCCTGTTCG GGGCCACGGC CTCGCTGTTC GCCAATTACC TCCCGCGCTA TGGGATCGAG GTCACCACGG TCCCGCTCAC CGACCTCCAG GCCTGGTCGG ACGCCATGCG TCCGCAGACC CGCATGCTCT TCCTGGAGAC GCCGTCCAAC CCGCTGACCG AGGTGGCGGA TATCGCCGCG CTGGCGGACC TGGCCCGGGG CCAGGGGGCG TGGCTGGCGG TGGACAACTG CTTCTGCACC CCGGCCCTGC AGCGGCCGCT GGAGCTCGGC GCGGATCTGG TCATTCACTC GGCGACCAAG TATCTGGACG GTCAGGGGCG GTGCATCGGC GGGGCGGTGT GCGGCGATGC CCAGGTGGTG GGCGAACAGG TTTTCGGCTT CCTGCGCACG GCCGGGCCGA GCATGAGCCC GTTCAACGCC TGGGTGTTCC TCAAGGGGCT GGAGACCCTG GCCCTGCGGA TGGAGGCTCA CTGCCGGAGT GCCTCGGAGC TGGCCCACTG GCTGGAGGAG CACCCGGCCG TGGAGCGGGT GTTCTATCCC GGGCTCGCCC GCCATCCCCA GCACGCCCTG GCGGCGCGCC AGCAGTCCGC CTTTGGCGGT ATCGTCAGCT TCGAGGTGCG GGGCGGGCGC GACGCGGCCT GGCGGGTGAT TGACAATACC CGGTTGCTCT CCATCACCGC GAACCTGGGC GACGCCAAGA GCACCATCAC CCACCCGGCG ACCACCACTC ACGGCCGCAA CACACCGGAG CAACGGGCGG CGGCGGGGAT TACGGAATCG CTGGTCCGGG TGTCGGTGGG GCTGGAGGCG GTGGCGGATA TCCGCGCTGA TCTGGACGCG GCGTTGTCGG CCCTGGCGTA G
|
Protein sequence | MSDQGDSHHD DPRHPSHWAP ATRAVRAGQT RGLEQEQSEP IYASSSFTYR SAAEAAARFS GESPGNIYSR FTNPTVRTFE QRLAALEGAE ACVATASGMS AVLAATLGLL RAGDHIVASS GLFGATASLF ANYLPRYGIE VTTVPLTDLQ AWSDAMRPQT RMLFLETPSN PLTEVADIAA LADLARGQGA WLAVDNCFCT PALQRPLELG ADLVIHSATK YLDGQGRCIG GAVCGDAQVV GEQVFGFLRT AGPSMSPFNA WVFLKGLETL ALRMEAHCRS ASELAHWLEE HPAVERVFYP GLARHPQHAL AARQQSAFGG IVSFEVRGGR DAAWRVIDNT RLLSITANLG DAKSTITHPA TTTHGRNTPE QRAAAGITES LVRVSVGLEA VADIRADLDA ALSALA
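
The relationship between the two sequences above can be sketched in a few lines: strip the spaces that break the gene sequence into 10-base blocks, then translate codon by codon. This sketch assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first three and last two blocks of the gene sequence are used here for brevity.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11)
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace separating 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record
print(translate("ATGAGTGATC AGGGCGATTC CCACCACGAC"))  # MSDQGDSHHD
# Last 15 bases of the gene sequence, ending in the TAG stop codon
print(translate("TCGGCCCTGGCGTAG"))  # SALA
```

Applying `translate` and `gc_fraction` to the full gene sequence would be one way to check the protein sequence and the GC content field against the record.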