Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3980 |
Symbol | |
ID | 5835593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4423765 |
End bp | 4425021 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641369771 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_001641422 |
Protein GI | 163853379 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.970806 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCAC CCGTCCGTAC CACTGCCCAG GACTCGTCCT ACGACGTCGA AGCGATCCGG AAGGAATTCC CGATCCTGTC GGAAAAGGTC TACGGCAAAC CGCTGGTCTA TCTCGACAAC GCCGCCTCGA CGCAGAAGCC GCGGGCCGTG ATCGACGCGA TGGTCTCCTG CATGGAGACC GGCTACGCCA ACGTGCACCG TGGCCTGCAC TACATGGCCA ATGCCGCGAC CGAAGGGTTC GAGGGCGCGC GCGAGACCAC GCGCCAGTTC CTCAACGCGG CCTCGACCGA CGAGATCATC TTCACCCGCA ACGCGACCGA GGCCTACAAC CTCGTGGCCT CCTCCATGGG CTGGGCCGGG CTGATCGGGG AGGGGGACGA GATCATCCTC TCGATCATGG AGCACCACTC CAACATCGTG CCCTGGCATT TCCTGCGCGA GCGGCGCGGC GCCGTCATCA AGTGGGCGCC GGTCGATGAC GACGGCAACT TCCTGGTCGA GGAATACGAA AAGCTCTTCA CGCCGCGCAC CAAGATGGTG GCGATCACCC ACATGTCGAA CGTGCTCGGC ACGGTGACGC CGGCCGAGGA GATCGTGCGC ATCGCCCATG CCCACGGCGT GCCGGTGCTG CTCGACGGGG CGCAGAGCGC GGTGCACCGC CCGGTCGATG TGCGGGCGCT CGATTGCGAC TTCTTCGTCT TCACCGGCCA CAAGGTCTAC GGGCCGACCG GCATCGGCGT GCTCTACGGC AAGAAGGAGT GGCTCGACCG TCTGCCGCCC TACCAGGGCG GCGGCGAGAT GATCCGCACG GTGAGCCAGG ACGCGATCAC CTACAACGAT CCGCCCCACC GCTTCGAGGC GGGCACGCCG GCGATCATCG AGGCGGTCGG CCTCGGCGCG GCGCTGGAAT TCATGATGAA GCTCGGCCGC GACAAGATCG CCGCGCACGA GGCGATGCTG ACCGCCTACG CCCAGGAGCG GCTCGGCGCG ATGAATTCGA TCCGCCAGAT CGGCAATTCC CGCGACAAGG GCGGCGTCAT CGCCTTCGAG GTGAAGGGCG CGCACGCCCA CGACATCGCC ACCGTGATCG ACCACCAGGG CGTGGCGGTA CGGGCCGGCA CCCACTGCGC GATGCCGTTG CTGACGCGCT TCGGTGTCAC CTCGACCTGT CGCGCCTCGT TCGGTCTGTA TAATACGACG CAGGAAATCG ATGTCCTGGC CGCGGCTCTG GCCAAGGCCG AGATGCTGTT CGCCTGA
|
Protein sequence | MNAPVRTTAQ DSSYDVEAIR KEFPILSEKV YGKPLVYLDN AASTQKPRAV IDAMVSCMET GYANVHRGLH YMANAATEGF EGARETTRQF LNAASTDEII FTRNATEAYN LVASSMGWAG LIGEGDEIIL SIMEHHSNIV PWHFLRERRG AVIKWAPVDD DGNFLVEEYE KLFTPRTKMV AITHMSNVLG TVTPAEEIVR IAHAHGVPVL LDGAQSAVHR PVDVRALDCD FFVFTGHKVY GPTGIGVLYG KKEWLDRLPP YQGGGEMIRT VSQDAITYND PPHRFEAGTP AIIEAVGLGA ALEFMMKLGR DKIAAHEAML TAYAQERLGA MNSIRQIGNS RDKGGVIAFE VKGAHAHDIA TVIDHQGVAV RAGTHCAMPL LTRFGVTSTC RASFGLYNTT QEIDVLAAAL AKAEMLFA
|
| |