Gene Information |
Locus tag | Mpop_4460 |
Symbol | |
ID | 6313702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium populi BJ001 |
Kingdom | Bacteria |
Replicon accession | NC_010725 |
Strand | + |
Start bp | 4756512 |
End bp | 4757792 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642653140 |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_001927094 |
Protein GI | 188583649 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
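The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Mpop_4460.
length_bp = gene_length(4756512, 4757792)
length_aa = protein_length(length_bp)

print(length_bp)  # 1281
print(length_aa)  # 426
```

Both results agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields listed in the record.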
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.572328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.26876 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCCGG ACATCAAAGG ATCCAAGGTC GCGACCATGA ACGCACCCGT TCCCGCCACC CCCTACGACG TCGAGGCGAT CCGGGCCGAA TTCCCGATCC TCTCGGAAAA GGTCTACGGC AAGCCGCTGG TCTATCTCGA CAACGCGGCC TCGGCGCAGA AGCCCAAGGC GGTGATCGAC GCCATGGTCT CCTGCATGGA GACCGGCTAC GCCAACGTCC ATCGCGGCCT GCACTACATG GCCAACGCCG CGACCGAGGG GTTCGAGGGC GCGCGCGAGA CCACGCGTCA GTTCCTCAAT GCCCGCTCCA CCGACGAGAT CATCTTCACC CGCAACGCGA CGGAGGCCTA CAACCTCGTC GCATCCTCGA TGGGCTGGGC CGGGCTGATC GGGGAGGGGG ACGAGATCAT CCTCTCGATC ATGGAGCACC ACTCCAACAT CGTGCCCTGG CACTTTCTGC GCGAGCGGCG CGGGGCCGTG ATCAAGTGGG CGCCGGTCGA TGACGACGGC AACTTCCTCG TCGAGGAATA CGAGAGGCTG TTCACCCCCC GCACCAAGAT GGTGGCGCTC ACCCACATGT CGAACGTGCT CGGCACGGTG ACGCCGGGGG AGGAGATCGT GCGCATCGCC CACGCCCACG GCGTGCCGGT GCTGCTCGAT GGGGCGCAGA GCGCGGTTCA CCGCCCGATC GACGTGCAGG CGCTCGATTG CGACTTCTTC GTCTTCACCG GCCACAAGGT CTACGGCCCG ACCGGGATCG GCGTGCTCTA CGGCAAGAAG GAATGGCTCG ACCGCCTGCC GCCCTACCAG GGCGGCGGCG AGATGATCCG CACGGTGAGC CAGGATTCGA TCACCTACAA CGACCCGCCC CACCGCTTCG AGGCGGGCAC GCCGGCGATC ATCGAGGCGG TCGGCCTCGG CGCCGCGCTC GAATTCATGA TGAAGCTCGG GCGCGAGAAC ATCGCCGCGC ACGAGGCGAT GCTGACGGCC TACGCGCAGG AGCGGCTCGG CGCGATGAAT TCGATCCGCC AGATCGGCAA TTCCCGGAAC AAGGGCGGCG TCATCGCCTT CGAGGTGAAG GGTGCGCATG CCCACGACAT CGCCACCGTG ATCGACCGGC AGGGCGTGGC CGTGCGGGCG GGCACGCACT GCGCCATGCC GCTGCTGACC CGCTTCGGTG TGACCTCCAC CTGTCGCGCC TCGTTCGGCC TGTATAATAC GACCCATGAG ATCGACGCGC TCGCCGAGGC CCTGGCCAAG GCCGAGATGC TGTTCGCGTA A
|
Protein sequence | MDPDIKGSKV ATMNAPVPAT PYDVEAIRAE FPILSEKVYG KPLVYLDNAA SAQKPKAVID AMVSCMETGY ANVHRGLHYM ANAATEGFEG ARETTRQFLN ARSTDEIIFT RNATEAYNLV ASSMGWAGLI GEGDEIILSI MEHHSNIVPW HFLRERRGAV IKWAPVDDDG NFLVEEYERL FTPRTKMVAL THMSNVLGTV TPGEEIVRIA HAHGVPVLLD GAQSAVHRPI DVQALDCDFF VFTGHKVYGP TGIGVLYGKK EWLDRLPPYQ GGGEMIRTVS QDSITYNDPP HRFEAGTPAI IEAVGLGAAL EFMMKLGREN IAAHEAMLTA YAQERLGAMN SIRQIGNSRN KGGVIAFEVK GAHAHDIATV IDRQGVAVRA GTHCAMPLLT RFGVTSTCRA SFGLYNTTHE IDALAEALAK AEMLFA
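The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its GC value describes that fragment and not the whole gene.

```python
def normalize(seq_field: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence in the record (a fragment only).
fragment = "ATGGACCCGG ACATCAAAGG"
seq = normalize(fragment)

print(seq)               # ATGGACCCGGACATCAAAGG
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.55
```

Applying `normalize` to the full Gene sequence field and passing the result to `gc_fraction` would give a value to compare against the GC content field of the record.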