Gene Information |
Locus tag | Mfla_0617 |
Symbol | |
ID | 4000847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacillus flagellatus KT |
Kingdom | Bacteria |
Replicon accession | NC_007947 |
Strand | + |
Start bp | 647798 |
End bp | 649663 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637937517 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_544728 |
Protein GI | 91774972 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
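The coordinate and length fields above are mutually consistent, which a minimal sketch can check. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinates: count both ends
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the final codon is the stop, not a residue
    return gene_len, protein_len


# Values taken from the record for Mfla_0617.
gene_len, protein_len = cds_lengths(647798, 649663)
print(gene_len, protein_len)  # 1866 621
```

The result agrees with the listed Gene Length (1866 bp) and Protein Length (621 aa).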
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000358817 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACAA GTAACATTCA CGACCTGTCA TCGTTTGAGC CCGAAGCCAT CCTGGCAGCA TTCCCGGGCG AGCATCCGCG CATGGCCGCG GCCATTGTGG TGGGGCGGGA GGCGCAGGCA AGCGGGCATC TGGATATTGA CGAGCTGACC CGTATTGCCA ATGAAATCTA TGCCGAAGGT TTTCCGCATG GTGCGCCAGA ACTTCCTTCC ACGGCATCAT TGACGGATTC CGCTGTGCCA TATGCTGCCG GTAGCCCGGG AATAGGGGCA CCCGTGCCGC CCGTCCCTTC CGCAGTGTCT GCATTGCCGG CGGGCGCGCC CGCTGCGGCC AATGTCGTTC CCGGTATTGA GCTTGCGCAG GTCACGCCCC CCAGCCTCGG GGTGTCGCCG TCCCGGCTAC CGCCGGAATT CAGCCAGGAG CGGGTTGAGG CCGTTGGGCC TGCGCCTGCC TTCCCAGGGC TGGATATCAG CAGTTTGCTT GGCGGCGTGG ATATCCCGTA TCCAGTCGAT CTGCAAGGTT TGCTGACTTT ACCTGTCAAT AGCCTGTACC CGCATGGCTC GGGAAATCCA GGCGCTGGCA ATGCATCGCC TTACTATTTT GTCGCGGAGC GTCCCGGCAT ACCGGATGTA GCAGCTTCGG CCCATCCGCC GTTCGACGTC AGTCTGGTGC GCAAGGATTT TCCTATCCTG AGTGAGTTGG TGAACGGCAG GCCGTTGATC TGGCTGGATA ACGCTGCGAC GACACAGAAG CCGCAGGCGG TGATCGACCG CCTGTCCTAT TTCTATCAGC ACGAGAATTC CAATATCCAT CGTGCAGCGC ATGAGCTAGC GGCGCGTGCG ACTGATGCCT ATGAAGGCGC GCGTGAAACG GTACGCCGTT TCATCAATGC GCCTGCGGTG GAAAACATCG TGTTTGTGCG CGGCACGACC GAGGCCATCA ACCTGGTGGC CAAGACCTGG GGCAGGAAGC ATCTGGGCGA GGGCGACGAG GTGGTGGTGA CCCACCTGGA GCACCATGCC AATATCGTGC CTTGGCAGCA GTTGTGCCAG GAGACCGGGG CCAAGCTCAA AGTGGTGCCG GTGGACGATA CTGGACAGGT GCTGCTCGAC GAATATCAGA AATTACTGAC TTCGCGCACC AAGCTGGTTT CCTTCACCCA GGTTTCCAAT GCTCTCGGCA CCGTGACGCC GGCAAGGGAG ATGATCGCCA TGGCGCATGC CGTGGGTGCC AAGGTGCTGT TAGATGGCGC GCAATCAGTC TCGCACATGC GTACGGACGT GCAATCCCTG GATTGCGATT TCCTGGTATT CTCGGGCCAC AAGATATTTG GCCCGACAGG CATCGGGGCG TTGTACGGCA AGGCTGAGGT GCTGGACGAC ATGCCCGCCT GGCAAGGTGG CGGCAATATG ATCCAGGACG TCACTTTCGA GAAGACGGTA TATCATGGCG CGCCTGCCAA GTTCGAGGCG GGAACCGGCA ATATCGCCGA TGCGGTGGGC ATGGGAGCTG CGCTCGAGTA CGTGGAGCGA TTGGGTATAG AGAACATTGC GCGTTATGAG CACGAGCTGT TGGTCTATGC GACCGCCGCG CTGAACAAGG TGCCGGGGCT GCGCTTGATC GGCACCGCCC AGCACAAGGC CAGCGTGCTC TCGTTCAATC TGCAGGGCTA CAAGAGTGAA GAGGTCGGGG CTGCATTGAA TCAGGAAGGT ATTGCGGTAC GCTCTGGCCA CCATTGCGCG CAACCGATAC TGCGACGCTT CGGTGTGGAA GCCACGGTCA GGCCCTCCCT GGCTTTCTAC 
AATACTTATG CCGAGGTGGA TGTCTTGCAG GGGGTGCTGT CCAAACTGGC TTCCGCGAAA CGCTGA
Protein sequence | MTTSNIHDLS SFEPEAILAA FPGEHPRMAA AIVVGREAQA SGHLDIDELT RIANEIYAEG FPHGAPELPS TASLTDSAVP YAAGSPGIGA PVPPVPSAVS ALPAGAPAAA NVVPGIELAQ VTPPSLGVSP SRLPPEFSQE RVEAVGPAPA FPGLDISSLL GGVDIPYPVD LQGLLTLPVN SLYPHGSGNP GAGNASPYYF VAERPGIPDV AASAHPPFDV SLVRKDFPIL SELVNGRPLI WLDNAATTQK PQAVIDRLSY FYQHENSNIH RAAHELAARA TDAYEGARET VRRFINAPAV ENIVFVRGTT EAINLVAKTW GRKHLGEGDE VVVTHLEHHA NIVPWQQLCQ ETGAKLKVVP VDDTGQVLLD EYQKLLTSRT KLVSFTQVSN ALGTVTPARE MIAMAHAVGA KVLLDGAQSV SHMRTDVQSL DCDFLVFSGH KIFGPTGIGA LYGKAEVLDD MPAWQGGGNM IQDVTFEKTV YHGAPAKFEA GTGNIADAVG MGAALEYVER LGIENIARYE HELLVYATAA LNKVPGLRLI GTAQHKASVL SFNLQGYKSE EVGAALNQEG IAVRSGHHCA QPILRRFGVE ATVRPSLAFY NTYAEVDVLQ GVLSKLASAK R