Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1249 |
Symbol | |
ID | 4076364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1342754 |
End bp | 1343974 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 638006557 |
Product | cysteine desulfurase |
Protein accession | YP_613244 |
Protein GI | 99081090 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.491768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGATG TCGAAAAAAT CCGCTCTGAC TTTCCGATCC TGTCACGCGA AGTGAACGGA AAGCCGCTGG TCTATCTCGA TAACGGGGCC TCTGCGCAGA AACCTCAGGT TGTGATCGAT GCCGTGACCA AGGCATACGC GGAAGAATAT TCAAATGTTC ACCGTGGCTT GCATTATCTT TCTAATCTCG CGACTGAGAA ATACGAAAGC GTACGCGGCA TCATCGCAAA GTTTTTGAAC GCGGGTCACG AAGATCACAT CGTGCTGAAT TCCGGAACCA CAGAGGGTAT CAACCTCGTG GCGTATTCCT GGGCCATGCC GCGGATGGAA GCCGGCGATG AGATCATCCT CTCCGTCATG GAGCACCATG CGAACATCGT GCCCTGGCAT TTCCTGCGCG AACGCCAAGG TGTCGTCATC AAATGGATCG ACACGGACGC GGACGGAAGC CTGGACCCGC AAAAGGTTCT GGACGCCATT ACACCAAAAA CCAAGCTGAT CGCAGTCACC CAGTGTTCCA ATGTCTTGGG AACCGTTGTT GATGTAAAAT CCATCACAAA AGGCGCGCAT GACAAAGGCG TGCCAGTGCT TGTCGACGGC TCTCAGGGCG CGGTTCACAT GCCCGTGGAT GTGCAGGATC TCGACTGTGA TTTCTATGCC GTCACCGGGC ACAAGCTTTA TGGTCCGTCT GGGTCCGGCG CGATCTATAT CAAGCCGGAG CGTATGGCCG AGATGCGTCC GTTTATTGGC GGCGGCGATA TGATCAAAGA AGTGTCCAAG GATCAGGTGA TCTACAACGA TCCGCCGATG AAGTTCGAGG CTGGTACGCC AGGGATCGTG CAGACGATCG GTTTCGGCGT CGCGCTCGAA TATATGATGG AGATCGGAAT GGCAGAAATT GCCGCCCATG AGGCCGATCT GCGAGACTAT GCATCGGAGC GTTTCAAGGG GTTGAACTGG TTGAATATTC AGGGCCATGC TCCTGGAAAA GCAGCGATAT TCAGTCTGAC CCTTGAGGGC GCGGCACATG CGCATGACAT TTCAACCATT CTCGACAAGA AAGGTGTCGC GGTGCGTGCC GGGCATCATT GTGCGGGGCC TTTGATGGAT CATCTTGGCG TCTCTGCAAC TTGCCGCGCG AGCTTTGGCA TGTACAACAC GCGTCCAGAG GTAGATACGT TGATTGAGGC GCTAGAACTC GCACATGAGC TTTTTGGCTA G
|
Protein sequence | MYDVEKIRSD FPILSREVNG KPLVYLDNGA SAQKPQVVID AVTKAYAEEY SNVHRGLHYL SNLATEKYES VRGIIAKFLN AGHEDHIVLN SGTTEGINLV AYSWAMPRME AGDEIILSVM EHHANIVPWH FLRERQGVVI KWIDTDADGS LDPQKVLDAI TPKTKLIAVT QCSNVLGTVV DVKSITKGAH DKGVPVLVDG SQGAVHMPVD VQDLDCDFYA VTGHKLYGPS GSGAIYIKPE RMAEMRPFIG GGDMIKEVSK DQVIYNDPPM KFEAGTPGIV QTIGFGVALE YMMEIGMAEI AAHEADLRDY ASERFKGLNW LNIQGHAPGK AAIFSLTLEG AAHAHDISTI LDKKGVAVRA GHHCAGPLMD HLGVSATCRA SFGMYNTRPE VDTLIEALEL AHELFG
|
| |