Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_2228 |
Symbol | |
ID | 8603565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 2614207 |
End bp | 2615472 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_003299832 |
Protein GI | 269126462 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000771215 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTCC TGCTGCCCGA GGAGATCAAA AAGGACTTCC CGCTGCTGGC GCGCACCGTC CGCGGCGGGC GTCCGCTGGT CTACCTCGAC TCCGGCGCCA CCTCCCAAAA GCCCTACCAG GTGCTGGACG CCGAACGGGA GTTCTACGAG CGGCACAACG CCGCCGTGCA CCGCGGCGCG CACCTGCTGG CCGAGGAGGC CACCGACGCC TTCGAACGCG CCCGCGCCAC CGTGGCCTCC TTCATCGGGG CGGCGCCCGG CGAGATCGTG TTCACCAAGA ACGCCACCGA GTCGATCAAC CTGGTGGCCT ACTCGCTGAG CAACGCCGCC ACCGCCGGCC CCGAGGCCGA ACGGTTCCGC GTCGGCCCCG GCGATGAGAT CGTCACCACG GAGATGGAGC ACCACGCCAA CCTGGTGCCC TGGCAGCAGC TGTGCCGCCG CACCGGGGCT GCGCTCCGCT GGTTCGGCAT CACCGACGAG GGCCGGCTGG ACCTGTCGAA CCTGGAGGAG CTGATCACCG AGCGCACCAA GCTGGTGGCA CTGACCCACC AGTCCAACGT GCTCGGCACC ATCCCCCCAC TGGAGCAGAT CGTCGCCCGG GCCCGCCAGG TCGGCGCGCT GGTGCTGCTG GACGCCGCCC AGTCGGTGCC GCACCAGCCG GTGGACGTGA CCGCCCTGGA CGTGGACTTC GTGGCCTTCT CCGGCCACAA GATGCTCGGC CCCTCGGGGA TCGGCGTGCT GTGGGGACGC CGCGAGCTGC TGGAGGCCAT GCCCCCGTTC ATCACCGGCG GGTCGATGAT CGAGGTGGTG CGCATGGAGG AGTCCACCTT CCTGCCGCCG CCCCAGCGCT TTGAGGCCGG GGTGCCGATG ACCGCCCAGG CGGTCGGGCT CGCCGCGGCC TGCGACTACC TGTCGGCCCT GGGCATGGAC AAGGTCCAGG CGCACGAGGA GACGCTCACC GGCTACGCCC TGGAGAAGCT CGGGCAGCTG CCCGGCGTGC GCATCATCGG CCCCCGCTCC ACCGAGGCGC GCGGCGGCGC GGTCTCGTTC GTGGTCGACG ACCTGCACCC GCACGACGTG GGACAGGTGC TGGACGAGCT GGGCGTGGCG GTCCGCGTCG GGCACCACTG CGCCTGGCCG ATCTGCCGCC GCTTCGGCAT CCCGGCGACC ACGCGGGCGA CCTTCTACGT CTACAACACC CTGGCGGACG TGGACGCCCT CGCCGAAGGG GTGCGGCACG CCCAGAAGTT CTTCGGAACA CTGTGA
|
Protein sequence | MSFLLPEEIK KDFPLLARTV RGGRPLVYLD SGATSQKPYQ VLDAEREFYE RHNAAVHRGA HLLAEEATDA FERARATVAS FIGAAPGEIV FTKNATESIN LVAYSLSNAA TAGPEAERFR VGPGDEIVTT EMEHHANLVP WQQLCRRTGA ALRWFGITDE GRLDLSNLEE LITERTKLVA LTHQSNVLGT IPPLEQIVAR ARQVGALVLL DAAQSVPHQP VDVTALDVDF VAFSGHKMLG PSGIGVLWGR RELLEAMPPF ITGGSMIEVV RMEESTFLPP PQRFEAGVPM TAQAVGLAAA CDYLSALGMD KVQAHEETLT GYALEKLGQL PGVRIIGPRS TEARGGAVSF VVDDLHPHDV GQVLDELGVA VRVGHHCAWP ICRRFGIPAT TRATFYVYNT LADVDALAEG VRHAQKFFGT L
|
| |