Gene Information |
Locus tag | SeD_A1969 |
Symbol | |
ID | 6872221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 1900608 |
End bp | 1901828 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642785088 |
Product | bifunctional cysteine desulfurase/selenocysteine lyase |
Protein accession | YP_002215754 |
Protein GI | 198243608 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
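The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that Gene Length includes the stop codon; the record does not state either convention, but the numbers are consistent with both. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1900608
END_BP = 1901828
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Because the gene is unspliced (Is gene spliced: No), a single contiguous span is enough; a spliced gene would need the exon lengths summed instead.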
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0908482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0000000737026 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACATTTC CTGTAGAAAA AGTACGGGCG GATTTTCCCA TACTGCAGCG TGAAGTTAAC GGCCTGCCGC TGGCTTACCT GGACAGCGCA GCCAGCGCTC AAAAACCTAA TCAGGTGATT GATGCTGAAT CTGCCTTCTA CCGTCACGGC TATGCTGCGG TACATCGAGG TATCCATACG TTAAGCGTGC AGGCGACCGA AAGCATGGAG AATGTGCGTA AGCAGGCGTC GCGGTTTATT AACGCCCGCT CCGCAGAAGA ACTGGTGTTC GTGCGCGGTA CGACGGAGGG CATTAACCTT GTCGCCAACA GTTGGGGAAC GGAAAATATT CGCGCCGGGG ATAACATTAT CATCAGCGAG ATGGAGCATC ACGCCAACAT CGTTCCCTGG CAGATACTGT GCGAGCGCAA AGGCGCTGAA CTGCGCGTGA TCCCGTTGCA TCCTGACGGT ACGCTGCGGC TGGAGACCTT AGCTGCGCTG TTCGATGACC GGACCCGGCT GCTGGCCATT ACCCATGTTT CCAATGTGCT GGGGACGGAA AACCCACTGC CGGACATGAT TGCGTTGGCG CGCCAGCATG GGGCGAAAGT GCTGGTGGAT GGCGCCCAGG CCGTGATGCA CCATGCTGTT GACGTCCAGG CGCTGGACTG CGATTTTTAC GTTTTCTCCG GCCATAAACT TTACGGGCCG ACCGGCATCG GCATTCTGTA TGTTAAAGAG GCGTTGCTGC AAGAAATGCC GCCGTGGGAA GGGGGCGGGT CGATGATCTC GACCGTCAGC CTGACGCAGG GAACGACATG GGCGAAAGCG CCCTGGCGTT TTGAGGCGGG AACGCCGAAT ACTGGCGGCA TCATCGGTCT CGGCGCGGCG ATTGACTATG TGACGTCGCT GGGACTGGAT AAGATTGGCG ATTATGAGCA GATGCTGATG CGCTATGCGC TGGAGCAACT GGCGCAGGTG CCTGATATCA CGCTGTATGG TCCGGCGCAG CGATTGGGCG TCATCGCGTT TAATCTGGGT AAACACCATG CTTACGACGT CGGCAGCTTT CTTGATAATT ACGGTATCGC GGTACGAACG GGACATCACT GCGCAATGCC GCTCATGGCC TGGTATGGCG TGCCGGCAAT GTGCCGGGCT TCGCTGGCGA TGTATAACAC CCATGAAGAA GTGGACCGAC TGGTGGCAGG ATTAACGCGT ATCCACCGCT TATTGGGATA A
|
Protein sequence | MTFPVEKVRA DFPILQREVN GLPLAYLDSA ASAQKPNQVI DAESAFYRHG YAAVHRGIHT LSVQATESME NVRKQASRFI NARSAEELVF VRGTTEGINL VANSWGTENI RAGDNIIISE MEHHANIVPW QILCERKGAE LRVIPLHPDG TLRLETLAAL FDDRTRLLAI THVSNVLGTE NPLPDMIALA RQHGAKVLVD GAQAVMHHAV DVQALDCDFY VFSGHKLYGP TGIGILYVKE ALLQEMPPWE GGGSMISTVS LTQGTTWAKA PWRFEAGTPN TGGIIGLGAA IDYVTSLGLD KIGDYEQMLM RYALEQLAQV PDITLYGPAQ RLGVIAFNLG KHHAYDVGSF LDNYGIAVRT GHHCAMPLMA WYGVPAMCRA SLAMYNTHEE VDRLVAGLTR IHRLLG
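The relationship between the two sequence fields can be illustrated with a small standard-library translator. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is already shown in coding orientation (it begins with ATG and ends with TAA, although the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code, differing only in which codons may act as starts. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first position slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon and stop at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    Alternative start codons are not converted to M; that is not needed here
    because the gene starts with ATG.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
GENE_PREFIX = "ATGACATTTC CTGTAGAAAA AGTACGGGCG"
print(translate(GENE_PREFIX))  # MTFPVEKVRA
```

Passing the full Gene sequence value to `translate` and `gc_percent` gives results that can be compared against the Protein sequence and GC content fields.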
|
| |