Gene Information |
Locus tag | SeSA_A1470 |
Symbol | |
ID | 6519127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 1420154 |
End bp | 1421374 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642746583 |
Product | bifunctional cysteine desulfurase/selenocysteine lyase |
Protein accession | YP_002114388 |
Protein GI | 194734883 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
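The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; the record does not state either convention explicitly, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length(1420154, 1421374)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Under these assumptions the result agrees with the listed gene length of 1221 bp and protein length of 406 aa.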
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.244975 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.102192 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTTC CTGTAGAAAA AGTACGGGCG GATTTTCCCA TACTGCAGCG TGAAGTTAAC GGCCTGCCGC TGGCTTACCT GGACAGCGCA GCCAGCGCTC AAAAACCCAA TCAGGTGATT GACGCTGAAT CTGCCTTCTA CCGTCACGGC TATGCTGCGG TACATCGAGG TATCCATACG TTAAGCGCAC AGGCGACCGA AAGCATGGAG AATGTGCGTA AGCAAGCGTC GCGGTTTATT AACGCCCGCT CCGCAGAAGA ACTGGTGTTC GTCCGCGGTA CGACGGAGGG CATTAACCTT GTCGCCAACA GTTGGGGAAC GGAAAATATT CGCGCCGGGG ATAACATTAT CATCAGCGAG ATGGAGCATC ATGCCAATAT CGTTCCCTGG CAGATGCTGT GTGAGCGCAA AGGCGCTGAA CTGCGCGTGA TCCCGTTGCA TCCTGACGGT ACGCTGCGGC TGGAGACCTT AGCTGCGCTG TTCGATGACC GGACTCGACT GCTGGCCATT ACCCATGTTT CCAATGTGCT GGGGACGGAA AACCCGCTGC CGGACATGAT TGCGCTGGCG CGCCAGCATG GGGCGAAAGT GCTGGTGGAT GGCGCCCAGG CCGTGATGCA TCATGCTGTT GACGTCCAGG CGCTGGACTG CGATTTTTAC GTTTTCTCCG GCCATAAACT TTACGGGCCG ACCGGCATCG GCATTCTGTA TGTTAAAGAG GTGTTGCTGC AAGAAATGCC GCCGTGGGAA GGGGGCGGGT CGATGATTTC GACCGTCAGC CTGACGCAGG GAACGACATG GGCGAAAGCG CCCTGGCGTT TTGAGGCGGG AACGCCGAAT ACTGGCGGCA TCATCGGTTT GGGCGCGGCA ATTGACTATG TGACGTCGCT GGGACTGGAT AAGATTGGCG ATTATGAGCA GATGCTGATG CGCTATGCGC TGGAGCAACT GGCGCAGGTG CCTGATATCA CGCTGTATGG TCCGGCGCAG CGGTTGGGCG TCATCGCGTT TAATCTGGGT AAACACCATG CTTATGACGT CGGCAGCTTT CTTGATAATT ACGGCATCGC GGTACGAACG GGGCATCACT GCGCGATGCC GCTCATGGCC TGGTATGGCG TGCCGGCAAT GTGCCGGGCT TCGCTGGCGA TGTATAACAC CCATGAAGAA GTGGACCGAC TGGTGGTAGG ATTAACGCGT ATCCACCGCT TATTGGGATA A
|
Protein sequence | MTFPVEKVRA DFPILQREVN GLPLAYLDSA ASAQKPNQVI DAESAFYRHG YAAVHRGIHT LSAQATESME NVRKQASRFI NARSAEELVF VRGTTEGINL VANSWGTENI RAGDNIIISE MEHHANIVPW QMLCERKGAE LRVIPLHPDG TLRLETLAAL FDDRTRLLAI THVSNVLGTE NPLPDMIALA RQHGAKVLVD GAQAVMHHAV DVQALDCDFY VFSGHKLYGP TGIGILYVKE VLLQEMPPWE GGGSMISTVS LTQGTTWAKA PWRFEAGTPN TGGIIGLGAA IDYVTSLGLD KIGDYEQMLM RYALEQLAQV PDITLYGPAQ RLGVIAFNLG KHHAYDVGSF LDNYGIAVRT GHHCAMPLMA WYGVPAMCRA SLAMYNTHEE VDRLVVGLTR IHRLLG
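The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a minimal sketch, not the method used by the database: it assumes the sequences are pasted as plain strings with the 10-character spacing shown in the record, and it uses the standard codon assignments, which translation table 11 shares for sense codons (table 11 differs in which start codons it allows). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second, third base each cycling through T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence, reading frame 1, codon by codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X marks an unrecognised codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


# First 30 bases of the gene sequence in the record above.
print(translate("ATGACATTTC CTGTAGAAAA AGTACGGGCG"))  # MTFPVEKVRA
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after removing its spaces) is a quick way to confirm that the two fields correspond, and `gc_content` on the same string can be compared with the reported GC content.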