Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B1798 |
Symbol | |
ID | 6796595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 1758577 |
End bp | 1759797 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642776032 |
Product | bifunctional cysteine desulfurase/selenocysteine lyase |
Protein accession | YP_002146666 |
Protein GI | 197251900 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0228411 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATTTC CTGTAGAAAA AGTACGGGCG GATTTTCCCA TACTGCAGCG TGAAGTTAAC GGCCTGCCGC TGGCTTACCT GGACAGCGCA GCCAGCGCTC AAAAACCCAA TCAGGTGATT GACGCTGAAT CTGCCTTCTA CCGTCACGGC TATGCTGCGG TACATCGAGG TATCCATACG TTAAGCGCAC AGGCGACCGA AAGCATGGAG AATGTGCGTA AGCAGGCGTC GCGGTTTATT AACGCCCGCT CCGCAGAAGA ACTGGTTTTC GTGCGCGGTA CGACGGAGGG CATTAACCTT GTCGCCAACA GTTGGGGAAC GGAAAATATT CGCGCCGGGG ATAACATTAT CATCAGCGAG ATGGAGCATC ACGCCAACAT CGTTCCCTGG CAGATGCTGT GCGAGCGCAA AGGCGCTGAA CTGCGCGTGA TCCCGTTGCA TCCTGACGGT ACGCTGCGGC TGGAGACCTT AGCTGCGCTG TTCGATGACC GGACCCGACT GCTGGCCATT ACCCATGTTT CCAATGTGCT GGGGACGGAA AACCCGCTGC CGGACATGAT TGCGCTGGCG CGCCAGCATG GGGCGAAAGT GCTGGTGGAT GGCGCCCAGG CCGTGATGCA CCATGCTGTT GACGTCCAGG CGCTGGACTG CGATTTTTAC GTTTTCTCCG GCCATAAACT TTACGGGCCG ACCGGCATCG GCATTCTGTA TGTTAAAGAG GCGTTGCTGC AAGAAATGCC GCCGTGGGAA GGGGGCGGGT CGATGATTTC GACCGTCAGC CTGACGCAGG GAACGACATG GGCGAAAGCG CCCTGGCGTT TTGAGGCGGG AACGCCGAAT ACTGGCGGCA TCATCGGTCT GGGCGCGGCG ATTGACTATG TGACGTCGCT GGGACTGGAT AAGATTGGCG ATTATGAGCA GATGCTGATG CGCTATGCGC TGGAGCAACT GGCGCAGGTG CCTGATATCA CGCTATATGG CCCGGCGCAG CGGTTGGGCG TCATCGCGTT TAATCTGGGT AACCACCATG CTTACGACGT CGGTAGCTTT CTTGATAATT ACGGTATCGC GGTACGAACG GGGCATCACT GCGCGATGCC GCTCATGGCC TGGTATGGCG TGCCAGCAAT GTGCCGGGCT TCGCTGGCGA TGTATAACAC CCATGAAGAA GTGGACCGAC TGGTGGCAGG ATTAACGCGT ATCCACCGCT TATTGGGATA A
|
Protein sequence | MTFPVEKVRA DFPILQREVN GLPLAYLDSA ASAQKPNQVI DAESAFYRHG YAAVHRGIHT LSAQATESME NVRKQASRFI NARSAEELVF VRGTTEGINL VANSWGTENI RAGDNIIISE MEHHANIVPW QMLCERKGAE LRVIPLHPDG TLRLETLAAL FDDRTRLLAI THVSNVLGTE NPLPDMIALA RQHGAKVLVD GAQAVMHHAV DVQALDCDFY VFSGHKLYGP TGIGILYVKE ALLQEMPPWE GGGSMISTVS LTQGTTWAKA PWRFEAGTPN TGGIIGLGAA IDYVTSLGLD KIGDYEQMLM RYALEQLAQV PDITLYGPAQ RLGVIAFNLG NHHAYDVGSF LDNYGIAVRT GHHCAMPLMA WYGVPAMCRA SLAMYNTHEE VDRLVAGLTR IHRLLG
|
| |