Gene Information |
Locus tag | SNSL254_A1486 |
Symbol | |
ID | 6483082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 1455600 |
End bp | 1456820 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642736874 |
Product | bifunctional cysteine desulfurase/selenocysteine lyase |
Protein accession | YP_002040628 |
Protein GI | 194442460 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
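The coordinate and length fields above are related by simple arithmetic, which can serve as a sanity check on the record. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 1455600
END_BP = 1456820


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of an unspliced CDS, assuming one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, protein_length(length_bp))  # prints: 1221 406
```

Under these assumptions the computed values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.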
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.199299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.000000000797081 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACATTTC CTGTAGAAAA AGTACGGGCG GATTTTCCCA TACTGCAGCG TGAAGTTAAC GGCCTGCCGC TGGCTTACCT GGACAGCGCA GCCAGCGCTC AAAAACCTAA TCAGGTGATT GATGCTGAAT CTGCCTTCTA CCGTCACGGC TATGCTGCGG TACATCGGGG TATCCATACG TTAAGCGCGC AGGCGACCGA AAGCATGGAG AATGTGCGTA AGCAGGCGTC GCGGTTTATT AACGCCCGCT CCGCAGAAGA ACTGGTGTTC GTGCGCGGTA CGACGGAGGG CATTAACCTT GTCGCCAACA GTTGGGGAAC GGAAAATATT CGCGCCGGGG ATAACATTAT CATCAGCGAG ATGGAGCATC ACGCCAACAT CGTTCCCTGG CAGATGCTGT GCGAGCGCAA AGGCGCTGAA CTGCGCGTGA TCCCATTGCA TCCTGACGGT ACGCTGCGGC TGGAGACCTT AGCTGCGCTG TTCGATGACC GGACCCGACT GCTGGCCATT ACCCATGTTT CCAATGTGCT GGGGACGGAA AACCCACTGC CGGACATGAT TGCGCTGGCG CGCCAGCATG GGGCGAAAGT GCTGGTGGAT GGCGCCCAGG CCGTGATGCA CCATGCTGTT GACGTCCAGG CGCTGGACTG CGATTTTTAC GTTTTCTCCG GCCATAAACT TTACGGGCCG ACCGGCATCG GCATTCTGTA TGTTAAAGAG GCGTTGCTGC AAGAAATGCC GCCGTGGGAA GGGGGCGGGT CGATGATCTC GACCGTCAGC CTGACGCAGG GAACGACATG GGCGAAAGCG CCCTGGCGTT TTGAGGCGGG AACGCCGAAT ACTGGCGGCA TCATCGGTCT CGGCGCGGCG ATTGATTATG TGACGTCGCT GGGACTGGAT AAGATTGGCG ATTATGAGCA GATGCTGATG CGCTATGCGC TGGAGCAACT GGCGCAGGTG CCTGATATCA CGCTATATGG CCCGGCGCAG CGGTTGGGCG TCATCGCGTT TAATCTGGGT AAACACCACG CTTACGACGT CGGCAGCTTT CTTGATAATT ACGGTATCGC GGTACGAACA GGGCATCACT GCGCAATGCC GCTCATGGCC TGGTATGGCG TGCCGGCAAT GTGCCGGGCT TCGCTGGCGA TGTATAACAC CCATGAAGAA GTGGACCGAC TGGTGGCAGG ATTAACGCGT ATCCACCGCT TATTGGGATA A
|
Protein sequence | MTFPVEKVRA DFPILQREVN GLPLAYLDSA ASAQKPNQVI DAESAFYRHG YAAVHRGIHT LSAQATESME NVRKQASRFI NARSAEELVF VRGTTEGINL VANSWGTENI RAGDNIIISE MEHHANIVPW QMLCERKGAE LRVIPLHPDG TLRLETLAAL FDDRTRLLAI THVSNVLGTE NPLPDMIALA RQHGAKVLVD GAQAVMHHAV DVQALDCDFY VFSGHKLYGP TGIGILYVKE ALLQEMPPWE GGGSMISTVS LTQGTTWAKA PWRFEAGTPN TGGIIGLGAA IDYVTSLGLD KIGDYEQMLM RYALEQLAQV PDITLYGPAQ RLGVIAFNLG KHHAYDVGSF LDNYGIAVRT GHHCAMPLMA WYGVPAMCRA SLAMYNTHEE VDRLVAGLTR IHRLLG
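The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way the blocks could be joined, a GC fraction computed, and codons translated. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in the set of permitted start codons). All helper names are hypothetical, and only the first 30 bases of the gene sequence are used as sample input.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_blocks = "ATGACATTTC CTGTAGAAAA AGTACGGGCG"  # first 30 bases of the record
    print(translate(first_blocks))  # prints: MTFPVEKVRA
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.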