Gene SNSL254_A1486 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1486 
Symbol 
ID6483082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1455600 
End bp1456820 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content57% 
IMG OID642736874 
Productbifunctional cysteine desulfurase/selenocysteine lyase 
Protein accessionYP_002040628 
Protein GI194442460 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.199299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.000000000797081 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACATTTC CTGTAGAAAA AGTACGGGCG GATTTTCCCA TACTGCAGCG TGAAGTTAAC 
GGCCTGCCGC TGGCTTACCT GGACAGCGCA GCCAGCGCTC AAAAACCTAA TCAGGTGATT
GATGCTGAAT CTGCCTTCTA CCGTCACGGC TATGCTGCGG TACATCGGGG TATCCATACG
TTAAGCGCGC AGGCGACCGA AAGCATGGAG AATGTGCGTA AGCAGGCGTC GCGGTTTATT
AACGCCCGCT CCGCAGAAGA ACTGGTGTTC GTGCGCGGTA CGACGGAGGG CATTAACCTT
GTCGCCAACA GTTGGGGAAC GGAAAATATT CGCGCCGGGG ATAACATTAT CATCAGCGAG
ATGGAGCATC ACGCCAACAT CGTTCCCTGG CAGATGCTGT GCGAGCGCAA AGGCGCTGAA
CTGCGCGTGA TCCCATTGCA TCCTGACGGT ACGCTGCGGC TGGAGACCTT AGCTGCGCTG
TTCGATGACC GGACCCGACT GCTGGCCATT ACCCATGTTT CCAATGTGCT GGGGACGGAA
AACCCACTGC CGGACATGAT TGCGCTGGCG CGCCAGCATG GGGCGAAAGT GCTGGTGGAT
GGCGCCCAGG CCGTGATGCA CCATGCTGTT GACGTCCAGG CGCTGGACTG CGATTTTTAC
GTTTTCTCCG GCCATAAACT TTACGGGCCG ACCGGCATCG GCATTCTGTA TGTTAAAGAG
GCGTTGCTGC AAGAAATGCC GCCGTGGGAA GGGGGCGGGT CGATGATCTC GACCGTCAGC
CTGACGCAGG GAACGACATG GGCGAAAGCG CCCTGGCGTT TTGAGGCGGG AACGCCGAAT
ACTGGCGGCA TCATCGGTCT CGGCGCGGCG ATTGATTATG TGACGTCGCT GGGACTGGAT
AAGATTGGCG ATTATGAGCA GATGCTGATG CGCTATGCGC TGGAGCAACT GGCGCAGGTG
CCTGATATCA CGCTATATGG CCCGGCGCAG CGGTTGGGCG TCATCGCGTT TAATCTGGGT
AAACACCACG CTTACGACGT CGGCAGCTTT CTTGATAATT ACGGTATCGC GGTACGAACA
GGGCATCACT GCGCAATGCC GCTCATGGCC TGGTATGGCG TGCCGGCAAT GTGCCGGGCT
TCGCTGGCGA TGTATAACAC CCATGAAGAA GTGGACCGAC TGGTGGCAGG ATTAACGCGT
ATCCACCGCT TATTGGGATA A
 
Protein sequence
MTFPVEKVRA DFPILQREVN GLPLAYLDSA ASAQKPNQVI DAESAFYRHG YAAVHRGIHT 
LSAQATESME NVRKQASRFI NARSAEELVF VRGTTEGINL VANSWGTENI RAGDNIIISE
MEHHANIVPW QMLCERKGAE LRVIPLHPDG TLRLETLAAL FDDRTRLLAI THVSNVLGTE
NPLPDMIALA RQHGAKVLVD GAQAVMHHAV DVQALDCDFY VFSGHKLYGP TGIGILYVKE
ALLQEMPPWE GGGSMISTVS LTQGTTWAKA PWRFEAGTPN TGGIIGLGAA IDYVTSLGLD
KIGDYEQMLM RYALEQLAQV PDITLYGPAQ RLGVIAFNLG KHHAYDVGSF LDNYGIAVRT
GHHCAMPLMA WYGVPAMCRA SLAMYNTHEE VDRLVAGLTR IHRLLG