Gene SeD_A1969 details


Gene Information

Locus tag: SeD_A1969
Symbol:
ID: 6872221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1900608
End bp: 1901828
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 57%
IMG OID: 642785088
Product: bifunctional cysteine desulfurase/selenocysteine lyase
Protein accession: YP_002215754
Protein GI: 198243608
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
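
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the reported values.

```python
# Values copied from the Gene Information fields above.
START_BP = 1900608
END_BP = 1901828

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = END_BP - START_BP + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1221 0 406
```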


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0908482
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0000000737026
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACATTTC CTGTAGAAAA AGTACGGGCG GATTTTCCCA TACTGCAGCG TGAAGTTAAC 
GGCCTGCCGC TGGCTTACCT GGACAGCGCA GCCAGCGCTC AAAAACCTAA TCAGGTGATT
GATGCTGAAT CTGCCTTCTA CCGTCACGGC TATGCTGCGG TACATCGAGG TATCCATACG
TTAAGCGTGC AGGCGACCGA AAGCATGGAG AATGTGCGTA AGCAGGCGTC GCGGTTTATT
AACGCCCGCT CCGCAGAAGA ACTGGTGTTC GTGCGCGGTA CGACGGAGGG CATTAACCTT
GTCGCCAACA GTTGGGGAAC GGAAAATATT CGCGCCGGGG ATAACATTAT CATCAGCGAG
ATGGAGCATC ACGCCAACAT CGTTCCCTGG CAGATACTGT GCGAGCGCAA AGGCGCTGAA
CTGCGCGTGA TCCCGTTGCA TCCTGACGGT ACGCTGCGGC TGGAGACCTT AGCTGCGCTG
TTCGATGACC GGACCCGGCT GCTGGCCATT ACCCATGTTT CCAATGTGCT GGGGACGGAA
AACCCACTGC CGGACATGAT TGCGTTGGCG CGCCAGCATG GGGCGAAAGT GCTGGTGGAT
GGCGCCCAGG CCGTGATGCA CCATGCTGTT GACGTCCAGG CGCTGGACTG CGATTTTTAC
GTTTTCTCCG GCCATAAACT TTACGGGCCG ACCGGCATCG GCATTCTGTA TGTTAAAGAG
GCGTTGCTGC AAGAAATGCC GCCGTGGGAA GGGGGCGGGT CGATGATCTC GACCGTCAGC
CTGACGCAGG GAACGACATG GGCGAAAGCG CCCTGGCGTT TTGAGGCGGG AACGCCGAAT
ACTGGCGGCA TCATCGGTCT CGGCGCGGCG ATTGACTATG TGACGTCGCT GGGACTGGAT
AAGATTGGCG ATTATGAGCA GATGCTGATG CGCTATGCGC TGGAGCAACT GGCGCAGGTG
CCTGATATCA CGCTGTATGG TCCGGCGCAG CGATTGGGCG TCATCGCGTT TAATCTGGGT
AAACACCATG CTTACGACGT CGGCAGCTTT CTTGATAATT ACGGTATCGC GGTACGAACG
GGACATCACT GCGCAATGCC GCTCATGGCC TGGTATGGCG TGCCGGCAAT GTGCCGGGCT
TCGCTGGCGA TGTATAACAC CCATGAAGAA GTGGACCGAC TGGTGGCAGG ATTAACGCGT
ATCCACCGCT TATTGGGATA A
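
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. A minimal sketch of one way to do that and to compute a GC percentage like the one reported in the record follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence, as printed above.
fragment = clean_sequence("ATGACATTTC CTGTAGAAAA")
print(len(fragment), gc_percent(fragment))  # prints: 20 30.0
```

Passing the full pasted sequence through `clean_sequence` and then `gc_percent` would give a figure to compare with the GC content field.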
 
Protein sequence
MTFPVEKVRA DFPILQREVN GLPLAYLDSA ASAQKPNQVI DAESAFYRHG YAAVHRGIHT 
LSVQATESME NVRKQASRFI NARSAEELVF VRGTTEGINL VANSWGTENI RAGDNIIISE
MEHHANIVPW QILCERKGAE LRVIPLHPDG TLRLETLAAL FDDRTRLLAI THVSNVLGTE
NPLPDMIALA RQHGAKVLVD GAQAVMHHAV DVQALDCDFY VFSGHKLYGP TGIGILYVKE
ALLQEMPPWE GGGSMISTVS LTQGTTWAKA PWRFEAGTPN TGGIIGLGAA IDYVTSLGLD
KIGDYEQMLM RYALEQLAQV PDITLYGPAQ RLGVIAFNLG KHHAYDVGSF LDNYGIAVRT
GHHCAMPLMA WYGVPAMCRA SLAMYNTHEE VDRLVAGLTR IHRLLG
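
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is an illustration, not the method used to produce this record. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks and the final line of the gene sequence above.
print(translate("ATGACATTTC CTGTAGAAAA AGTACGGGCG"))  # prints: MTFPVEKVRA
print(translate("ATCCACCGCT TATTGGGATA A"))           # prints: IHRLLG
```

The two outputs match the first ten and last six residues of the protein sequence listed above.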