Gene SeD_A2217 details

Gene Information

Locus tag: SeD_A2217
Symbol:
ID: 6873085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2122191
End bp: 2123243
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 51%
IMG OID: 642785319
Product: hypothetical protein
Protein accession: YP_002215982
Protein GI: 198243375
COG category: [R] General function prediction only
COG ID: [COG1054] Predicted sulfurtransferase
TIGRFAM ID:
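
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2122191, 2123243)
print(length)                  # 1053
print(protein_length(length))  # 350
```

Both values agree with the Gene Length and Protein Length fields listed in this record.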


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value: 0.598434
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGTGT TACACAACCG CATCTCTAAT GACGAGCTGA AAGCCAAAAT GCTGGCGGAA 
AGCGAGCCGC GTACGACAAT TTCTTTTTAT AAATATTTCA CTATCGCCTC GCCGCAGCAG
ACGCGGGACG CGTTGTATCA GGTGTTTACG GCGTTGGACG TTTTTGGTCG TGTTTACCTG
GCGCATGAAG GCATCAATGC GCAAATCAGC GTGCCGCAAA GCAAGGTCGA GACCTTTCGT
CAACAGCTTT ATACGTTCGA CCCCGCGCTG GACGGGCTGC GTTTAAATAT CGCGCTGGAG
GATGACGGAA AGTCATTTTG GGTGCTGCGT ATGAAAGTTC GCGACCGTAT CGTCGCTGAC
GGAATTGACG ATCCGACTTT TGACGCCAGT AATGTCGGCG ATTATCTGAA GGCGGCAGAT
GTGAATGCGA TGCTGGACGA TCCTGATGCG GTCTTTATTG ATATGCGCAA CCACTATGAG
TATGAAGTCG GCCATTTCGA AAATGCTCTG GAAATCCCGG CGGATACGTT TCGTGAACAG
TTGCCAAAAG CGGTTGAAAT GCTGCGGGAA CATGCAGATA AAAAGATAGT GATGTACTGT
ACCGGCGGTA TTCGTTGTGA GAAAGCCAGC GCCTGGATGA AACACAACGG TTTCAATAAA
GTCTGGCATA TTGAGGGTGG CATCATTGAG TACGCCCGTC GCGCGCGCGA GCAGGGGCTT
CCCGTTCGCT TTATCGGCAA AAACTTTGTA TTTGATGAGC GAATGGGCGA ACGAATCTCG
GATGAGGTTA TCGCGCATTG CCATCAGTGC GGCGCGCCCT GCGATAGCCA TACCAACTGC
AAAAATGACG GTTGCCATCT GCTGTTTATC CAGTGTCCGC AGTGCGCCAG TAAATTTAAC
GGCTGCTGTA GTGAACAATG CTGTGAAGAG TTGGCCTTGC CGGAGGAAGA ACAGCGCCGA
CGTCGCGCGG GTCGCGAGAA CGGCAATAAA ATTTTTAATA AATCGCGTGG TCGGCTTAAT
AGCAAACTGA GCATTCCCGA TCCGGCTGAG TAA
 
Protein sequence
MPVLHNRISN DELKAKMLAE SEPRTTISFY KYFTIASPQQ TRDALYQVFT ALDVFGRVYL 
AHEGINAQIS VPQSKVETFR QQLYTFDPAL DGLRLNIALE DDGKSFWVLR MKVRDRIVAD
GIDDPTFDAS NVGDYLKAAD VNAMLDDPDA VFIDMRNHYE YEVGHFENAL EIPADTFREQ
LPKAVEMLRE HADKKIVMYC TGGIRCEKAS AWMKHNGFNK VWHIEGGIIE YARRAREQGL
PVRFIGKNFV FDERMGERIS DEVIAHCHQC GAPCDSHTNC KNDGCHLLFI QCPQCASKFN
GCCSEQCCEE LALPEEEQRR RRAGRENGNK IFNKSRGRLN SKLSIPDPAE
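
The sequences above are printed in space-separated groups of 10 characters, so they need to be cleaned before any analysis. A minimal sketch, assuming the block is pasted in as a plain string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the GC fraction is computed as (G + C) / total length, which may differ slightly from the rounding used for the GC content field of the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Last line of the gene sequence in this record, used as a small example.
last_line = "AGCAAACTGA GCATTCCCGA TCCGGCTGAG TAA"
seq = clean_sequence(last_line)
print(len(seq))  # 33
```

Passing the whole gene sequence block through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field.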