Gene SeD_A2521 details

Gene Information

Locus tag: SeD_A2521
Symbol:
ID: 6873188
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2401269
End bp: 2402207
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 57%
IMG OID: 642785601
Product: tRNA-dihydrouridine synthase C
Protein accession: YP_002216259
Protein GI: 198242215
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0042] tRNA-dihydrouridine synthase
TIGRFAM ID: [TIGR00737] putative TIM-barrel protein, nifR3 family
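
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 2401269
END_BP = 2402207


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.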


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGTTT TACTGGCGCC GATGGAAGGC GTGCTCGACG CGTTAGTGCG CGAGCTGCTG 
ACCGAAGTGA ATGATTACGA TCTCTGCATC ACCGAATTTG TGCGCGTGGT GGATCAGCTG
CTGCCGGTAA AAGTGTTTCA TCGCATCTGC CCGGAGTTGC ATTACGCCAG CCGCACGCCG
TCCGGCACGC CGGTGCGTAT TCAGCTTCTG GGCCAGCATC CGCAGTGGCT GGCGGAAAAC
GCCGCGCGGG CGACGGCGTT GGGATCGTAT GGCGTGGACC TGAACTGCGG CTGTCCGTCA
AAAGTGGTGA ACGGCAGCGG CGGCGGCGCG ACATTGCTCA AAGATCCCGA ACTCATCTAT
CAGGGCGCGA AAGCGATGCG GGCCGCGGTA CCGTCGCATC TTCCGGTGAC GGTAAAAGTG
CGTCTCGGCT GGGATAGCGG CGATAGAAAA TTTGAAATCG CCGATGCTGT GCAGCAGGCT
GGCGCCAGTG AACTGGTGGT GCATGGTCGT ACCAAAGCGC AGGGCTACCG CGCCGAGCAT
ATCGACTGGC AGGCGATCGG CGAAATACGC CAGCGTCTGA CTATTCCGGT TATCGCGAAT
GGCGAAATCT GGGACTGGCA GAGCGCGCAG GCATGTATGG CGACCAGCGG CTGCGATGCG
GTGATGATTG GCCGTGGGGC GTTAAATATT CCTAACCTGA GCCGGGTGGT GAAGTATAAC
GAACCGCGTA TGCCGTGGCC GGAAGTGGTA ACGTTATTAC AAAAATATAC CCGACTGGAA
AAGCAGGGCG ATACCGGTTT ATACCATGTC GCGCGTATTA AACAGTGGTT GGGATATTTG
CGTAAGGAAT ATATTGAGGC GACGGAACTC TTTCAGTCGA TTCGGGCGTT AAACCGTTCA
TCCGAGATTG CGCGGGCGAT TCAGGCTATT AAAATCTAA
 
Protein sequence
MRVLLAPMEG VLDALVRELL TEVNDYDLCI TEFVRVVDQL LPVKVFHRIC PELHYASRTP 
SGTPVRIQLL GQHPQWLAEN AARATALGSY GVDLNCGCPS KVVNGSGGGA TLLKDPELIY
QGAKAMRAAV PSHLPVTVKV RLGWDSGDRK FEIADAVQQA GASELVVHGR TKAQGYRAEH
IDWQAIGEIR QRLTIPVIAN GEIWDWQSAQ ACMATSGCDA VMIGRGALNI PNLSRVVKYN
EPRMPWPEVV TLLQKYTRLE KQGDTGLYHV ARIKQWLGYL RKEYIEATEL FQSIRALNRS
SEIARAIQAI KI