Gene SeD_A4828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4828 
Symbol 
ID6874566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4681806 
End bp4683719 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content54% 
IMG OID642787717 
ProductHTH domain-containing protein 
Protein accessionYP_002218311 
Protein GI198245784 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG1762] Phosphotransferase system mannitol/fructose-specific IIA domain (Ntr-type)
[COG3711] Transcriptional antiterminator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGATTCC CCAACCAACG TTTAGCGCAG CTATTTGCGA TGCTGCAAAA CGAGACGCTG 
CCGCAGGATG AGCTGGCGCA GCGGTTGTCG GTCTCCACGC GTACCGTTCG GGCGGATATC
GCCGCGCTGA ACATGTTGCT GACGCCGCAT GGCGCGCAAT TTACTCTCAG CCGCGGCAAC
GGGTATCAGC TCAAAATTGA TGATCCGGCA CGTTATCAAT CCCTGCAAAC GCAGCAATCT
CCCGCCCTGG CGCGCGGTCC GCGCACCAGC CAGGAGCGGA TACACTATTT GCTGGCGCGT
TTTTTAACCT CCGCCTTCTC GCTGAAGCTG GAGGATTTAG CAGATGAATG GTTTGTCAGC
CGTGCGACGT TGCAGAACGA CATGGCGGAC GTGCGCGAGC ATTTGCTGCG TTATCATCTG
ACGCTGGAAA CGCGTCCGCG TCATGGCATG AAATTGTTTG GCGGAGAGAT GGCGATTCGC
GCCTGTCTGA CCGACCTTTT ATGGACGCTG GCGCAGCAGG AGCCCTCTCA TCCGTTAATT
GTTAGTACCA CGCTGAACAC CGAGGTGTCT CAACGGCTGC GGTCTCTTTT GCCGGATATT
TTCTCTCATT GTCAAATCCG CCTGACCGAT GAGGGCGAGC TGTTTCTACG TTTATACTGC
GCGGTGGCGG TACGGCGTAT TCGCGAGGGG TATCCGTTAT CGGAATGTGT GGCGGAGGAG
GTTGATGAAA AGGTGCGCCA TGCGGCGCAT GAGATTGCGG AGCTACTACA ACAGCTGGCC
GACAAGCCGC TGTCGGAGCC GGAAGTAAGC TGGCTGAAGG TGCATATTGC CGCCCGCCAG
GTACAGGAGA TTGCCCCCAG CGCCATTAAT GCCGATGATG AAGAGGCGCT GGTGCGCTAT
ATCCTCAATT TTATTAATAC CCAGTACAAC TATAACCTGT TGAATGATAA ACAGCTTCAC
GCGGACTTGC TGACCCACAT CAAAACGATG ATCACCCGCG TGCGCTACCA GATCATGATC
CCCAACCCAC TACTGGAAAA TATTAAGCAG CACTATCCGA TGGCGTGGGA TATGACGTTG
GCGGCGATAT CGAGTTGGGG AAAATACACG CCGTATACCA TTAGCGAAAA CGAAATCGGT
TTTCTGGTGC TGCATATTGG CGTTGGGCTG GAACGTAGTT ACAACATTGG CTACCAGCGG
CAGCCGCAGG TACTGTTGGT GTGCGATGCT GGTAATGCGA TGGTGCGGAT GATTGAAGCG
GTACTGGCGC GGAAATACCC GCAGATTGAG ATTGCCCGCA CTCTGACGCT ACGCGACTAT
GAGGCGCGGG ACAGTATTGT GGAGGATTTT GTGATTTCCA CGGCGCGGAT CGGTGAAAAA
GATAAGCCGG TCATCATGAT CGCACCCTTT CCTACCGACT ATCAATTGGA GCAGATCGGT
AAGCTGGTGC TGGTGGACAG AACGCGCCCG TGGATGCTGG ATAAATATTT CGATGCCTCG
CATTTTCGCA TCGTGGAGGG GGAAATAGAT CAACAGACGC TGTTTAAAAC GCTGTGCGAT
CAGTTGCATG AGGAAGGCTT TGTTGATGCG GCGTTTCTCG ATTCGGTTAT TGAACGTGAA
GCTATCGTCA GTACGTTATT AGGCGACGGG ATTGCCTTGC CACACGCGCT GGGGCTGCTG
GCGAAGAAAA CGGTGGTTTA TACGGTGCTC GCGCCACAGG GGATTGCCTG GGGTGACGAA
ACGGCGCACG TTATTTTTTT ACTCGCCATC AGCAAAAGTG AATATGAAGA GGCGATGGCC
ATCTACGATA TTTTCGTCAC TTTCCTGCGC GAACGCGCCA TGACGCGCCT CTGCGCATGT
CAGAATTTTA CGCAATTTAA AACGGTCGCG ATGGAGTGCG TGAGTCGTTT TTGA
 
Protein sequence
MRFPNQRLAQ LFAMLQNETL PQDELAQRLS VSTRTVRADI AALNMLLTPH GAQFTLSRGN 
GYQLKIDDPA RYQSLQTQQS PALARGPRTS QERIHYLLAR FLTSAFSLKL EDLADEWFVS
RATLQNDMAD VREHLLRYHL TLETRPRHGM KLFGGEMAIR ACLTDLLWTL AQQEPSHPLI
VSTTLNTEVS QRLRSLLPDI FSHCQIRLTD EGELFLRLYC AVAVRRIREG YPLSECVAEE
VDEKVRHAAH EIAELLQQLA DKPLSEPEVS WLKVHIAARQ VQEIAPSAIN ADDEEALVRY
ILNFINTQYN YNLLNDKQLH ADLLTHIKTM ITRVRYQIMI PNPLLENIKQ HYPMAWDMTL
AAISSWGKYT PYTISENEIG FLVLHIGVGL ERSYNIGYQR QPQVLLVCDA GNAMVRMIEA
VLARKYPQIE IARTLTLRDY EARDSIVEDF VISTARIGEK DKPVIMIAPF PTDYQLEQIG
KLVLVDRTRP WMLDKYFDAS HFRIVEGEID QQTLFKTLCD QLHEEGFVDA AFLDSVIERE
AIVSTLLGDG IALPHALGLL AKKTVVYTVL APQGIAWGDE TAHVIFLLAI SKSEYEEAMA
IYDIFVTFLR ERAMTRLCAC QNFTQFKTVA MECVSRF