Gene SeD_A1743 details


Gene Information

Locus tag: SeD_A1743
Symbol:
ID: 6871222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1678715
End bp: 1680139
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 53%
IMG OID: 642784880
Product: aminotransferase
Protein accession: YP_002215548
Protein GI: 198244240
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
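
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1678715, 1680139)
print(length)                     # 1425
print(protein_length_aa(length))  # 474
```

Both results match the Gene Length and Protein Length fields of the record.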


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.455944
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAT ATCAACGTCT GGCGGAGCAA ATTAGAGAAC AAATCGCCTC TGGCGTTTGG 
CAACCCGGCG ATCGATTACC CTCGCTGAGG GAGCAGGTCG CCAGTAGCGG CATGAGTTTT
ATGACTGTCG GTCATGCGTA CCAGTTGCTG GAAAGTCAGG GACGGATTAT CGCCCGTCCG
CAATCTGGTT ATTATGTCGC GCCGCATCCG GTTTGTCGGT CAGTCGCGAC GGCAGCGCAC
GTTATTCGGG ATGAAGCCGT AGATATCAAT ACCTATATTT TTGAGATGCT GCAGGCGAGC
CGTGATACCT CTGTGGTGCC CTTTGCCTCA GCGTTTCCCG ATCCGCGTCT GTTTCCCTTG
CCGCAATTGA ACCGCTCTCT GGCGCAGGTC AGTAAAACCG CGACGGCAAT GAGCGTCATC
GAAAACTTGC CGCCAGGTAA TGCAGAATTG CGCTATGCGA TAGCACGCCG TTACGCGCAG
CAGGGGATTA CCGTTTCTCC TGACGAGATT GTCATTACCG CCGGCGCGCT GGAAGCGTTG
AATCTTAGTT TACAGGCGGT GACGGCGCCG GGAGACTGGG TGGTGGTGGA GAACCCCTGT
TTCTACGGTG CGCTACAGGC CCTGGAGCGA TTGCGCCTGA AAGCGTTGTC GATACCTACC
GATGTTAAAG AAGGTATCGA TCTTATGGCG CTGGAACAGG CGTTACAGGA ATATCCGGTG
AAAGCGTGCT GGTTAATGAC CAACAGCCAG AATCCGTTAG GGTTCACGCT GAGTGCTGAG
AAAAAAGCGC GGCTGGTCGC GTTACTCACG CACCATAACG TGACGCTAAT TGAAGACGAT
GTTTATAGCG AACTCTACTT TGGCCGCGAG AAACCTCTGC CTGCCAAAGC ATGGGATCGC
GACGATACCG TGTTGCATTG TTCATCTTTT TCCAAATGTC TGGTTCCCGG TTTTCGCATT
GGCTGGGTTG CGGGAGGAAG CCATGCCAGA CAAATTCAGC GTTTGCAGTT AATGAGCACA
TTATCCACCA GTTCCCCCAT GCAATTGGCG TTAGTGGATT ATCTCTCGAC ACGTCGATAT
GACGCGCATC TGCGCCGATT GCGAAGGCAG CTTGCCGAAC GTAAACAGCA AGCCTGGCAA
ACCCTTTTGC GCCATCTCCC GGCGGAAGTA AAAATTCACC ATAACGATAG CGGTTATTTT
CTTTGGCTGG AGCTACCTGA GCAGCTTGAT GCCGGTGAAC TTAGCGCCAA AGCGCTTGAA
CACCTTATCA GTATCGCGCC GGGGAAAATG TTTTCAACGT CCGGCGCCTG GACGCGTTTC
TTTCGTTTTA ACACTGCATG GCATTGGGGA GAACGGGAAG AACAGGCTGT AAAACAGCTT
GGCAGCCTGA TTCGCGAGAT GCTGCGCGCT AAAAGCCTTG TGTGA
 
Protein sequence
MKKYQRLAEQ IREQIASGVW QPGDRLPSLR EQVASSGMSF MTVGHAYQLL ESQGRIIARP 
QSGYYVAPHP VCRSVATAAH VIRDEAVDIN TYIFEMLQAS RDTSVVPFAS AFPDPRLFPL
PQLNRSLAQV SKTATAMSVI ENLPPGNAEL RYAIARRYAQ QGITVSPDEI VITAGALEAL
NLSLQAVTAP GDWVVVENPC FYGALQALER LRLKALSIPT DVKEGIDLMA LEQALQEYPV
KACWLMTNSQ NPLGFTLSAE KKARLVALLT HHNVTLIEDD VYSELYFGRE KPLPAKAWDR
DDTVLHCSSF SKCLVPGFRI GWVAGGSHAR QIQRLQLMST LSTSSPMQLA LVDYLSTRRY
DAHLRRLRRQ LAERKQQAWQ TLLRHLPAEV KIHHNDSGYF LWLELPEQLD AGELSAKALE
HLISIAPGKM FSTSGAWTRF FRFNTAWHWG EREEQAVKQL GSLIREMLRA KSLV
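
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch in plain Python, assuming the sequences are pasted in as strings with the 10-character spacing shown here. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 assigns the same amino acids to codons and differs only in the start codons it permits, which this sketch does not model (this gene starts with ATG, so it does not matter here).

```python
import re

# Standard codon assignments, built in TCAG order. Translation table 11 uses
# the same codon-to-amino-acid mapping; alternative starts are not handled.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return re.sub(r"\s+", "", raw).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAAAAT ATCAACGTCT GGCGGAGCAA"))  # MKKYQRLAEQ
```

Passing the full gene sequence to `translate` and comparing the result with `clean_sequence` applied to the protein sequence checks the two blocks against each other, and `gc_fraction` on the full gene sequence can be compared with the GC content field.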