Gene SeD_A1070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1070 
Symbol 
ID6871887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1073816 
End bp1075030 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content46% 
IMG OID642784255 
Productdiaminopropionate ammonia-lyase 
Protein accessionYP_002214929 
Protein GI198242145 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01747] diaminopropionate ammonia-lyase family
[TIGR03528] diaminopropionate ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0000973165 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCATGAGC TTATTAAATA CCAGTTTAAT ACACGTCGGA AAAAATATGG TACAGGAGCG 
GCTTTAAGTT TGCTTAACGG AAATGTTGGG CGTGAGGTGT TAGCATTTCA TAAAAAATTA
CCCAATTATG CCGTTACGCC GTTACATAAT CTGGCGCATC TAAGCCGGCG GCTTGGACTA
GGGTCCATCC ATATTAAAGA TGAGTCCTGG CGTTTTGGTC TGAATGCTTT TAAAGGTCTG
GGCGGCTCTT ATGCTGTAGG AAAATATCTC GCTGATAAAT TGCAATGTGA TATTAACTCG
TTAAGTTTTG CTGCCCTTAA CACTCCTGAG ATTAAAGAAA AAATTAAGGA TTGTGTTTTT
GTTACCGCGA CGGATGGCAA TCATGGCCGT GGTGTGGCGT GGGCGGCGGA GCAATTAGGT
CTAAAAGCCG TCGTTTATAT GCCTAAAGGA TCATCGTTAA TCCGGGCAGA GAATATTCGC
CATCATGGAG CTGAATGCAC CATCACCGAT CTGAACTACG ATGATGCAGT GCGACTGGCC
CATAGAATGG CGCAAACAAA AGGCTGGGTG CTTTTGCAGG ATACAGCCTG GACAGGGTAT
GAAGAGATCC CAACGTGGAT TATGCAAGGC TATATGACAC TAGCGGTTGA AGCTTATGAT
CAGCTCGCAG AAACAAACAG TCCGTTGCCA ACCCACCTTA TTTTACAAGC GGGGGTGGGA
TCATTTGCTG GCAGTGTTAT GGGTTATTTT GTTGAAAAAA TGCAGGAAAG TATCCCTAAT
ATTATTGTGG TTGAGCCGCA TCAGGCCAAC TGTCTTTATC AATCCGCAGT TATGAATGAT
GGTCAACCTC ACTGCGTCAC TGGCGATATG GCGACGATAA TGGCCGGGCT TGCGTGTGGG
GAGCCGAATA TTATCAGTTG GCCTATTATT CGGGACAACA CCAGTTGTTT TATTTCCGCT
GATGATTGTC TGGCGGCTAA GGGTATGCGC ATTTCTGCCG CGCCGCGTCC AGGTACGGAT
ACGCCTTTTA TTTCCGGCGA GTCCGGAGCT ATTGGCGTAG GGTTACTTTA TGAGTTGATG
AACAATATGC ATTATCAGGA TCTTGCTAAT CGCTTACAGC TTGATGCCAA TGCTCATGTT
CTGCTTATTA GCACCGAAGG CGATACGTCC CCAGATATTT ATGAAGATAT AGTCTGGAAC
GGACGCAGTG CTTAA
 
Protein sequence
MHELIKYQFN TRRKKYGTGA ALSLLNGNVG REVLAFHKKL PNYAVTPLHN LAHLSRRLGL 
GSIHIKDESW RFGLNAFKGL GGSYAVGKYL ADKLQCDINS LSFAALNTPE IKEKIKDCVF
VTATDGNHGR GVAWAAEQLG LKAVVYMPKG SSLIRAENIR HHGAECTITD LNYDDAVRLA
HRMAQTKGWV LLQDTAWTGY EEIPTWIMQG YMTLAVEAYD QLAETNSPLP THLILQAGVG
SFAGSVMGYF VEKMQESIPN IIVVEPHQAN CLYQSAVMND GQPHCVTGDM ATIMAGLACG
EPNIISWPII RDNTSCFISA DDCLAAKGMR ISAAPRPGTD TPFISGESGA IGVGLLYELM
NNMHYQDLAN RLQLDANAHV LLISTEGDTS PDIYEDIVWN GRSA