Gene SNSL254_A1038 details

Gene Information

Locus tag: SNSL254_A1038
Symbol: dpaL
ID: 6482349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1054343
End bp: 1055557
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 45%
IMG OID: 642736444
Product: diaminopropionate ammonia-lyase
Protein accession: YP_002040203
Protein GI: 194443204
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01747] diaminopropionate ammonia-lyase family
            [TIGR03528] diaminopropionate ammonia-lyase
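
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1054343
END_BP = 1055557
GENE_LENGTH_BP = 1215
PROTEIN_LENGTH_AA = 404


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1215
print(expected_protein_length(GENE_LENGTH_BP))  # 404
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.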


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.61616
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.00023872
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCATGAGC TTATTAAATA TCAGTTTAAT ACACGTCGGA AAAAATATGG TACAGGAGCG 
GCCTTAAGTT TGCTTAACGG AAATGTTGGG CATGAGGTGT TAGCATTTCA TAAAAAATTA
CCCAATTATG CCGTTACGCC GTTACATAAT CTGGCGCATC TAAGCCGGCG GCTTGGACTA
GGGTCCATCC ATATTAAAGA TGAATCCTGG CGTTTTGGCC TGAATGCTTT TAAAGGTCTG
GGCGGCTCTT ATGCTGTAGG AAAATATCTC GCTGATAAAT TGCAATGTGA TATTAACTCG
TTAAGTTTTG CTGCCCTTAA TACTCCTGAG ATTAAAGAAA AAATTAAAGA TTGTGTTTTT
GTTACCGCGA CGGATGGCAA TCATGGCCGT GGTGTGGCGT GGGCGGCAGA GCAATTAGGT
CTAAAAGCCG TCGTTTATAT GCCTAAAGGA TCATCGTTAA TCCGGGCAGA GAATATTCGC
CATCATGGAG CTGAATGCAC CATCACCGAT CTGAACTACG ATGATGCAGT GCGACTGGCC
CATAGAATGG CGCAAACAAA AGGCTGGGTG CTTTTGCAGG ATACAGCCTG GACAGGGTAT
GAAGAGATCC CAACATGGAT TATGCAAGGC TATATGACAC TAGCGGTTGA AGCTTATGAG
CAGCTCGCAG AAACAAACAG TCCGTTGCCA ACCCACCTTA TTTTACAAGC GGGGGTGGGA
TCGTTTGCTG GCAGTGTTAT GGGTTATTTT GTTGAAAAAA TGCAGGAAAA TATCCCTAAT
ATTATTGTGG TTGAGCCGCA TCAGGCCAAC TGTCTTTATC AATCCGCAGT TATGAATGAT
GGTCAACCTC ACTGCGTCAC TGGCGATATG GCGACGATAA TGGCCGGGCT TGCGTGTGGG
GAGCCGAATA TTATCAGTTG GCCTATTATT CGGGACAACA CCAGTTGTTT TATTTCCGCT
GATGATTGTC TGGCGGCTAA GGGTATGCGC ATTTCTGCCG CGCCGCGTCC AGGCACTGAT
ACGCCTTTTA TTTCCGGCGA GTCCGGAGCT ATTGGCGTAG GGTTACTTTA TGAGTTGATG
AACAATATGC ATTATCAGGA TCTTGCTAAT CGCTTACAGC TTGATGCCAA TGCTCATGTT
TTGCTTATTA GCACCGAAGG CGATACGTCC CCAGATATTT ATGAAGATAT AGTCTGGAAC
GGACGCAGTG CTTAA
 
Protein sequence
MHELIKYQFN TRRKKYGTGA ALSLLNGNVG HEVLAFHKKL PNYAVTPLHN LAHLSRRLGL 
GSIHIKDESW RFGLNAFKGL GGSYAVGKYL ADKLQCDINS LSFAALNTPE IKEKIKDCVF
VTATDGNHGR GVAWAAEQLG LKAVVYMPKG SSLIRAENIR HHGAECTITD LNYDDAVRLA
HRMAQTKGWV LLQDTAWTGY EEIPTWIMQG YMTLAVEAYE QLAETNSPLP THLILQAGVG
SFAGSVMGYF VEKMQENIPN IIVVEPHQAN CLYQSAVMND GQPHCVTGDM ATIMAGLACG
EPNIISWPII RDNTSCFISA DDCLAAKGMR ISAAPRPGTD TPFISGESGA IGVGLLYELM
NNMHYQDLAN RLQLDANAHV LLISTEGDTS PDIYEDIVWN GRSA
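
The gene and protein sequences above can be checked against each other, and against the reported GC content, with a short script. This is a minimal sketch rather than the method used by the database: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard one, which assigns the same amino acids as translation table 11 (table 11 differs in which start codons it permits). Only two short fragments of the gene sequence are used here. Paste in the full sequence to check the whole record.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence listed above.
first_bases = "ATGCATGAGC TTATTAAATA TCAGTTTAAT"
last_bases = "GGACGCAGTG CTTAA"

print(translate(first_bases))  # MHELIKYQFN
print(translate(last_bases))   # GRSA
```

The two fragments translate to the first ten and last four residues of the protein sequence, which suggests the listed gene sequence is given in the coding orientation. Calling `gc_content` on the full gene sequence should give a value close to the 45% reported in the record.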