Gene SeHA_C1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1103 
SymboldpaL 
ID6491329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1098227 
End bp1099441 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content46% 
IMG OID642741345 
Productdiaminopropionate ammonia-lyase 
Protein accessionYP_002044997 
Protein GI194447657 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01747] diaminopropionate ammonia-lyase family
[TIGR03528] diaminopropionate ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value0.624522 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGAGC TTATTAAATA TCAGTTTAAT ACACGTCGGA AAAAATATGG TACAGGAGCG 
GCCTTAAGTT TGCTTAACGG AAATGTTGGG CATGAGGTGT TAGCATTTCA TAAAAAATTA
CCCAATTATG CCGTCACGCC GTTACATAAT CTGGCGCATC TAAGCCAGCG GCTTGGACTA
GGGTCCATCC ATATTAAAGA TGAGTCCTGG CGTTTTGGCC TGAATGCTTT TAAAGGTCTG
GGCGGCTCTT ATGCTGTAGG AAAATATCTC GCTGATAAAT TGCAATGTGA TATTAACTCG
TTAAGTTTTG CTGCCCTTAA TACTCCTGAG ATTAAAGAAA AAATTAAAGA TTGTGTTTTT
GTTACCGCGA CGGATGGCAA TCATGGCCGT GGTGTGGCGT GGGCGGCAGA GCAATTAGGT
CTAAAAGCCG TCGTTTATAT GCCTAAAGGA TCATCGTTAA TCCGGGCAGA GAATATTCGC
CATCATGGAG CTGAATGCAC CATCACCGAT CTGAACTACG ATGATGCAGT GCGACTGGCC
CATAGAATGG CGCAAACAAA AGGCTGGGTG CTTTTGCAGG ATACAGCCTG GACAGGGTAT
GAAGAGATCC CAACATGGAT TATGCAAGGC TATATGACAC TAGCGGTTGA AGCTTATGAG
CAGCTCGCAG AAACAAACAG TCCGTTGCCA ACCCATCTTA TTTTACAAGC GGGGGTGGGA
TCGTTTGCTG GCAGTGTTAT GGGTTATTTT GTTGAAAAAA TGCAGGAAAA TATCCCTAAT
ATTATTGTGG TTGAGCCGCA TCAGGCCAAC TGTCTTTATC AATCCGCAGT TATGGATGAT
GGTCAACCTC ACTGCGTCAC TGGCGATATG GCGACGATAA TGGCCGGGCT TGCGTGTGGG
GAGCCGAATA TTATCAGTTG GCCTATTATT CGGGACAACA CCAGTTGTTT TATTTCCGCT
GATGACTGTC TGGCGGCTAA GGGTATGCGT ATTTCTGCCG CGCCGCGTCC AGGTACGGAT
ACGCCTTTTA TTTCCGGCGA GTCCGGAGCT ATTGGCGTAG GGTTACTTTA TGAGTTGATG
AACAATATGC ATTATCAGGA TCTTGCTAAT CGCTTACAGC TTGATGCCAG TGCTCATGTT
CTGCTTATTA GCACCGAAGG CGATACGTCC CCAGATATTT ATGAAGATAT AGTCTGGAAC
GGACGCAGTG CTTAA
 
Protein sequence
MHELIKYQFN TRRKKYGTGA ALSLLNGNVG HEVLAFHKKL PNYAVTPLHN LAHLSQRLGL 
GSIHIKDESW RFGLNAFKGL GGSYAVGKYL ADKLQCDINS LSFAALNTPE IKEKIKDCVF
VTATDGNHGR GVAWAAEQLG LKAVVYMPKG SSLIRAENIR HHGAECTITD LNYDDAVRLA
HRMAQTKGWV LLQDTAWTGY EEIPTWIMQG YMTLAVEAYE QLAETNSPLP THLILQAGVG
SFAGSVMGYF VEKMQENIPN IIVVEPHQAN CLYQSAVMDD GQPHCVTGDM ATIMAGLACG
EPNIISWPII RDNTSCFISA DDCLAAKGMR ISAAPRPGTD TPFISGESGA IGVGLLYELM
NNMHYQDLAN RLQLDASAHV LLISTEGDTS PDIYEDIVWN GRSA