Gene Spro_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_0804 
Symbol 
ID5603740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp898400 
End bp899935 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content62% 
IMG OID640936315 
Producthistidine ammonia-lyase 
Protein accessionYP_001477038 
Protein GI157369049 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.422259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.987509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCGC TGACTATTCG CCCAGGCCAA CTGACACTGG CGCAACTGCG TGAAATTTAT 
CAACACCCGG TCACCCTGAC TCTGGACGAC AACGCCTACG CAGATATCCA GAAAAGCGTT
GCCTGCGTTG AACGTATCGT TGAGGAAAAC CGTACCACCT ACGGCATTAA CACCGGTTTC
GGCCTGCTGG CATCCACCCG CATCGCTCGC GAAGATCTGG AAGATCTGCA GCGTTCTATC
GTGCTGTCGC ATGCCGCCGG CGTCGGTGCG CCCACCGACG ACAATCTGGT GCGCCTGATT
ATGGTGCTGA AAATCAATAG CCTGTCGCGC GGCTTCTCCG GCATCCGTCT GGAAGTGATC
GAGGCGCTGA TGGCGCTGGT CAACGCCGAA GTCTATCCGC ATATCCCGCT GAAAGGCTCC
GTAGGCGCCT CCGGCGACCT GGCACCGCTG GCACACATGA GCCTGGTGCT GTTGGGTGAA
GGCAAGGCCC GTTATCAGGG CGAATGGCTG CCTGCCACCG AAGCGCTGGC CAAAGCGGGC
CTGAAACCGC TGACGCTGGC GGCCAAAGAA GGCCTGGCAC TGCTGAACGG CACCCAAGTT
TCCGCCGCCT TTGCCCTGCG CGGTTTGTTT GACGTGGAAG ACCTGTACGC CGCGGCAACG
GTCACCGGTA GCCTGACGGT GGAAGCCGCT CTGGGTTCAC GCAGCCCGTT TGATGCGCGC
ATTCATGCCG TGCGCGGTCA GCGTGGTCAG ATTGACGCCG CCGCCGCTTA CCGCCATCTG
CTGGGCGAGC GCAGCGAAGT GTCCGATTCA CACCGCAACT GTGAAAAAGT GCAGGATCCG
TACTCCCTGC GCTGCCAGCC GCAGGTGATG GGCGCCTGCC TGACGCAAAT TCGCCAGGCC
GCCGAAGTGC TGGAAATTGA AGCCAACGCG GTCTCCGACA ACCCGTTGGT CTTTGCTGAC
CAAGGCGACG TACTGTCCGG CGGTAACTTC CACGCCGAAC CGGTCGCCAT GGCCGCCGAC
AATCTGGCGC TGGCGTTTGC CGAAATAGGT TCACTGTCCG AGCGCCGCAT CTCGCTGATG
ATGGATAAAC ACATGTCGCA GCTGCCACCT TTCCTGGTGG ACAACGGCGG GGTGAACTCC
GGCTTTATGA TTGCCCAGGT GACCGCCGCG GCGCTAACCA GCGAAAACAA AGCGCTGGCG
CACCCGGCCA GCGTCGACAG CATTCCGACC TCGGCCAACC AGGAAGACCA CGTTTCCATG
GCACCGGCCG CCGGTCGTCG CCTGTGGGAA ATGGCGGATA ACGTTCGCGG CATTCTGGCC
GTCGAGTGGC TGGCAGCCTG TCAGGGGCTG GATCTGCGTA AAGGGCTGAA AACCACCGAA
AGCCTGGAGC AGGCCCGCCG CACACTGCGC GAGCAAGTCA GTTACTACGA CAAGGATCGT
TTCTTCGCTC CCGACATTGA AGCCGCCAGC CTGCTGTTGG CGGCCGGTCA CCTGACCTCG
CTAATGCCTG CAGCACTGCT GCCTAGCCAG GCATAA
 
Protein sequence
MKALTIRPGQ LTLAQLREIY QHPVTLTLDD NAYADIQKSV ACVERIVEEN RTTYGINTGF 
GLLASTRIAR EDLEDLQRSI VLSHAAGVGA PTDDNLVRLI MVLKINSLSR GFSGIRLEVI
EALMALVNAE VYPHIPLKGS VGASGDLAPL AHMSLVLLGE GKARYQGEWL PATEALAKAG
LKPLTLAAKE GLALLNGTQV SAAFALRGLF DVEDLYAAAT VTGSLTVEAA LGSRSPFDAR
IHAVRGQRGQ IDAAAAYRHL LGERSEVSDS HRNCEKVQDP YSLRCQPQVM GACLTQIRQA
AEVLEIEANA VSDNPLVFAD QGDVLSGGNF HAEPVAMAAD NLALAFAEIG SLSERRISLM
MDKHMSQLPP FLVDNGGVNS GFMIAQVTAA ALTSENKALA HPASVDSIPT SANQEDHVSM
APAAGRRLWE MADNVRGILA VEWLAACQGL DLRKGLKTTE SLEQARRTLR EQVSYYDKDR
FFAPDIEAAS LLLAAGHLTS LMPAALLPSQ A