Gene Spro_0879 details

Gene Information

Locus tag: Spro_0879
Symbol:
ID: 5602520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 973241
End bp: 974314
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 56%
IMG OID: 640936394
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_001477113
Protein GI: 157369124
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
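
The length fields in this record follow from the coordinates. The sketch below is a minimal check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted as a residue. Neither assumption is stated in the record, but both are consistent with the reported values. The variable names are illustrative.

```python
# Values copied from the record.
start_bp = 973241
end_bp = 974314

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values are 1074 bp and 357 aa, matching the Gene Length and Protein Length fields.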


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.287088
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.139833
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAG ACGCGCTCAA TAACGTTCAT ATCAGTGCAG AACAAATTCT GATCACTCCG 
GAAGAACTGA AGAACCAGTT CCCGCTCAGC GCCAGCGATG AAAATGAAAT CGCTACCGCG
CGTAAAACCA TCGCTGATAT TTTGCAGGGG CGCGATCATC GCCTGCTGGT GGTGTGCGGA
CCCTGTTCTA TCCACGACCC GGATGCTGCC CTGGATTACG CCCGTCATTT GAAAACCCTG
TCGGCTGAAT TGAGCGATCA GCTGTATATC GTTATGCGCG TCTATTTTGA AAAACCACGC
ACCACCGTTG GCTGGAAAGG CCTGATCAAC GATCCGTACA TGGACGGTTC GTTTGATGTT
GAAGCCGGTT TGCACATCGC GCGTCGTCTG TTGCTGGATC TGGTAGGTAT GGGTCTGCCG
TTGGCGACAG AAGCCTTGGA TCCGAACAGC CCGCAATACC TGGGTGACCT GTTCAGCTGG
TCGGCGATTG GTGCACGTAC CACAGAATCG CAGACTCACC GTGAGATGGC TTCAGGCTTG
TCGATGCCGG TGGGCTTCAA AAACGGCACC GACGGCAGCC TGGGGACGGC AATCAACGCC
ATGCGTGCCG CGGCGATGCC ACACCGTTTT GTCGGCATCA ACCAGGCAGG GCAAGTGTGC
CTGCTGCAAA CGCAGGGTAA CCCGGATGGG CACGTGATCC TGCGCGGTGG TAAAACGCCG
AACTACAGTG CCGAAGACGT TGCGGCCTGC GAAAAACAGA TGCGGGACGC GGGACTCCAC
CCGTCCTTGA TGATAGATTG CAGCCATGGC AACTCGAATA AAGACTATCG CCGTCAGCCG
ACGGTTGCCG AGTCCGTGGT AGAGCAGATT AAAGCGGGTA ACCGTTCGAT CACCGGCATC
ATGCTGGAAA GCCACCTGCA CGAAGGCAGC CAGTCTTCTG AACAACCGCG TGCAGATATG
CGCTACGGGG TTTCTGTTAC TGACGCCTGC ATTAACTGGG AGAGTACCCA GACGCTGCTG
CGCCATATGC ACCAGGAACT CGGCACTGCG CTGACGGCAC GTACTGGAGA GTAG
 
Protein sequence
MQKDALNNVH ISAEQILITP EELKNQFPLS ASDENEIATA RKTIADILQG RDHRLLVVCG 
PCSIHDPDAA LDYARHLKTL SAELSDQLYI VMRVYFEKPR TTVGWKGLIN DPYMDGSFDV
EAGLHIARRL LLDLVGMGLP LATEALDPNS PQYLGDLFSW SAIGARTTES QTHREMASGL
SMPVGFKNGT DGSLGTAINA MRAAAMPHRF VGINQAGQVC LLQTQGNPDG HVILRGGKTP
NYSAEDVAAC EKQMRDAGLH PSLMIDCSHG NSNKDYRRQP TVAESVVEQI KAGNRSITGI
MLESHLHEGS QSSEQPRADM RYGVSVTDAC INWESTQTLL RHMHQELGTA LTARTGE
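
The relationship between the gene sequence and the protein sequence can be spot-checked by translation. The sketch below is a minimal illustration, not the database's own pipeline. It translates only the first 60 bases of the gene sequence, using the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon, ignoring whitespace.

    Stop codons are rendered as '*'; a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First 60 bases of the gene sequence, copied with the original spacing.
first_line = "ATGCAAAAAG ACGCGCTCAA TAACGTTCAT ATCAGTGCAG AACAAATTCT GATCACTCCG"
print(translate(first_line))
```

The result, MQKDALNNVHISAEQILITP, matches the first 20 residues of the protein sequence. Running the same function over the full gene sequence should yield the protein followed by `*` for the terminal TAG codon.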