Gene Spea_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpea_1053 
Symbol 
ID5661452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella pealeana ATCC 700345 
KingdomBacteria 
Replicon accessionNC_009901 
Strand
Start bp1275021 
End bp1276112 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content46% 
IMG OID641235599 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001500915 
Protein GI157960881 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0453685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACAAG ATACAATTAA CAATGTTCAT ATCAGTTCAG AGCAAGTACT AGTCACTCCA 
GAGGAGTTAA AGCGTGAGCT TCCTCTTTCC ACCCATGCTT ACCAGTACGT TCTCAATGCT
CGCAAGACAG TATCTGACAT TGTGCATAAA CGTGATAATC GCGTATTGGT GATTTCAGGT
CCTTGCTCTA TCCATGATAT TGATGCTGCC AAAGAGTACG CGCTAAAGCT TAAGAAGTTA
CATGATGAAC TTGGTGATCA GTTTTTTGTG TTGATGCGAG TTTACTTCGA AAAGCCGCGT
ACTACTGTTG GCTGGAAAGG AATGATTAAC GACCCTGATA TGGATGAATC CTTCGATGTG
GATAAAGGCT TGAGAAAAGC TCGCGAACTG ATGATCTGGC TTGCTGAACT TGAACTGCCT
GTTGCCACTG AGGCGCTCGA TCCTATTAGT CCGCAATACA TGTCGGAGTT GGTAACTTGG
TCAGCCATTG GCGCACGCAC CACTGAGTCA CAGACTCACC GTGAAATGGC GTCGGGTCTT
TCTATGCCGG TAGGCTTTAA AAATGGTACC GATGGCAAGC TTGGTGTGGC CATCAATGCA
TTAGAGTCGG CTGCGAGTAG TCACCGCTTC ATGGGTATTA ACCAGCAAGG TCAGGTCGCA
CTGCTACAGA CCGCAGGTAA CCCTGATGGA CATGTGATTT TACGTGGCGG CAAAACACCT
AACTATGATG CTGCGAGTGT GGCGGAGTGT GAACAGCAGT TACATGCAGC CAAACTCAGT
GCACGATTGA TTGTCGATTG CAGTCATGGT AACTCGTCAA AAGATCATAA TAAGCAAAAG
CCTGTGTGCG AAGATGTTTT TAACCAGATT GTGGCGGGCA ATAAGTCTAT TATCGGTGTG
ATGCTAGAAA GCCATTTAAA TGCGGGTAAG CAGAGCAGTG ACTTGCCGAT GGATGAGCTT
GCTTACGGCG TTTCGGTTAC AGATGCTTGT ATCGACTGGA AAACCACAGA AGCGTTATTA
CGAACGGGAG CCGATGAGTT AGCCTCGGTA TTACCGACAA GATTTGATAT GCTAAAAGTT
GCAAACGGCT GA
 
Protein sequence
MQQDTINNVH ISSEQVLVTP EELKRELPLS THAYQYVLNA RKTVSDIVHK RDNRVLVISG 
PCSIHDIDAA KEYALKLKKL HDELGDQFFV LMRVYFEKPR TTVGWKGMIN DPDMDESFDV
DKGLRKAREL MIWLAELELP VATEALDPIS PQYMSELVTW SAIGARTTES QTHREMASGL
SMPVGFKNGT DGKLGVAINA LESAASSHRF MGINQQGQVA LLQTAGNPDG HVILRGGKTP
NYDAASVAEC EQQLHAAKLS ARLIVDCSHG NSSKDHNKQK PVCEDVFNQI VAGNKSIIGV
MLESHLNAGK QSSDLPMDEL AYGVSVTDAC IDWKTTEALL RTGADELASV LPTRFDMLKV
ANG