Gene Sbal223_1861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_1861 
Symbol 
ID7086535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp2199457 
End bp2200521 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content48% 
IMG OID643460765 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_002357789 
Protein GI217973038 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0481085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000026222 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCATAA AAACAGACGA ACTACGCACT TCCTTATTAG CAAAAGTCAT CTCACCTGCG 
CAACTGGCAT CTGAGTTCCC ATTAACTCAA GATGCGGCCG ATTATCTGGT TCAGCAACGT
CGTGAAGTCG AAGCCATTAT TATGGGTGAA GATCAACGTC TTTTGGTGAT CGTAGGCCCA
TGCTCAATAC ACGATACCAA AGCGGCATTA GATTATGCGC AGCGCTTAGC CGCGTTACAT
CAAGAATTGA AAGATGATTT GTGTATCTTA ATGCGGGTCT ATTTTGAGAA ACCTCGCACG
ATTGTCGGCT GGAAAGGATT AATTTCTGAT CCCGATCTTG ATGGCAGTTT CGAGCCAAAC
AAAGGCTTAC GTATTGCTCG CGAGTTGCTG CAACAAATCA CTGAATTAAA ATTACCTATC
GCCACCGAAT TCTTAGATAT GGTGAACGGT CAGTACATCG CCGATCTCAT TACTTGGGGC
GCAATTGGCG CACGCACCAC AGAAAGCCAA GTTCACCGTG AAATGGCCTC GGCACTGTCT
TGTCCTGTTG GTTTCAAAAA CGGTACCGAT GGCAATATCA ACATCGCCGT CGATGCAGTG
CGCGCCGCGC AAGTTCCGCA TATTTTCTAT TCACCGGATA AAGATGGCGC GATGGCGGTT
TATCGTACCC ACGGTAATCC GTTTGGTCAT ATCATTCTGC GCGGCGGCAA AACGCCTAAT
TACCAGGCCG AAGATATCGA AAAGGCACGG CAACAACTTG CCTCTGTTGG GGTCACTCAA
CGTATGGTGG TTGATTTCAG CCACGGTAAC AGTGAGAAGA ACCACCTTAA GCAGCTTAAC
GTGGCTGATG AAATCATGGC GCAAATGCGG GCAGGCAGCA CTGCCATCGC CGGTATTATG
GCTGAAAGCT TCTTACAGGA AGGCAACCAG AAAGTCGTTG CAGACCAGCC ACTCTGTTAT
GGCCAAAGCA TCACAGATGC CTGCTTACAT TGGGATGACT CAGAAGTCCT GCTTCGTAAA
CTAGCAGCTG CTTCCCGCGA GCGAAAAGCA TTGTTAGCAA AATAA
 
Protein sequence
MTIKTDELRT SLLAKVISPA QLASEFPLTQ DAADYLVQQR REVEAIIMGE DQRLLVIVGP 
CSIHDTKAAL DYAQRLAALH QELKDDLCIL MRVYFEKPRT IVGWKGLISD PDLDGSFEPN
KGLRIARELL QQITELKLPI ATEFLDMVNG QYIADLITWG AIGARTTESQ VHREMASALS
CPVGFKNGTD GNINIAVDAV RAAQVPHIFY SPDKDGAMAV YRTHGNPFGH IILRGGKTPN
YQAEDIEKAR QQLASVGVTQ RMVVDFSHGN SEKNHLKQLN VADEIMAQMR AGSTAIAGIM
AESFLQEGNQ KVVADQPLCY GQSITDACLH WDDSEVLLRK LAAASRERKA LLAK