Gene Shew185_3704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShew185_3704 
Symbol 
ID5372408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS185 
KingdomBacteria 
Replicon accessionNC_009665 
Strand
Start bp4399393 
End bp4400448 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content46% 
IMG OID640831962 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001367892 
Protein GI153002211 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000561065 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTACC AAAATGATGA CGTTCGCATT AAAGAAGTAA AAGAGTTACT TCCTCCTATC 
GCGATTCTAG AACGATTTCC TGCTTCTGAA AAAGCCTCTG CGACTGTGTT TAATGCGCGA
AATAGTATCC ACAATATTCT GGCTAAAACT GATGATCGCC TGTTAGTGGT GATTGGGCCT
TGTTCTATCC ACGATCCCAA AGCGGCGTTG GAATATGGTC AACGTCTGGT TGCGCTGCGT
GAGCGTTATA AGGATCAACT CGAAATCGTG ATGCGAGTGT ATTTTGAAAA GCCCAGAACC
ACAGTGGGTT GGAAGGGGCT TATCAACGAT CCTTACATGG ATAACAGCTT TAAACTCAAC
GATGGTTTAC GCACTGCGCG TAAGTTATTG GTGGATTTGA ACGATAGCGG CATGCCAACC
GCGGGTGAGT TTCTTGATAT GATCACCCCA CAATATATGG CAGATTTAAT GTGCTGGGGA
GCCATCGGCG CCCGTACTAC TGAATCACAA GTGCACAGAG AGTTAGCCTC GGGTCTTTCT
TGTCCGGTCG GTTTTAAAAA TGGGACCGAT GGCACCATTA AAGTTGCTAT TGATGCGATA
GGTGCTGCCA ATGCACCGCA CCATTTTTTA TCTGTGACTA AGTTGGGTCA TTCGGCGATC
GTTTCGACGA AAGGGAATCC TGATTGCCAC ATTATTTTAC GTGGCGGCCG CGAGCCTAAT
TATAGTGCGC CGCATGTCGC TGAAATTAGC CAGCAGTTAT TAAAAGCTAA ACTTGCCGAC
AACATCATGA TCGACTTTAG CCACGCCAAT AGTAGTAAAG AGTATCAACG TCAGTTAGTG
GTTGCCGAAG ATGTGGCTGG CCAAGTAGCG ACGGGCAATA CTGCTATTTT TGGTGTTATG
GTAGAAAGCC ATTTAGTGGA AGGTCGTCAG GATTTAATTG AAGGTCAAGA GTTGTGTTAT
GGCCAGAGTA TTACCGATGC GTGTATCGGT TGGGATGATA CCGAGAGCCT GTTGGCCATT
CTGAATCAGA GTATTATCGA ACGCCGTCAG GTTTAA
 
Protein sequence
MYYQNDDVRI KEVKELLPPI AILERFPASE KASATVFNAR NSIHNILAKT DDRLLVVIGP 
CSIHDPKAAL EYGQRLVALR ERYKDQLEIV MRVYFEKPRT TVGWKGLIND PYMDNSFKLN
DGLRTARKLL VDLNDSGMPT AGEFLDMITP QYMADLMCWG AIGARTTESQ VHRELASGLS
CPVGFKNGTD GTIKVAIDAI GAANAPHHFL SVTKLGHSAI VSTKGNPDCH IILRGGREPN
YSAPHVAEIS QQLLKAKLAD NIMIDFSHAN SSKEYQRQLV VAEDVAGQVA TGNTAIFGVM
VESHLVEGRQ DLIEGQELCY GQSITDACIG WDDTESLLAI LNQSIIERRQ V