Gene Sbal195_3828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_3828 
Symbol 
ID5755643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp4508392 
End bp4509447 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content46% 
IMG OID641290170 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001556248 
Protein GI160876932 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000267909 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.262784 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTACC AAAATGATGA CGTTCGCATT AAAGAAGTAA AAGAGTTACT TCCTCCTATC 
GCGATTCTAG AACGATTTCC TGCTTCCGAA AAAGCCTCTG CGACTGTGTT TAATGCGCGA
AATAGTATCC ACAATATTCT GGCTAAGTCT GATGATCGCC TGTTAGTGGT AATTGGACCT
TGTTCTATCC ACGATCCCAA AGCGGCGTTG GAATATGGTC AGCGTCTGGT TGCGCTGCGT
GAGCGTTATA AGGGTCAACT CGAAATCGTG ATGCGAGTGT ATTTTGAAAA GCCAAGAACC
ACAGTGGGTT GGAAGGGGCT TATCAACGAT CCTTACATGG ATAACAGCTT TAAACTCAAC
GATGGTTTAC GCACTGCGCG TAAGTTATTG GTGGATTTGA ACGACAGCGG CATGCCAACC
GCGGGTGAGT TTCTTGATAT GATCACCCCA CAATATATGG CAGATTTAAT GTGCTGGGGA
GCCATTGGTG CCCGTACTAC TGAATCACAA GTGCACAGAG AGTTAGCCTC GGGTCTTTCT
TGTCCGGTCG GTTTTAAAAA TGGGACCGAT GGCACCATTA AAGTCGCTAT CGATGCGATA
GGTGCTGCGA ATGCACCGCA CCATTTTTTA TCTGTGACTA AGTTGGGTCA TTCGGCGATC
GTTTCGACGA AAGGGAATCC TGATTGCCAC ATTATTTTAC GTGGCGGCCG CGAGCCTAAT
TACAGTGCGC CGCATGTCGC TGAAATTAGC CAACAGTTAT TAAAAGCTGA ACTTGCCGAC
AACATCATGA TCGACTTTAG CCACGCCAAT AGTAGTAAAC AGTATCAACG ACAGTTAGTG
GTTGCCGAAG ATGTGGCTGG CCAAGTGGCG ACGGGCAATA CTGCTATTTT TGGTGTTATG
GTAGAAAGCC ATTTAGTGGA AGGTCGTCAG GATTTAATTG AAGGTCAAGA GTTGTGTTAT
GGCCAGAGTA TTACCGATGC GTGTATTGGT TGGGATGATA CCGAGCGCCT GTTGGCCATT
CTGAATCAGG GTATTATCGA ACGCCGTCAG GTTTAA
 
Protein sequence
MYYQNDDVRI KEVKELLPPI AILERFPASE KASATVFNAR NSIHNILAKS DDRLLVVIGP 
CSIHDPKAAL EYGQRLVALR ERYKGQLEIV MRVYFEKPRT TVGWKGLIND PYMDNSFKLN
DGLRTARKLL VDLNDSGMPT AGEFLDMITP QYMADLMCWG AIGARTTESQ VHRELASGLS
CPVGFKNGTD GTIKVAIDAI GAANAPHHFL SVTKLGHSAI VSTKGNPDCH IILRGGREPN
YSAPHVAEIS QQLLKAELAD NIMIDFSHAN SSKQYQRQLV VAEDVAGQVA TGNTAIFGVM
VESHLVEGRQ DLIEGQELCY GQSITDACIG WDDTERLLAI LNQGIIERRQ V