Gene Shewmr4_3358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3358 
Symbol 
ID4253924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp4005597 
End bp4006655 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content49% 
IMG OID638119996 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_735481 
Protein GI113971688 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0607485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTACC AAAATGATGA CGTTCGCATT AATGAAGTCA AAGAGTTACT TCCACCAATT 
GCGATTCTAG AACGATTTCC TGCCTCCGAA AACGCTGCTG CAACTGTGTT TAACGCCCGT
CAAAGCATTC ATAACATTCT TGCCCGCCAA GATGATCGTT TGTTAGTGGT CATTGGTCCT
TGCTCAATTC ACGATCCTAA GGCCGCATTG GAATATGGCC AGCGATTGGT TGCGCTCAGA
GAACGTTATC AAGATCAGCT CGAAGTGGTG ATGCGTGTTT ATTTTGAAAA GCCGCGTACT
ACAGTGGGTT GGAAGGGACT CATCAACGAT CCTTATATGG ATAACAGCTT CAAACTGAAC
GATGGTTTGC GTACCGCGCG TAAACTTTTA GTCGATTTGA ATGACTCTGG CATGCCGACC
GCGGGTGAGT TTCTCGATAT GATCACTCCG CAATATGTGG CGGATATGAT GTGCTGGGGC
GCGATTGGTG CGCGCACCAC TGAATCGCAG GTGCACCGTG AATTAGCGTC TGGGTTGTCT
TGCCCTGTAG GCTTTAAAAA TGGCACCGAC GGCACGATCA AGGTGGCGAT TGATGCCATT
GGCGCCGCGA ATGCGCCGCA CCACTTCTTA TCGGTAACCA AATTTGGCCA TTCGGCGATT
GTGTCGACCA AAGGTAATCC TGATTGCCAT ATCATTCTGC GTGGTGGCCG TGAGCCTAAT
TACAGTGCCA GCCATGTTGC GCAGATCAGC GAACAACTGA AAAAGGCCAA GCTTGTCGAT
AACATTATGA TCGACTTTAG CCATGCCAAT AGCAGCAAAC AATATCAACG CCAGATGGAT
GTCGCGAACG ATGTGGCCGA GCAAGTGGCC GCGGGTAATA AGGCAATTTT CGGTGTGATG
GTTGAAAGCC ACTTAGTGGA AGGTCGCCAA GATCTGATTG AAGGCCAGGC GTTATGTTAT
GGCCAGAGTA TTACGGATGC CTGTATTGGC TGGGATGATA CCGAGCGTCT GTTAGATGTG
TTAAATCAGA GCGTTATTGC ACGCCGCCAG CGGGTCTAA
 
Protein sequence
MYYQNDDVRI NEVKELLPPI AILERFPASE NAAATVFNAR QSIHNILARQ DDRLLVVIGP 
CSIHDPKAAL EYGQRLVALR ERYQDQLEVV MRVYFEKPRT TVGWKGLIND PYMDNSFKLN
DGLRTARKLL VDLNDSGMPT AGEFLDMITP QYVADMMCWG AIGARTTESQ VHRELASGLS
CPVGFKNGTD GTIKVAIDAI GAANAPHHFL SVTKFGHSAI VSTKGNPDCH IILRGGREPN
YSASHVAQIS EQLKKAKLVD NIMIDFSHAN SSKQYQRQMD VANDVAEQVA AGNKAIFGVM
VESHLVEGRQ DLIEGQALCY GQSITDACIG WDDTERLLDV LNQSVIARRQ RV