Gene Sama_3367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_3367 
Symbol 
ID4605614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp3993164 
End bp3994243 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content50% 
IMG OID639782787 
Product3-dehydroquinate synthase 
Protein accessionYP_929239 
Protein GI119776499 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0475831 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAAA TTCAGGTTGA TTTGGCAAAT CGAAGTTATC CGATCCACAT TGGCCCGAAT 
TTGTTTGAAG ATCAGGAGCT CTTTGCGCCT GTGGTGAGTG GCAAAAAAGT ACTCGTTGTC
AGTAACGAGA CGATTGCTCC CCTCTATCTC GATAAGATCA GTGCCACCCT GTCCGCTCGC
GCGGCTCAGG TGGCTTCTGT GATCCTGCCT GACGGCGAGC AATACAAAAC ACTCGACTAT
CTCAATGAGA TTTTTGACGC TCTGCTTGAA GGAAATTTTG CCCGCGATTG TGTTCTGGTT
GCACTCGGTG GTGGTGTGAT TGGCGATATG ACCGGGTTTG CTGCGGCCTG TTATCAACGC
GGTGTTGATT TTATTCAAAT TCCTACCACG CTGTTATCCC AGGTCGACTC CTCTGTGGGT
GGCAAAACTG CTGTGAACCA TCCTTTGGGT AAAAACATGA TAGGAGCCTT CTATCAACCC
AAATTGGTGG TCATTGATAT TAATTGCCTG AAAACTCTGC CAGCCAGAGA GTTTGCAGCC
GGTATGGCAG AGGTGATTAA GTACGGCATT ATTCGTGACA GTGAACTCTT TACCTGGTTG
GAACAAAACG TTTCAGCGCT TAAAGCACTG GACCAGGATG CCATTATCCA CGTTATTGCC
CGCTGCTGCG AAATCAAAGC CGAGGTAGTA TCAGAAGATG AAACAGAGCA GGGCGTTCGC
GCTTTGCTCA ATCTGGGACA CACCTTTGGT CATGCCATTG AAGCCGAAAT GGGCTATGGC
AATTGGCTGC ATGGCGAAGC TGTTGCCGCT GGCATGGTAC TTGCTGCACA AACTTCGGTA
ACACTGGGGC TAATCGATAA GTCAATTCTT TGTCGTATTG CGGCACTTAT TCAGGCATTC
GATTTGCCCG TCCAGGCTCC GGAGTCCATG GACTTTAATA GCTTTATTAA GCATATGCGG
CGAGATAAAA AGGTATTGGG TGGCCAGCTC AGGTTGGTTT TGCCGTTGGG CATTGGCGCC
GCTGAAGTCA GCAGCCAGGC CTCTGATGCC GAGCTGGCAG AGGTCATTCG CCGCCCCTGA
 
Protein sequence
MKQIQVDLAN RSYPIHIGPN LFEDQELFAP VVSGKKVLVV SNETIAPLYL DKISATLSAR 
AAQVASVILP DGEQYKTLDY LNEIFDALLE GNFARDCVLV ALGGGVIGDM TGFAAACYQR
GVDFIQIPTT LLSQVDSSVG GKTAVNHPLG KNMIGAFYQP KLVVIDINCL KTLPAREFAA
GMAEVIKYGI IRDSELFTWL EQNVSALKAL DQDAIIHVIA RCCEIKAEVV SEDETEQGVR
ALLNLGHTFG HAIEAEMGYG NWLHGEAVAA GMVLAAQTSV TLGLIDKSIL CRIAALIQAF
DLPVQAPESM DFNSFIKHMR RDKKVLGGQL RLVLPLGIGA AEVSSQASDA ELAEVIRRP