Gene Shewmr4_3697 details


Gene Information

Locus tag: Shewmr4_3697
Symbol: aroB
ID: 4254260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 4412544
End bp: 4413623
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 47%
IMG OID: 638120339
Product: 3-dehydroquinate synthase
Protein accession: YP_735817
Protein GI: 113972024
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
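
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions not stated in the record: that the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information record.
start_bp = 4412544
end_bp = 4413623
gene_length_bp = 1080
protein_length_aa = 359

# Assumption: start and end are 1-based, inclusive coordinates.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with gene length
print(span_bp % 3 == 0)                          # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # protein length agrees
```

Under these assumptions all three checks print True for the values listed here.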


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0194599
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.984962
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACAAC AAATTCAGGT TGATTTAGGT GAACGTAGTT ACCCCATTTA CATTGGCCAG 
AGTTTGATGA GTGATAGCGA GACCTTGTCT CGCTACCTGC TGAAAAAACG TATCCTTATC
GTCACCAATG AAACTGTCGC GCCTTTGTAT CTCAAACAAA TACAAGACAC GATGGCTTCG
TTTGGTGAGG TATCTAGCGT CATCCTTCCC GATGGCGAGC AATTTAAAGA TTTAACGCAT
TTAGATTCTA TTTTTACGGC TTTGCTGCAA CGCAATTATG CCCGTGATTC AGTGCTGGTG
GCTCTCGGTG GTGGAGTTAT TGGTGACATG ACGGGTTTTG CTGCAGCCTG TTACCAACGT
GGTGTCGATT TTATTCAAAT TCCGACCACG CTATTATCAC AAGTAGATTC CTCTGTTGGC
GGGAAAACCG CCGTTAATCA TCCGCTTGGC AAAAATATGA TCGGGGCTTT TTATCAGCCA
CAGATCGTCA TTATCGATAC TGAATGTTTA CAGACCTTGC CTGCGCGAGA ATTCGCTGCG
GGGATGGCAG AAGTCATTAA GTATGGCATC ATGTGGGATG CTGAGTTTTT TCAATGGCTT
GAGAACAATG TTCAGGCATT GAAAAGCCTA GATACTCAAG CTTTGGTCTA CGCGATCTCT
CGCTGCTGTG AGATTAAAGC CGATGTCGTG AGCCAGGATG AGACCGAGCA GGGCGTCCGC
GCATTATTAA ACCTTGGGCA TACCTTTGGA CATGCGATCG AAGCCGAGAT GGGCTATGGT
AATTGGCTGC ATGGTGAAGC GGTTGCGGCT GGCACAGTCC TTGCTGCACA AACGGCTAAG
TCCATGGGAT TGATTGATGA GTCAATTGTT CGTCGTATTG TGCAATTGTT CCACGCTTTC
GATCTGCCAA TAACAGCGCC GGAATCGATG GATTTCGACA GTTTTATTAA ACATATGCGT
CGCGATAAGA AAGTCTTAGG TGGTCAGATC CGGCTGGTAC TCCCGACGGC CATTGGTCGA
GCTGATGTCT TTAGCCAAGT TCCGGAATCT ACCCTAGAAC AGGTTATCTG CTGCGCATAA
 
Protein sequence
MTQQIQVDLG ERSYPIYIGQ SLMSDSETLS RYLLKKRILI VTNETVAPLY LKQIQDTMAS 
FGEVSSVILP DGEQFKDLTH LDSIFTALLQ RNYARDSVLV ALGGGVIGDM TGFAAACYQR
GVDFIQIPTT LLSQVDSSVG GKTAVNHPLG KNMIGAFYQP QIVIIDTECL QTLPAREFAA
GMAEVIKYGI MWDAEFFQWL ENNVQALKSL DTQALVYAIS RCCEIKADVV SQDETEQGVR
ALLNLGHTFG HAIEAEMGYG NWLHGEAVAA GTVLAAQTAK SMGLIDESIV RRIVQLFHAF
DLPITAPESM DFDSFIKHMR RDKKVLGGQI RLVLPTAIGR ADVFSQVPES TLEQVICCA