Gene Sbal223_4002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_4002 
SymbolaroB 
ID7086218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp4762732 
End bp4763808 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content46% 
IMG OID643462877 
Product3-dehydroquinate synthase 
Protein accessionYP_002359898 
Protein GI217975147 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000746069 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAAA TTCAGGTTGA TTTAGGTGTA CGTAGTTATC CCATTTACAT TGGCCAGAAT 
TTGATGAGTG ATGGCGAGAC CCTGTCTCGC TACCTGCTTA AAAAACGTAT TCTTATCGTC
ACCAATGAAA CTGTCGCGCC TTTGTATCTT AAACAGATAC AAGAGACGAT GGCTTCGTTT
GGTGAGGTAG AGAGTGTTAT CCTCCCCGAT GGCGAACAAT TCAAAGACTT AGCACATCTA
GATACTATTT TTACTGCATT GCTGCAGCAA AACTATGGTC GAGATTCTGT GCTGGTGGCT
TTGGGTGGCG GCGTAATTGG TGATATGACG GGCTTTGCCG CGGCATGTTA TCAACGTGGG
ATCGATTTTA TTCAAATTCC GACAACCCTA TTGTCGCAGG TGGATTCTTC CGTCGGCGGT
AAAACGGCTG TTAACCATCC TCTTGGTAAA AACATGATTG GGGCCTTTTA TCAGCCACAA
ATCGTGCTTA TCGATACTTT ATGTTTACAT ACGCTTCCAG CGCGCGAGTT TGCGGCGGGA
ATGGCGGAAG TCATCAAGTA TGGCATCATG TGGGATGCTG ATTTTTTTCA ATGGCTTGAA
GATAATGTAA CGGCACTAAA AACCTTAGAT GCCCAAGCAT TGATTTATGC TATCTCCCGT
TGCTGTGAGA TTAAGGCCGA TGTAGTTAGC CAAGACGAAA CTGAGCAGGG TGTACGTGCT
TTATTGAATC TGGGTCATAC CTTTGGTCAT GCGATTGAAG CCGAAATGGG CTACGGTAAT
TGGTTGCATG GTGAAGCCGT GTCAGCTGGC ACAGTCCTTG CTGCTCAAAC AGCTAAGGCA
CTGGGGCTTA TCGATGAGTC AATAGTTTGT CGTATCATAG AGTTACTACA AGCTTTTGAT
CTTCCAGTGA GTGCGCCGGA ATCTATGGAT TTCGACAGTT TCATTCAACA TATGCGACGC
GATAAAAAAG TTTTAGGCGG TCAGATTCGA CTGGTGCTCC CAACGGCTAT AGGCCGCGCG
GATGTGTTTA GTCAAGTCAC TGAATCTACC CTCGAACAGG TTATTCGCTG CGCATAA
 
Protein sequence
MKQIQVDLGV RSYPIYIGQN LMSDGETLSR YLLKKRILIV TNETVAPLYL KQIQETMASF 
GEVESVILPD GEQFKDLAHL DTIFTALLQQ NYGRDSVLVA LGGGVIGDMT GFAAACYQRG
IDFIQIPTTL LSQVDSSVGG KTAVNHPLGK NMIGAFYQPQ IVLIDTLCLH TLPAREFAAG
MAEVIKYGIM WDADFFQWLE DNVTALKTLD AQALIYAISR CCEIKADVVS QDETEQGVRA
LLNLGHTFGH AIEAEMGYGN WLHGEAVSAG TVLAAQTAKA LGLIDESIVC RIIELLQAFD
LPVSAPESMD FDSFIQHMRR DKKVLGGQIR LVLPTAIGRA DVFSQVTEST LEQVIRCA