Gene Sbal195_4201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_4201 
SymbolaroB 
ID5756029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp4964949 
End bp4966025 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content46% 
IMG OID641290554 
Product3-dehydroquinate synthase 
Protein accessionYP_001556619 
Protein GI160877303 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000411195 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAAA TTCAGGTTGA TTTAGGTGTA CGTAGTTATC CCATTTACAT TGGCCAGAAT 
TTGATGAGTG ATGGCGAGAC CTTGTCTCGC TACCTGCTTA AAAAACGTAT TCTTATCGTC
ACCAATGAAA CTGTCGCGCC TTTGTATCTT AAACAGATAC AAGAGACGAT GGCTTCGTTT
GGTGAGGTAG AGAGTGTTAT CCTCCCCGAT GGTGAACAAT TCAAAGACTT AGCGCATCTA
GATACTATTT TTACTGCATT GCTGCAGCAA AACTATGGTC GAGATTCTGT GCTGGTGGCT
TTGGGTGGCG GCGTAATTGG TGATATGACG GGCTTTGCCG CGGCATGTTA TCAACGTGGG
ATCGATTTTA TTCAAATTCC GACAACCCTA TTGTCGCAGG TGGATTCTTC CGTCGGCGGT
AAAACGGCTG TTAACCATCC TCTTGGTAAA AACATGATTG GGGCCTTTTA TCAGCCACAA
ATCGTGCTTA TCGATACTTT ATGTTTACAT ACGCTTCCAG CGCGCGAGTT TGCGGCGGGA
ATGGCGGAAG TCATCAAGTA TGGCATCATG TGGGATGCTG ATTTTTTTCA ATGGCTTGAA
GATAATGTAA CGGCACTAAA AACCTTAGAT GCCCAAGCAT TGGTTTATGC TATCTCCCGT
TGCTGTGAGA TTAAGGCCGA TGTGGTTAGC CAAGACGAAA CTGAGCAGGG TGTACGTGCT
TTATTGAATC TAGGTCATAC CTTTGGTCAT GCGATTGAAG CCGAAATGGG CTACGGTAAT
TGGTTGCATG GTGAAGCCGT GTCAGCTGGC ACAGTCCTTG CTGCTCAAAC AGCTAAGGCA
CTGGGGCTTA TCGATGAGTC AATAGTTTGT CGTATCATAC AGTTACTACA AGCTTTTGAT
CTTCCAGTGA GTGCGCCGGA ATCTATGGAT TTCGACAGTT TCATTCAACA TATGCGACGC
GATAAAAAAG TTTTAGGCGG TCAGATTCGA CTGGTGCTCC CAACGGCAAT AGGCCGCGCG
GATGTGTTTA GTCAAGTCAC AGAATCTACC CTTGAACAGG TTATTCGCTG CGCATAA
 
Protein sequence
MKQIQVDLGV RSYPIYIGQN LMSDGETLSR YLLKKRILIV TNETVAPLYL KQIQETMASF 
GEVESVILPD GEQFKDLAHL DTIFTALLQQ NYGRDSVLVA LGGGVIGDMT GFAAACYQRG
IDFIQIPTTL LSQVDSSVGG KTAVNHPLGK NMIGAFYQPQ IVLIDTLCLH TLPAREFAAG
MAEVIKYGIM WDADFFQWLE DNVTALKTLD AQALVYAISR CCEIKADVVS QDETEQGVRA
LLNLGHTFGH AIEAEMGYGN WLHGEAVSAG TVLAAQTAKA LGLIDESIVC RIIQLLQAFD
LPVSAPESMD FDSFIQHMRR DKKVLGGQIR LVLPTAIGRA DVFSQVTEST LEQVIRCA