Gene Strop_1847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1847 
SymbolaroB 
ID5058306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2114113 
End bp2115186 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content73% 
IMG OID640474117 
Product3-dehydroquinate synthase 
Protein accessionYP_001158687 
Protein GI145594390 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAAGA CAACCCGAAT CGCGGTTGGT GGTGACCGAC CGTACGACGT GTTGGTGGGG 
CGGGACCTGC TCGACCCGCC ACAGTTGCTG CCGGGCGCGC AGCGGCTGGC CGTGCTGTAC
GCGCCGCCGA TGCGGGGCCG GGCCGAGCAG CTGGCGGAGC GGGCCCGGAT GGCCGGGGTG
ACGCCACTGC TGGTCGAGGT GCCGGACGCG GAGGCGGGCA AGCACATCGA GGTCGCCGCC
AGCTGCTGGG AACGGCTCGG TGCGGCGGGC TTCACCCGCG CCGACGCCGT CGTCGGTGTG
GGTGGCGGCG CGGTTACCGA CCTGGCTGGC TTCGTCGCGG CCTGCTGGCT GCGCGGGGTG
CGTTGGGTGC CGGTGGCGAC GTCGTTGCTG GGCATGGTTG ACGCGGCGGT GGGCGGCAAG
ACCGGGATCA ATACCGCCGC CGGCAAGAAC CTGGTCGGTG CCTTCCACCC GCCGGCCGGG
GTGATCTGCG ACCTGGCTGC TCTGGACAGC CTCTCCCCGG CCGACCTGGC CGCGGGAATG
GCCGAGGTGA TCAAGTGTGG CTTCATCGCC GACCCGGTGA TCCTCGAGCT GGTCGAGCGG
GACCCCGCCG TCGCCGTTGA CCCGGCGGGT CCGGTGCTCC GGGAGCTGAT CGAGCGGGCG
ATCCGGGTCA AGGCGCAGGT CGTCTCCGGT GACCTTCGCG AGTCGGGGGC CCGGGAGATC
CTCAACTACG GGCACACCCT GGCGCATGCC ATCGAGAAGG TGGAGGGCTA CCGCTGGCGG
CACGGCCACG CGGTCGCGGT GGGGCTGGTC TACGCGGCGA CCCTGGCCCT GCTCGAAGGC
CGGCTGGACG CGCAGACCGC GCAGCGGCAC CGGGCGGTGG TGGGCGCGCT CGGCCTGCCC
ACCGGATACC GGGCGGAAGC CTGGCCGGAC CTGCTCGCCA CGATGCGGGT GGACAAGAAG
GCGCGGGGCA GCGTCCTGCG CTTCGTGGTG TTGGCCGGTC TCGCCCACCC CACGATCCTC
GAGGCGCCCT CCGACGAACT GCTGCACGCC GCCTACCGGG AGATCGCCGA ATGA
 
Protein sequence
MDKTTRIAVG GDRPYDVLVG RDLLDPPQLL PGAQRLAVLY APPMRGRAEQ LAERARMAGV 
TPLLVEVPDA EAGKHIEVAA SCWERLGAAG FTRADAVVGV GGGAVTDLAG FVAACWLRGV
RWVPVATSLL GMVDAAVGGK TGINTAAGKN LVGAFHPPAG VICDLAALDS LSPADLAAGM
AEVIKCGFIA DPVILELVER DPAVAVDPAG PVLRELIERA IRVKAQVVSG DLRESGAREI
LNYGHTLAHA IEKVEGYRWR HGHAVAVGLV YAATLALLEG RLDAQTAQRH RAVVGALGLP
TGYRAEAWPD LLATMRVDKK ARGSVLRFVV LAGLAHPTIL EAPSDELLHA AYREIAE