Gene Sare_1840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1840 
SymbolaroB 
ID5704703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2124955 
End bp2126028 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content72% 
IMG OID641271341 
Product3-dehydroquinate synthase 
Protein accessionYP_001536716 
Protein GI159037463 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.766033 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAGG TTACCCGGAT CGCGGTTGGT GGCGACCGAC CGTACGACGT GCTGGTGGGG 
CGTGACCTGT TCGACCCGCC GCAGTTGCTG CCCGGCGCGC AGCGGCTGGC CATCCTGTTC
GCGCCGCCGT TGCAGGGCCG GGCCGAACAG GTGGCGGAGC GGACCCGGAT GGCCGGGGTG
GCGCCACTGC TGATCGAGGT GCCCGACGCC GAGGCGGGCA AGCACATCGA TGTGGCCGCC
GCCTGCTGGG AGCGGCTCGG CGCCGCGGGT TTCACCCGCA CCGACGCCGT CGTCGGTGTG
GGTGGCGGCG CGGTCACCGA CCTGGCCGGG TTCGTTGCGG CCTGCTGGCT GCGTGGGGTG
CGCTGGGTGC CGGTGGCGAC GTCGCTGCTG GGCATGGTCG ACGCGGCCGT GGGCGGCAAG
ACCGGGGTCA ACACTGCCGC CGGCAAGAAC CTGGTCGGCG CCTTTCACCC GCCGGCCGGG
GTGATCTGTG ATCTGGCCAC GTTGGACACC TTGCCCCCGG CTGACCTGGC CGCCGGGATG
GCCGAGGTGG TCAAGTGCGG CTTCATCGCC GACCCGGTGA TCCTTGAGCT GGTCGAGCGG
GAGCCCGCCG CCGCCGTGGA CCCGGCAGGT CCGGTGCTCC GGGAGCTCGT CGAGCGGGCG
ATCCAGGTCA AGGCGCACGT CGTCGCCGGT GATTTTCGTG AGTCGGGGGC CCGGGAGGTG
CTGAACTACG GGCACACCCT GGCGCACGCG ATCGAGAAGG TGGAGGGCTA CCGCTGGCGG
CACGGTCACG CGGTGGCGGT GGGCCTGGTC TACGCGGCGA CCCTGGCCCG GCTCGCCGGT
CGGCTGGACG CGCAGACCGA GCAGCGGCAC CGGGCTGTGG TGGGCGCCCT TGGTCTGCCC
ACCAGCTACC GGTCGGACGC CTGGCCGGAA GTGCTCGCCA CGATGCGGGT GGACAAGAAG
GCGCGGGGCA ACGTCCTGCG TTTCGTGGTG CTGACCGGTC TCGCTCACCC GACGATCCTG
GAGGCGCCCT CCGACGAGCT GCTGCACGCG GCCTACCGGG AGATTGCCCC ATGA
 
Protein sequence
MDEVTRIAVG GDRPYDVLVG RDLFDPPQLL PGAQRLAILF APPLQGRAEQ VAERTRMAGV 
APLLIEVPDA EAGKHIDVAA ACWERLGAAG FTRTDAVVGV GGGAVTDLAG FVAACWLRGV
RWVPVATSLL GMVDAAVGGK TGVNTAAGKN LVGAFHPPAG VICDLATLDT LPPADLAAGM
AEVVKCGFIA DPVILELVER EPAAAVDPAG PVLRELVERA IQVKAHVVAG DFRESGAREV
LNYGHTLAHA IEKVEGYRWR HGHAVAVGLV YAATLARLAG RLDAQTEQRH RAVVGALGLP
TSYRSDAWPE VLATMRVDKK ARGNVLRFVV LTGLAHPTIL EAPSDELLHA AYREIAP