Gene RoseRS_4545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4545 
Symbol 
ID5211530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5700132 
End bp5701190 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content66% 
IMG OID640598123 
Product3-dehydroquinate synthase 
Protein accessionYP_001278826 
Protein GI148658621 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCATCG CGCACGGGGC GCTCGACTGT CTGCCAGATC ACCTGAACCG CATCGGTCTG 
CGCGGCGCGC TGTGGATCAT CAGCGACGAC CAGGTCTTCC CGCAGTATGC GCCTCCGTTG
ATCGAACGGT TGCGTGCAGC CGGGTACGAC GTCCAGGGAT CCACCGTACC TTCCGGCGAG
ACGAGCAAAG ACCTGGCGAT GGTTGCGCGC CTCTACGACT GGTTGATCGG CGGGGGGGTC
GAACGGCGCG ATGCAGTGCT GGCGCTGGGC GGCGGAGTCG TCGGCGACCT GGCGGGATTC
GTTGCGGCGA CGGTTCTGCG CGGCATTGCG CTTGTTCACC TGCCAACCAC CTTGCTGGCG
ATGGTCGACT CGGCAATCGG CGGCAAAACC GGGGTGAACC ATCCGCTCGG CAAGAACCTG
ATTGGCGCGT TTCACCAACC GCGCCTGACG CTCGCCGATA CCGCAACCCT GCGCACGCTT
CCGCCACGTG AACTGCGCGC CGGTTGGGCG GAGGTCATCA AACACGCTGT CATCCGCGAC
GCTGATCTGT TCGTGCGTCT GGAAACGATC GCAGGCGCCG AGCCGGACTC GCTTCAGGGT
GATGCGCTGG CAGCGATCAT CCGCCAGGCC GCCAGGGTCA AGATCGACAT TGTGAACGCC
GATGAGCGCG AAACCGGTGA ACGGATGCTG CTCAACTACG GGCACACGCT GGGGCATGCC
ATCGAAGCAG CGAGCGGCTA CGGCGACCTG CTCCACGGCG AGGCGGTTGC CATCGGGATG
CACCTGGAAG CGCAGATCGC CTGCCGCATG GGGATGGTCG AACCAGACTT CGTGGAACGC
CAGGAGCGCC TGTTGCACGC CTGGGGTCTG CCCACCGCCC TGCCGCCCGC GCTCGATATT
GATGACCTGC TCGAACGCAC CCTGCGCGAC AAGAAGGTGC GGGCAGGAAA GGTGCGCTGG
GCGCTGCCGC TGGGGATCGG GTCGGCGACC GTGCGCGACG ATGCACCCGA AACACTGGTG
CGCGCCGTGC TGGAAGAGGC GTATGCGCGA TCACCATAA
 
Protein sequence
MIIAHGALDC LPDHLNRIGL RGALWIISDD QVFPQYAPPL IERLRAAGYD VQGSTVPSGE 
TSKDLAMVAR LYDWLIGGGV ERRDAVLALG GGVVGDLAGF VAATVLRGIA LVHLPTTLLA
MVDSAIGGKT GVNHPLGKNL IGAFHQPRLT LADTATLRTL PPRELRAGWA EVIKHAVIRD
ADLFVRLETI AGAEPDSLQG DALAAIIRQA ARVKIDIVNA DERETGERML LNYGHTLGHA
IEAASGYGDL LHGEAVAIGM HLEAQIACRM GMVEPDFVER QERLLHAWGL PTALPPALDI
DDLLERTLRD KKVRAGKVRW ALPLGIGSAT VRDDAPETLV RAVLEEAYAR SP