Gene Saro_1899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1899 
SymbolaroB 
ID3917120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2011490 
End bp2012599 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content66% 
IMG OID640444643 
Product3-dehydroquinate synthase 
Protein accessionYP_497173 
Protein GI87199916 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.867319 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTAA TCCCCGTCGC CATCGCCGGA GCGCCCTATG AAGTGCGGAT CGAGGCCGGC 
GTTCTGGCCC GTGCGGGCGA ACATTGCCGC CCCTTTCTGC GCAAGGACCG CGTGGCTATC
GTCACCGACG AGCACGTCGC CGCAGAGTGG CGGGAAACCG TCACTGCCTC GTTCGACAGC
GTCGGCGTGC GCAGCGAATG GCTCGTTCTC CCCGCTGGCG AGAGCACCAA GAGCTGGGAA
CACCTCGCCC GCCTGGTCGA CTGGCTGCTG GAACAGGAAG TCGAGCGCAA GGACCGTATT
GTCGCGCTCG GCGGAGGCGT GATCGGCGAT CTCACCGGGT TTGCCGCCTC GATCGTCAAG
CGTGGCTGCG GTTTCATCCA GATCCCGACG ACCCTCCTGG CGCAGGTCGA TTCCAGCGTC
GGCGGAAAGA CCGCGATCAA CACGCCCGCT GGCAAGAACC TCGTCGGCGC GTTTCACCAG
CCTGCGCTGG TCCTCGCCGA TCCGCTCGCG CTCGACACGC TGCCGCTGCG CGATGTGCGG
GCCGGGTACG CCGAAGTGGT GAAATATGGC CTGATCGACG ATGCGCCCTT CTTCGAGTGG
TGTGAGGCGA ACGGCGCAAA GCTCCTCGCA GGCGACCTGG CAGCGCGCGA GACGGCCATC
GCACACAGCG TCGCGGCAAA GGCGCGGATC GTGGCGGCGG ACGAGAAGGA AATCGCCGGC
ATCCGTGCGC TACTCAATCT CGGGCACACT TTCGGGCACG CGCTCGAGGC CGAGACCGGC
TTTACCGATC GCCTGCACCA TGGCGAAGGC GTGGCGCTGG GGATGGTGCT GGCGGCACGA
TTCTCGGCGC GGCAGGGGCT GATGTCGAGG CAGGATGCCG AACGCGTGGC TCGCCATGTC
GAAGCGGTGG GCCTGCCTGC CACGCTCCGC GAGCTTGGGC TTTCCTGCGA CGGCCGCCGC
CTTGCCGATC ACATGCTTCA CGACAAGAAG ATGGACGCGG GCACATTGCC CTTCCTTCTC
ATGCGCGGGA TCGGGCAGAC CTTCCTGGCA AAGGACGTCG ATCTGACGGA AGTGGCCGCT
TTCCTCGACG AGGAACTCGC CAGAACCTGA
 
Protein sequence
MAVIPVAIAG APYEVRIEAG VLARAGEHCR PFLRKDRVAI VTDEHVAAEW RETVTASFDS 
VGVRSEWLVL PAGESTKSWE HLARLVDWLL EQEVERKDRI VALGGGVIGD LTGFAASIVK
RGCGFIQIPT TLLAQVDSSV GGKTAINTPA GKNLVGAFHQ PALVLADPLA LDTLPLRDVR
AGYAEVVKYG LIDDAPFFEW CEANGAKLLA GDLAARETAI AHSVAAKARI VAADEKEIAG
IRALLNLGHT FGHALEAETG FTDRLHHGEG VALGMVLAAR FSARQGLMSR QDAERVARHV
EAVGLPATLR ELGLSCDGRR LADHMLHDKK MDAGTLPFLL MRGIGQTFLA KDVDLTEVAA
FLDEELART