Gene Sala_1800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1800 
SymbolaroB 
ID4082191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1896382 
End bp1897488 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content67% 
IMG OID638010175 
Product3-dehydroquinate synthase 
Protein accessionYP_616845 
Protein GI103487284 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGC TGACCGTCGA ACTGGGCGCC CGCAGCTATC CGATCCTGAT CGGCGACGGG 
CTGATCCGCG ACATTGGCGC GCATGTCGCG CCGCTGTTGA AGCGACCGCG GACCATGATC
GTCACCGACA GCCATGTCGC CGACCATTAT CTTGCACCCA TCGGCACCGC GCTGGCGATG
GAGAATATCG CCTGCTCCTC CTTCGTGCTC GACCCCGGCG AGGCGACAAA GAGCTGGTCC
GGCCTCGCGC GCCTCACCGA ATGGCTGATC GGCGAAGGCA TCGAACGCAG CGACCATGTC
ATCGCGCTGG GGGGCGGTGT GATCGGCGAT CTCGTCGGTT TTGCGTGCAG CATCGTCAAG
CGCGGCTGCG CCTTCATTCA GGTGCCGACG ACTTTGCTCG CACAGGTCGA CAGCAGCGTC
GGCGGCAAGA CCGCGATCAA TGTCCCTGCG GGCAAGAATC TGATCGGCGC CTTTCACCAG
CCGGCGATGG TGGTGATCGA CCCGACCACG CTCGAAACGC TGCCCCGCCG CGAACTCGGT
GCGGGCTATG CGGAAGTCGT CAAATATGGG CTGATCGACG ACGCGGACTT CTTCGCCTGG
TGCGAGGCGC ATGGCGCCGC GCTGCTGGCA GGCGACAGTG CGGCGCGGGC CCATGCGATC
GCGCACAGCG TTGCCGCCAA GGCGCGCATC GTCGCCGCCG ACGAGCGCGA GACGCAGGAT
ATTCGGGCGT TGCTCAACCT CGGCCACAGC TTCGGCCACG CGCTCGAGGC CGAAACCGGC
TATTCGGACC GGCTGCTCCA CGGCGAGGCG GTGGCGGCGG GCATGGTGCT CGCACACCAG
TTTTCGGCCG CGAACGGACT TTGCCCCGCC GCCGACGCGG CGCGTGTCCG CGACCATCTC
GCCAGCGTTG GCCTGCCGCA CAGCCTGGCA AGCGCGGGGA TCAATGGCGG CGGCGCTCAG
CTCGCCGCGC ATATGGCGCA CGACAAGAAG GTGCGCGGCG GCAGACTGCC GCTGATCCTG
TCGCGCGGCA TCGGGCAGAG CTTCGTCACC GACGCATATG ACCTAGATGC CGTCGCCGCC
TTCCTCGACG AGCAGCGTAG CGTATGA
 
Protein sequence
MEKLTVELGA RSYPILIGDG LIRDIGAHVA PLLKRPRTMI VTDSHVADHY LAPIGTALAM 
ENIACSSFVL DPGEATKSWS GLARLTEWLI GEGIERSDHV IALGGGVIGD LVGFACSIVK
RGCAFIQVPT TLLAQVDSSV GGKTAINVPA GKNLIGAFHQ PAMVVIDPTT LETLPRRELG
AGYAEVVKYG LIDDADFFAW CEAHGAALLA GDSAARAHAI AHSVAAKARI VAADERETQD
IRALLNLGHS FGHALEAETG YSDRLLHGEA VAAGMVLAHQ FSAANGLCPA ADAARVRDHL
ASVGLPHSLA SAGINGGGAQ LAAHMAHDKK VRGGRLPLIL SRGIGQSFVT DAYDLDAVAA
FLDEQRSV