Gene Sala_2212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2212 
Symbol 
ID4080170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2322722 
End bp2324095 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content65% 
IMG OID638010590 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_617254 
Protein GI103487693 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0659111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0554404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAAAA ACTGGCAACC GCATAGCTGG CGCACGCACG AGGCCCGCCA GCTGCCCACC 
TATCGCGACG CCGATGCACT CGCGGCCGCC GAACGCGAGC TGGCGAACTA TCCGCCGCTC
GTCTTTGCGG GCGAGGCGCG CGAACTGACG AACGAACTCG CGCGCGTCGC GGAGGGCAGG
GCGTTCCTGC TCCAGGGCGG CGACTGCGCC GAAAGCTTTG CCGAGTTCCA CCCGAACAAT
ATCCGCGACA CATTTCGTGT GCTGCTCCAG ATGGCGGTGG TGCTGACCTT CGCCTCGAAA
ATGCCCGTGG TAAAGCTCGG CCGCATGGCG GGGCAGTTCG CCAAGCCGCG TTCGGCGGAC
ATGGAAGAGG TGGATGGCGT CGCGCTGCCC AGCTATCGCG GCGACATCAT CAACGACATC
GCGTTCGAGG AAGCCGGCCG CGAGCCCGAC CCCGCGCGGA TGGTCAAGGC CTACAACCAG
TCGGCGGCGA CGCTCAACCT GCTGCGCGCT TTCGCGGGCG GTGGCTATGC CAATCTGCAC
CAGGTCAACG CCTGGACGCA CGACTTCATG GACCGCAGCC CGTGGGCGAA GAAATATCAG
GAAACCGCAG GCCGCATTTC CGAAGCGCTC GCCTTCATGG AAGCGTGCGG CGTGACGCCC
GAAACGGTTC CGCAGATCAA GGGCACCAGC TTCTACACCA GCCATGAGGC GCTGCTCCTC
CCCTATGAGC AGGCGCTGAC CCGGCAGGAC AGCCTGACCG GCGGCTGGTA CGACACATCG
GGCCACTTCC TGTGGGTCGG CGACCGCACC CGTTTCGAAG GATCAGCGCA TATCGAATAT
CTCCGCGGGA TCGGCAATCC GGTCGGCATG AAATGCGGAC CGAGCCTCGA ACCCGACGTG
CTGCTGCGCC TGCTCGACAC GCTGAATCCC AACCATGTGC CCGGCCGCAT GACGCTCATC
ACGCGCTATG GCCACGACAA GATCGAGGCG CATCTGCCCA GGCTCGTGCG CGCGGTGAAG
GAATCGGGCC ATCCCGTCGT CTGGTCGTGC GACCCGATGC ACGGCAATGT CATCAAGACC
TCGACCGGCT ACAAGACGCG CCCGTTCGAG CGCATCCTCG CCGAAGTGCG CGGCTTCTTC
GCCGTCCACC GCGCCGAGGG CACGCATGGC GGCGGCATCC ATATCGAGAT GACCGGCCAG
AATGTCACCG AATGCACCGG CGGCGCGATG GACGTGACCC AGATGGACCT TGCCGACCGC
TATCACACGC ATTGCGACCC GCGTTTGAAT GCGGGGCAGA GCCTCGAACT CGCCTTCCTG
CTGGCGGAGA TGCTCAATCA GGAAATGAGC GAGCGGGCGA AGCAGGCGGC GTAA
 
Protein sequence
MTKNWQPHSW RTHEARQLPT YRDADALAAA ERELANYPPL VFAGEARELT NELARVAEGR 
AFLLQGGDCA ESFAEFHPNN IRDTFRVLLQ MAVVLTFASK MPVVKLGRMA GQFAKPRSAD
MEEVDGVALP SYRGDIINDI AFEEAGREPD PARMVKAYNQ SAATLNLLRA FAGGGYANLH
QVNAWTHDFM DRSPWAKKYQ ETAGRISEAL AFMEACGVTP ETVPQIKGTS FYTSHEALLL
PYEQALTRQD SLTGGWYDTS GHFLWVGDRT RFEGSAHIEY LRGIGNPVGM KCGPSLEPDV
LLRLLDTLNP NHVPGRMTLI TRYGHDKIEA HLPRLVRAVK ESGHPVVWSC DPMHGNVIKT
STGYKTRPFE RILAEVRGFF AVHRAEGTHG GGIHIEMTGQ NVTECTGGAM DVTQMDLADR
YHTHCDPRLN AGQSLELAFL LAEMLNQEMS ERAKQAA