Gene Ajs_3644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAjs_3644 
Symbol 
ID4674439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax sp. JS42 
KingdomBacteria 
Replicon accessionNC_008782 
Strand
Start bp3851405 
End bp3852523 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content66% 
IMG OID639840677 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_987832 
Protein GI121595936 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCC ATGTCTCCAG CAGCGATGCC TGGTATCCCA ACGTCGAGAA AACCAGCGAG 
ACCGACGACC AACGCATCAA GGACATCACC GTGCTTCCTC CTCCAGACCA CCTGATTCGC
TTCTTCCCGA TCCGCGGGTC GGAGGTGGAG TCGCTGATTG CCAACACACG CCGCAGCATC
CAGCGCATCA TGAGCGGCGA GGACGACCGG CTGCTCGTCA TCATCGGCCC CTGTTCGATC
CACGACCCGC ACGCCGCGGT GGACTATGCG CGCAAGCTCA AGGCCGTGCG TGAGCAGTAC
AAGGATCAAC TGGAAGTGGT GATGCGCGTG TACTTCGAGA AGCCCCGCAC CACCGTGGGC
TGGAAGGGCC TGATCAACGA CCCCTACCTG GACGGCAGCT ACCGCATCGA CGAGGGCCTG
CGCATCGCGC GCCAGCTGCT CATCGACATC AACCGCCTGG GCGTGCCGGC GGGCAGCGAG
TTCCTGGACG TGATCTCGCC CCAGTACATC GGCGACCTGA TCAGTTGGGG CGCCATCGGC
GCGCGCACCA CCGAAAGCCA GGTCCACCGC GAACTGGCCT CCGGCATTTC GGCCCCCATC
GGCTTCAAGA ACGGCACCGA TGGCAACATC CGCATCGCCA CCGATGCCAT CCAGTCCGCC
AGCCGCGGCC ACCACTTCCT GTCGGTGCAC AAGAACGGCC AGGTGGCCAT CGTGCACACC
GCGGGCAACA AGGATTGCCA CGTGATCCTG CGCGGCGGCA AGGCGCCCAA CTACGACGCA
GCCAGCGTTG CCGCGGCCTG CAAGGACCTG GAGGCCGCCG GCCTGCCGGC GTCGCTGATG
GTGGACTGCA GCCATGCCAA CAGCAGCAAG CAGCACGAGA AGCAGCGCGA TGTGGCGCGC
GACATCGCCG CGCAGATCGC AGGCGGCTCG CGCAGCGTGT TCGGCGTGAT GGTGGAAAGC
CACCTGCAGC CCGGAGCGCA GAAGTTCACG CCGGGCAAGG ACGATGCCAC GACGCTGGAA
TACGGCAAGA GCATCACCGA TGCCTGCCTG GGATGGGACG ACTCCGTTGC CTGCCTGGCC
GAGCTGGCGG CCGCCGTGCA AGCGCGGCGC GCGCGTTGA
 
Protein sequence
MNAHVSSSDA WYPNVEKTSE TDDQRIKDIT VLPPPDHLIR FFPIRGSEVE SLIANTRRSI 
QRIMSGEDDR LLVIIGPCSI HDPHAAVDYA RKLKAVREQY KDQLEVVMRV YFEKPRTTVG
WKGLINDPYL DGSYRIDEGL RIARQLLIDI NRLGVPAGSE FLDVISPQYI GDLISWGAIG
ARTTESQVHR ELASGISAPI GFKNGTDGNI RIATDAIQSA SRGHHFLSVH KNGQVAIVHT
AGNKDCHVIL RGGKAPNYDA ASVAAACKDL EAAGLPASLM VDCSHANSSK QHEKQRDVAR
DIAAQIAGGS RSVFGVMVES HLQPGAQKFT PGKDDATTLE YGKSITDACL GWDDSVACLA
ELAAAVQARR AR