Gene Avin_23130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_23130 
SymbolaroG 
ID7761229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2307362 
End bp2308438 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content60% 
IMG OID643805195 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_002799476 
Protein GI226944403 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.118954 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGATT TACCGATCAA TGACATCAAC GTTGCCTCCA ACGACCCCCT GATCACCCCA 
GAGCAGCTCA AGGCAGAGAT CCCCCTGAGC GCTGCCGCCA TGAACACCGT CTCCCGCGGC
CGCGAAGTCA TCCGCAATAT CCTCGACGGC AAGGATCATC GCCTGTTCCT GGTGGTCGGC
CCGTGCTCCA TCCACGATGT CAAGGCGGCA CACGAATACG CCGAACGGCT CAAGAAGCTC
GCAGCCGAAG TTTCCGACAC CCTGTTTCTG GTGATGCGTG TCTATTTCGA GAAACCCCGC
ACCACCGTCG GTTGGAAGGG CCTCATCAAC GATCCCTATC TGGACGACAC CTTCAAGATC
CAGGAAGGGC TGCATATCGC CCGCCAATTG CTGCTCGACA TCGCCGAAAC GGGCTTGCCG
AGCGCTGGCG AAGCCCTGGA CCCGATTTCC CCACAGTATC TGCAGGACCT GTTCAGTTGG
TCGGCCATCG GTGCCCGCAC CACGGAATCC CAGACACACC GCGAGTTGGC CTCCGGCCTG
TCTTCTGCCG TCGGTTTCAA GAACGGCACG GACGGCAGCC TGACCGTGGC GATCAACGCG
CTGCAATCGG TATCCAGGCC CCACCGTTTC CTGGGCATCA ACCAGCAGGG CAGCGTTTCC
ATCGTGACGA CCAAGGGCAA TACCTATGGG CACGTCGTTC TGCGCGGCGG CAATGGCAAA
CCCAACTACG ACTCGGTCAA CGTCGCCATC TGCGAGCAGG AGCTGCGCAA GGCCGGTATC
CTGCCGAATA TCATGGTGGA CTGCAGCCAC GCCAATTCGA ACAAGGATCC GGCCCTGCAA
CCCCTGGTGA TGACCAACGT CGCCAACCAG ATTCTCGAAG GCAATTCATC CATCATAGGT
CTGATGGTGG AGAGCAACCT GGGCTGGGGC AGCCAGTCGA TTCCCGACAA TCTGGACGAC
CTCAAGTACG GAGTCTCCGT CACCGACGCC TGCATCGACT GGGACACCAC AGCCACGGCG
ATACGCGACA TGCACGCCAA ACTCAAGGAT ATCCTGCCGA ACCGGAAACG CTCCTGA
 
Protein sequence
MADLPINDIN VASNDPLITP EQLKAEIPLS AAAMNTVSRG REVIRNILDG KDHRLFLVVG 
PCSIHDVKAA HEYAERLKKL AAEVSDTLFL VMRVYFEKPR TTVGWKGLIN DPYLDDTFKI
QEGLHIARQL LLDIAETGLP SAGEALDPIS PQYLQDLFSW SAIGARTTES QTHRELASGL
SSAVGFKNGT DGSLTVAINA LQSVSRPHRF LGINQQGSVS IVTTKGNTYG HVVLRGGNGK
PNYDSVNVAI CEQELRKAGI LPNIMVDCSH ANSNKDPALQ PLVMTNVANQ ILEGNSSIIG
LMVESNLGWG SQSIPDNLDD LKYGVSVTDA CIDWDTTATA IRDMHAKLKD ILPNRKRS