Gene Avin_33210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_33210 
SymbolaroF 
ID7762216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3394870 
End bp3395934 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content72% 
IMG OID643806186 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_002800450 
Protein GI226945377 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCAT CCGTTTCCGC CCAGCTCGCC ACCGCCGAGT CCATTCCCGC CCGCCGCAGC 
GCCCAGCCGC TGCCCAGGCC CTCGGTGCTG CGCCAGCGCC TGCCGCTGAC TCCCGCGCTC
ACCGAGCGCA TCCGGGCCGA CCGCGCCGCC ATCCGCGCGG TGCTCGACGG CCGGGACCCG
CGCCTTCTGG TGGTGGTCGG CCCCTGCTCG CTGCACGACC CCGACTCCGC CCTGGATTAC
ACCGCGCGCT TGGCCGAGCT GGCGCCGCAG GTCGACGACC GGTTGTTGCT GGTGATGCGC
GCCTATGTCG AGAAGCCGCG CACCACCGTC GGCTGGAAGG GGCTGGTCTA CGATCCGCAC
CTGGACGGCA GCGGCGACAT GGCCGAGGGC CTGCGGCTGT CGCGCCGACT GATGCTGGAC
ATTCTGGAAC TGGGCCTGCC GCTGGCCAGC GAACTGCTGC AGCCGCTGGC GGCCAGCTAC
TTCGACGACC TGCTGGGCTG GGCCGCCATC GGCGCGCGCA CCAGCGAGTC GCAGATCCAC
CGCGAGATGG TCAGCGGCCT GGATCTGCCG GTGGGCTTCA AGAACGGCAC CGACGGCAGC
CTGGGCATCG CCTGCGACGC CATGCGCTCG GCCGCCCATG CCCATCGGCA TTTCGGCATC
GACGAACTGG GCCATCCGGC CCTGCTGCAG ACCCGCGGCA ACCCGGATAC CCATCTGGTG
CTGCGCGGCG GCCACGGCGG ACCGAACCAC GACGCGGCCA GCGTCGCCGG CGCCCGTCAG
GCCCTGGAGC GCCAGGGCAT CGCCGCGCGG ATCATGGTCG ACTGCAGCCA CGCCAACAGC
GGCAAGGACC CGTTGCGCCA GCCGGCCGTG CTGGACGACG TGCTCGCGCA GCGCCTGGCC
GGCGATACCA GCCTGCGCGG GGTGATGCTG GAAAGCCATC TGTTCGACGG CTGCCAGCCG
CTGTCCGGCG AGCTGCGCTA CGGTGTCTCG ATCACCGACG GCTGTCTCGG CTGGAGCGCC
ACCGAACGGA TGCTGCTGGA CGCCGCCCGG CGCCTGCGCG CTTGA
 
Protein sequence
MNASVSAQLA TAESIPARRS AQPLPRPSVL RQRLPLTPAL TERIRADRAA IRAVLDGRDP 
RLLVVVGPCS LHDPDSALDY TARLAELAPQ VDDRLLLVMR AYVEKPRTTV GWKGLVYDPH
LDGSGDMAEG LRLSRRLMLD ILELGLPLAS ELLQPLAASY FDDLLGWAAI GARTSESQIH
REMVSGLDLP VGFKNGTDGS LGIACDAMRS AAHAHRHFGI DELGHPALLQ TRGNPDTHLV
LRGGHGGPNH DAASVAGARQ ALERQGIAAR IMVDCSHANS GKDPLRQPAV LDDVLAQRLA
GDTSLRGVML ESHLFDGCQP LSGELRYGVS ITDGCLGWSA TERMLLDAAR RLRA