Gene Avin_45240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_45240 
SymbolaroB 
ID7763392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4589710 
End bp4590813 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content68% 
IMG OID643807372 
Product3-dehydroquinate synthase 
Protein accessionYP_002801613 
Protein GI226946540 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGACGT TACAGGTTGA TCTCGGCGAG CGCAGCTATC CCATCTATAT TGGAGCCGGC 
CTGCTGGACC GGGCGGAATG CATGGTTCCG CACCTGGCCG GTCGCCAAGT GGCGGTGGTC
ACCAACGAGA CCGTCGCGCC GCTCTATCTG GAGCGGTTGT CCCAAACCCT GGCCGGCCAT
GAGCTGACGC CCATCGTGCT GCCGGACGGC GAGGCCTTCA AGCACTGGGA AACCCTGCAG
AAGATTTTCG ACGGCCTGCT CGAGGCCCGG CACGACCGGC GCACCACCCT CATCGCCCTT
GGCGGGGGCG TCGTCGGCGA CATGGCGGGG TTTGCCGCCG CCTGCTACCA GCGCGGGGTC
GACTTCATCC AGATTCCGAC CACCTTGCTG TCCCAGGTGG ACTCCTCGGT CGGCGGCAAG
ACCGGCATCA ATCACCCGCT GGGCAAGAAC ATGATCGGCG CTTTCTATCA GCCGAAGGCC
GTGGTGATCG ATACCGCCAC GCTGGCGACC CTGCCGGAGC GGGAACTGTC CGCCGGCCTG
GCCGAGGTGA TCAAGTACGG CCTGATCTGC GACGAGCCCT TCCTCGGCTG GCTGGAAACC
CACATGGCGG CGCTTCGCGC CCTGGATGCG GCGGCCCTGA CCGAGGCCAT CGCCCGCTCC
TGCGCGGCCA AGGCGCGGGT GGTCGGTGTC GACGAACGCG AGTCGGGCGT GCGCGCCACC
CTGAACCTGG GGCACACCTT CGGTCATGCC ATCGAAACCC ACATGGGCTA TGGTGCATGG
CTGCACGGCG AGGCGGTGGC CGCGGGGTCC GTCATGGCCC TGGAGATGTC CCGGCGCCTC
GGCTGGCTCG GCGAGGCCGA GCGCGATCGC GGCATCCGCC TGCTGCAGCG CGCGGGACTG
CCCGTGGTGC CGCCGGCGCA GATGTCGGCG GACGACTTCC TCGGGCACAT GGCCGTCGAC
AAGAAGGTTC TGGGCGGCCG CCTGCGGCTG GTGCTGCTCA GGCGCCTGGG CGAGGCGGTG
GTGACTGGAG ACTTCCCGCA CGAGGCCTTG CAGGCCACCC TGGATACCGA TTACCGGACT
CTGATGGAAC AGATCAGTCA CTGA
 
Protein sequence
MQTLQVDLGE RSYPIYIGAG LLDRAECMVP HLAGRQVAVV TNETVAPLYL ERLSQTLAGH 
ELTPIVLPDG EAFKHWETLQ KIFDGLLEAR HDRRTTLIAL GGGVVGDMAG FAAACYQRGV
DFIQIPTTLL SQVDSSVGGK TGINHPLGKN MIGAFYQPKA VVIDTATLAT LPERELSAGL
AEVIKYGLIC DEPFLGWLET HMAALRALDA AALTEAIARS CAAKARVVGV DERESGVRAT
LNLGHTFGHA IETHMGYGAW LHGEAVAAGS VMALEMSRRL GWLGEAERDR GIRLLQRAGL
PVVPPAQMSA DDFLGHMAVD KKVLGGRLRL VLLRRLGEAV VTGDFPHEAL QATLDTDYRT
LMEQISH