Gene Avin_15850 details

Gene Information

Locus tag: Avin_15850
Symbol: aroA
ID: 7760520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1559155
End bp: 1561413
Gene Length: 2259 bp
Protein Length: 752 aa
Translation table: 11
GC content: 68%
IMG OID: 643804485
Product: bifunctional cyclohexadienyl dehydrogenase/3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_002798775
Protein GI: 226943702
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
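
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1559155, 1561413)
print(length_bp)                  # 2259
print(protein_length(length_bp))  # 752
```

Both values agree with the Gene Length and Protein Length fields of this record.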


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCTGAGC ATGCATTCCC GACTGCCGCC GCGACGCAGC GGAAAGAGCC CGGGGTCGGC 
CGCCTGGTCG TGGTGGGACT GGGGCTGATC GGCGGCTCCT TCGCCAAGGG GGTCCGCGAG
CGTGGCTTGT GTCGCGAGGT CGTCGGCGTG GATCTGGACC CGCATACCCG CGAACTGGCC
GTGGCCCAGG GTGTGGTCGA CCGTTGCGAG GAGGACCTCG CCGTCGCCTG CCGGGGCGCG
CAGGTGATCC AGCTCGCCGT CCCCATCCTG GCCATGGAGA AGCTGCTCGG CGTACTCGCC
GGGCTCGCTC TGGGCGACGC CGTCATCACC GATGTCGGCA GCTCCAAGGG ATATGTGCTG
CAGGCGGCGC GGCGGGCCTT CGCCGGCCGG CCCTTGAACT TCGTGCCGGG CCATCCGATC
GCCGGCTCCG AGCGCAGCGG CGTGGCGGCG GCCGACAGCG AACTGTTCCT CCATCACAAG
GTCATCCTGA CACCGGTGCA GGAGAGCGAG CCCCAGGCAC TGGCGCTGGT CGACGGACTC
TGGCGGGGGC TTGGGGCGGT GGTCGAGCGC ATGGCCGTCG AGCGCCACGA CGAGGTGCTC
GCCGCCACCA GCCATCTGCC GCATCTGCTG GCCTTCGGCC TCGTGGACTC GCTGGCCAAG
CGCAACGAAA ACCTGGAAAT CTTTCGCTAT GCCGCCGGCG GCTTTCGCGA TTTCACACGG
ATCGCCGGCA GCGATCCGGT GATGTGGCGC GATATCTTCC TCGCCAACCG CGAGGCGGTA
TTGCGCACTC TCGATGTCTT TCGCAGCGAT CTCGACGCCC TGCGCGCCGC GGTCGACGCA
GGGGACGGGC ATCAGTTGCT GGGCGTATTC ACCCGCGCCC GCGTTGCCCG TGAGCATTTC
AGCAAGATAC TGGCCCGTCG GGCCTATGTG GATGCCATGC ACGATAACGA CCTGATCTAC
CTGGCCCTGC CGGGCGGTCA TGCCCGCGGA CATATTCGTG TTCCCGGTGA CAAGTCCATT
TCTCACCGTT CGATCATGCT CGGCTCGCTG GCCGAGGGCA CTACCGAAGT GGAGGGCTTT
CTAGAGGGTG AGGATGCTCT CGCCACCATC CAGGCGTTCC GCGACATGGG GGTGGTCATC
GAAGGGCCTC ACCATGGCCG GGTGACCGTC CATGGTGTCG GCCTGCATGG CCTGAAGGCA
CCGCCGGGGC CGCTGTACCT GGGCAACTCG GGCACCTCCA TGCGCCTGCT GTCGGGCCTG
CTCGCGGCCC AGCCGTTCGA CACCGTGCTG ACCGGCGACG CCTCGCTCTC CAAGCGGCCG
ATGAACCGTG TGGCCAAACC CCTGCGCGAG ATGGGGGCGG AGATCGAAGC CGGGCCGGAA
GGGCGTCCGC CGCTGACCAT CAAAGGCGGG CGCAGGCTCA CCGGCATGCA TTACCAGATG
CCCATGGCCA GCGCGCAGGT GAAGTCCTGC CTGCTGCTCG CCGGCCTCTA TGCCGGCGGC
GAAACCTCGG TCAGCGAGCC GGCGCCGACC CGCGACCACA CCGAGCGCAT GCTGCGCGGT
TTCGGTTACC CGGTGAAGGT CGAGGGCAGC AAGGTCACCG TCGAGTCCGG CCACAAACTG
CAGGCGACCC AGATCGAGGT GCCGGCGGAC ATCTCCTCGG CGACGTTCTT CCTGGTGGCC
GCAACCATCG CCGAAGACTC CGAACTGCTG CTCGAGCACG TCGGCATCAA CCCGACCCGC
ACCGGGTCGA TCGAGATCCT CAAGCTGATG GGGGCCGACA TCACGCTGGA GAACCCGCGC
GAGGTGGGGG GCGAGCCGGT CGCCGACATC CGCGTGCGCT CCGCGCGCCT GCAGGGCATC
GAGATCCCCC TGGACCTGGT GCCGCTGGCG ATCGACGAGT TCCCCGTGCT GTTCGTCGCC
GCGGCCTGCG CCGAGGGACG CACCGTGCTG CGTGGCGCCG AGGAGCTGCG GGTCAAGGAG
TCCGACCGCA TCCAGGTGAT GGCCGACGGT TTGCGGGCAC TCGGCGTCAA GGCCGAGCCG
ACCCCGGACG GCATCGCCAT CGAGGGCGGC CCGATCGGCG GGGGGGAGAT ATACAGCCAC
GGAGACCACC GCATCGCCAT GTCCTTCAGC ATCGCCTCGC TGCGTGCCAG CGCGCCGATC
CGCATCCATG ACTGCGCCAA CGTGGCCACC TCCTTCCCGA ATTTCATCGC CCTGGCCAAG
CATGTCGGCA TTCGCGTGGA CGAGGAGGGG GTGTCATGA
 
Protein sequence
MSEHAFPTAA ATQRKEPGVG RLVVVGLGLI GGSFAKGVRE RGLCREVVGV DLDPHTRELA 
VAQGVVDRCE EDLAVACRGA QVIQLAVPIL AMEKLLGVLA GLALGDAVIT DVGSSKGYVL
QAARRAFAGR PLNFVPGHPI AGSERSGVAA ADSELFLHHK VILTPVQESE PQALALVDGL
WRGLGAVVER MAVERHDEVL AATSHLPHLL AFGLVDSLAK RNENLEIFRY AAGGFRDFTR
IAGSDPVMWR DIFLANREAV LRTLDVFRSD LDALRAAVDA GDGHQLLGVF TRARVAREHF
SKILARRAYV DAMHDNDLIY LALPGGHARG HIRVPGDKSI SHRSIMLGSL AEGTTEVEGF
LEGEDALATI QAFRDMGVVI EGPHHGRVTV HGVGLHGLKA PPGPLYLGNS GTSMRLLSGL
LAAQPFDTVL TGDASLSKRP MNRVAKPLRE MGAEIEAGPE GRPPLTIKGG RRLTGMHYQM
PMASAQVKSC LLLAGLYAGG ETSVSEPAPT RDHTERMLRG FGYPVKVEGS KVTVESGHKL
QATQIEVPAD ISSATFFLVA ATIAEDSELL LEHVGINPTR TGSIEILKLM GADITLENPR
EVGGEPVADI RVRSARLQGI EIPLDLVPLA IDEFPVLFVA AACAEGRTVL RGAEELRVKE
SDRIQVMADG LRALGVKAEP TPDGIAIEGG PIGGGEIYSH GDHRIAMSFS IASLRASAPI
RIHDCANVAT SFPNFIALAK HVGIRVDEEG VS
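
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The following is a minimal sketch of how the record could be checked: it strips the whitespace, computes GC content, and translates the CDS. It assumes that translation table 11 shares the standard codon-to-amino-acid assignments and differs in its permitted start codons, with an alternative start such as GTG read as methionine at the first position. All names (`clean`, `gc_percent`, `translate`) are hypothetical helpers, not part of any existing tool.

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGTCTGAGC ATGCATTCCC GACTGCCGCC GCGACGCAGC GGAAAGAGCC CGGGGTCGGC"
print(translate(first_line))  # MSEHAFPTAAATQRKEPGVG
```

The printed residues match the first two blocks of the protein sequence, including the GTG start codon read as M. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.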