Gene Avin_42190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_42190 
Symbol 
ID7763097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4247750 
End bp4249096 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content66% 
IMG OID643807072 
Productaromatic ring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_002801321 
Protein GI226946248 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0579776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGACA AACCGACAAC AACACTCAAA GCCATCGAAT CCACGCCGCC GCTGGCCGAC 
CTCGATCGCA TCTGCGACAT GCAGGAAGGC CGCATGGCCG GCAAGATCTT CTGGGACAGC
GAGATCTACG AACAGGAGCT GGAGAAGATC TTCGCCCGCT GCTGGCTGTT CGTCGCCCAC
GAGTCGCAGA TCCCCGAGCC CGGCGACTAC GTCGCCACCA CCCTGGGTGA GGACGAGGTA
CTGGTCGTGC GCCAGAAGGA CCGCTCGATC AAGGTCCTGA TCAACGCCTG CCCGCACCGC
GGCAACAAGG TCTGCTTCGC CGAGGCCGGC AACGCCCGCG GTTTCATCTG CAACTACCAC
GGCTGGGCCT TCGGCCCCGA CGGCGCGCTG CGCGGCATGC ACGAGTCGGG GGTCTACGAG
CAGAGCGGCT TCGACAAGTC GCGGCAAGGC CTGCGCGAGG CGCGGGTGGA CAGCTACAAG
GGCCTGGTGT TCGCCACCTT CGCGGCGGAC GCGCCGAGCC TCGCCGACTA CCTGGGGCCG
ATGACCTGGT ACCTGGACGT GATCCTCGAC AACGATGAGG GCGGCACCGA ATTCATCGGC
GGCTGCATCC GCTCCACCTA CGAATGCAAC TGGAAGATCG CCGCGGAAAA CTTCGTCGGC
GACATCCTGC ACGCCGGCTG GACCCACGAC TCCGCCGCCC AGGCGATGCT CGGCGGCTCG
GTGACCAAGG TCAGCGAACT GCCCGAGTCC TACCAGGTGA ACTGGAACGG CCACGGCTAC
GAATTCGCCC GCGACCTGGT GGGCAACGCC GCGGTGCTCG GCGAGAGCGC GATCAACAAG
TACCTGCACC TGCACAGCCC GAAGGCCGCC GAGCGCCTGG GCGAGTTCCG CGCGCGGATG
CTGGGCGCGG TGTCGTCCTT CACGGTGTTC CCCAACTTCT CGTTCCTGCC GGGGCAGAAC
ACCGTGCGCG TCTGGCAGCC GCGCGGCCCC AACAAGATCG AGCTGTACAC CTGGGTGATC
GTCAACAGGA ACGCCCCCGC GGAGGTGAAG GAGAAGTGGC GGCGCGGGGC GATGATGACC
TTCTCGCCGA CCGGGGTGTT CGAGATGGAC GACGGCGAGA ACTGGGAATA CTGCACCAAG
ACCAGCCGCG GCAAGGTCAC CCGCTACCAG GACCTGTACG TCGGCCTGGG CATGAACAGC
CGCCTGAGCG ATACCGAACT GCCGGGCAAC GTGTTCAGGG GCCAGTTGAA CGAAGCCAAC
GCCCGCGCCT ACTACCAACG CTGGAAAGAC CTGCTGCAGG CACGCACCTG GGCGGAAGTA
CCCGACCGCA ACGGCAAGCT GGACTGA
 
Protein sequence
MMDKPTTTLK AIESTPPLAD LDRICDMQEG RMAGKIFWDS EIYEQELEKI FARCWLFVAH 
ESQIPEPGDY VATTLGEDEV LVVRQKDRSI KVLINACPHR GNKVCFAEAG NARGFICNYH
GWAFGPDGAL RGMHESGVYE QSGFDKSRQG LREARVDSYK GLVFATFAAD APSLADYLGP
MTWYLDVILD NDEGGTEFIG GCIRSTYECN WKIAAENFVG DILHAGWTHD SAAQAMLGGS
VTKVSELPES YQVNWNGHGY EFARDLVGNA AVLGESAINK YLHLHSPKAA ERLGEFRARM
LGAVSSFTVF PNFSFLPGQN TVRVWQPRGP NKIELYTWVI VNRNAPAEVK EKWRRGAMMT
FSPTGVFEMD DGENWEYCTK TSRGKVTRYQ DLYVGLGMNS RLSDTELPGN VFRGQLNEAN
ARAYYQRWKD LLQARTWAEV PDRNGKLD