Gene Avin_11420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_11420 
Symbol 
ID7760084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1092056 
End bp1094050 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content69% 
IMG OID643804044 
Productsigma54-dependent activator protein 
Protein accessionYP_002798346 
Protein GI226943273 
COG category[K] Transcription
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3284] Transcriptional activator of acetoin/glycerol metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCCCCG TGCCGGAAAA CGACGACCGC ATCAAGGTTT TGTGGGAGAG GTTCCTCAAT 
GGCGGCGAGC CCGGATCGGA TGCGCTGCGC CGCCTGATCG ACGACTCCTG GCGGCGCTGC
CTCGGCGCCA GCGTCGACCC GGACCGCTTC CAGGCGCCGC CGCCGCTCAG CGAGCATTCC
CTGCATTCCC TGCGCGACGA ATCCGCCGAC CTGCTGAGCG CCAGCGCGCC CATCATGGCG
GCGGCGCGGG ACTTCCTCGC CGAAACCGGA ACGGTGATGG TGCTCGCCAA CCCGAGCGGC
ACCATCCTCA ATCTCGAAGG GGACGTGTCC ACCCGCGGCC TGGCGGAAAA CATCTATCTG
CTGTCGGGGG CGAACTGGAG CGAACTGGCC TGCGGCACCA ACGCGATCGG CACCGCGCTG
GAGGTCGGCC AGCCGGTGCA GATCCATTCG GCGGAACACT ATTGCGCCGG CATCAAGCGC
TGGTCCTGTT CGGCGACGGT GATCCGCGAC CCCTGCGACG GCAGCATCCT CGGCGTGGTC
GACGTGTCGG GGCTCAGCGC CTCCTACAGC CGCCACAGCC TCGCCCTGGT CGTGGCGACG
GCCGGGCGTA TCGAGAGCCG TCTGGCGAAA ATGGAAATGG ATGTCCGTTA CCGACTGCTG
GAAACCTGCA TCGGTCGCCT GTCGCCGAGC GCGACCGACG GCATCGTCGT GTTCGACCGG
CGCGGCCGCG CCATCCAGGC GAATGCGCGC GCCTCGGCGA TGGTGGCCGA CCTGGCCAGC
CGCGAGCCGC TCGGCTCGAC CGGCGAGCTC GGCGCCTTGT CGCTGAAATG GGAGAAAAAC
GGCGGGCTCG CCGAGCACCT GCCCGAATGG ATGGACCCGG ACTGGCTGCA GCCGGTGATC
GTCGGCAACC AGCATCTCGG CACGCTGCTG ACGCTGCCGA ACCGGCGCGC GCCCGGCGCC
GGCGCGGCCA GCCCGGCCGT CCTCGACGAG AGGTTCGACG AGAGCGGCTT CGACAAGGTG
GTCGGCGAAT CGACGGCGCT GCTGCAGGCG GTCGGCCGTG CCCGGCAGCT CGCGAAATCC
CGCGTCCCGG TCCTGCTGCT GGGCGAGACC GGCGTGGGCA AGGACGTGTT CGCGCGCGGC
ATACACGAAT CCGGCGCGAC CCGCGACGGC CCCTTCGTCG CCCTGAATTG CGGCGGCTTC
TCGCGCGAGC TGTTGACCAG CGAGCTGTTC GGCTACGTCG AAGGCTCGTT CACCGGCGCA
CGGCGCGGCG GCATGATGGG CAAGATCGAG GCGGCCGACG GCGGCACCCT GTTCCTCGAC
GAGATCGGCG AGATGCCCAT CGACCTCCAG CCCCATTTCC TCCGGGTGCT GGAGGAAGGC
GAGGTCTATC GCATCGGCGA GACCAAGCCG CGCAAGGTCA ATTTCCGCCT CGTCGCCGCG
ACCAACCGCG ACCTGCACAA GGAAATCCAG GCCGGCACCT TCCGCATGGA CCTCTTCTAC
CGGGTCGCGG TGACCAGCAT CCACATCCCG TCCCTGCGCG AGCGACTCGG GGACATTCCC
CTGCTCGGCG AACACTACCT CGACATCCTG ACCCGCCAGC ACGGGCTCGC ACCGCGGACG
CTGTCGGCCG GCGCGGTGAC CCTGCTGCAG CGCTACGCCT GGCCGGGCAA CATCCGCGAG
TTCCGCAACG TCATCGAAAG CATGCTGCTC ACCTCGCCGG CGAGCGTGCT CGGCGAAGCC
GACGTGCCGC TCGACGGGCG CTGCGTGGCC CACGCGACAC GCCAGCCGGA ACCCGACGCG
CAGGACGAGG CGCGCGATCT GAACGGCCTG GAAAGCGCCG AGCGCGAGGT CATTCGGCGA
ACCGTCAAGG GCTGTCGCGG CAACATGACC GCGGTGGCGC GGGAACTGGG CATAGCCAAG
AGCACCGTCT ACGCGAAGCT CAAGCGCTTC GGCCTGGAAA CCTACGTGGA AGATCTGCGC
AACTACCGCG CCTGA
 
Protein sequence
MFPVPENDDR IKVLWERFLN GGEPGSDALR RLIDDSWRRC LGASVDPDRF QAPPPLSEHS 
LHSLRDESAD LLSASAPIMA AARDFLAETG TVMVLANPSG TILNLEGDVS TRGLAENIYL
LSGANWSELA CGTNAIGTAL EVGQPVQIHS AEHYCAGIKR WSCSATVIRD PCDGSILGVV
DVSGLSASYS RHSLALVVAT AGRIESRLAK MEMDVRYRLL ETCIGRLSPS ATDGIVVFDR
RGRAIQANAR ASAMVADLAS REPLGSTGEL GALSLKWEKN GGLAEHLPEW MDPDWLQPVI
VGNQHLGTLL TLPNRRAPGA GAASPAVLDE RFDESGFDKV VGESTALLQA VGRARQLAKS
RVPVLLLGET GVGKDVFARG IHESGATRDG PFVALNCGGF SRELLTSELF GYVEGSFTGA
RRGGMMGKIE AADGGTLFLD EIGEMPIDLQ PHFLRVLEEG EVYRIGETKP RKVNFRLVAA
TNRDLHKEIQ AGTFRMDLFY RVAVTSIHIP SLRERLGDIP LLGEHYLDIL TRQHGLAPRT
LSAGAVTLLQ RYAWPGNIRE FRNVIESMLL TSPASVLGEA DVPLDGRCVA HATRQPEPDA
QDEARDLNGL ESAEREVIRR TVKGCRGNMT AVARELGIAK STVYAKLKRF GLETYVEDLR
NYRA