Gene Avin_33440 details

Gene Information

Locus tag: Avin_33440
Symbol: vnfA2
ID: 7762240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3419317
End bp: 3420798
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 68%
IMG OID: 643806208
Product: sigma54-dependent activator protein
Protein accession: YP_002800472
Protein GI: 226945399
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID: [TIGR01817] Nif-specific regulatory protein
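
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive positions on the replicon and that the final codon of the CDS is a stop codon. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3419317
end_bp = 3420798
gene_length_bp = 1482
protein_length_aa = 493

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions, all three checks agree with the values listed in the record.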


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCGAAT GCCATACCGA TCTGTTGCCG CTGCTCGGCG AACTGGGGCG CATCCCCAGC 
GAGGGCAGCG ACCTGGCCGG CATCCTGCGG GTGCTGCTGG AGCTGATGCA GCGCCATTTG
AAGATGGCGC GTGGCATGGT GACCCTGCGC GATCCGGAGT CGGGGCGGAT CTTCGTCCAG
CAGGGCTGCG GCCTGAGCGA GGAGGAGGAG GCCTCGGGCT ACCTCGCTCT CGGCGAGGAT
ATCGTCGCCC AAGCGGTGGA CAGCGGGCGG ACGGTGGTAC TGCCCGGCGA AGCGGGCAAC
CAGTCCCTCC TCTGCGTGCC GATCCGCCGC GACCGCAAGG TGCTGGGCAC CATCGTCGCC
GAGCGCCATT ACGCCAACCG CCGGATGCTC GAGCTGGACG CGGAGATTCT CGCCATTCTC
GCCGCCCTCA CCGCCCAGGC GGTGGAGCTG CACCTGCAGG AGCACGTGCG CAAGGTCGCC
CTGGAGGACG AGAACCACCG TCTGCATTCA GCCTTGCAGA GCCGTTTCAA GCCCAGCAGC
ATCATCGGCA GCTCGCGGCC GCTGCAGGAA GTCCATAGGC TGATCGAGAA GGTCGCCCGC
TCGCGGGCCA CGGTGCTGAT CCTCGGCGAG AGCGGCGTGG GCAAGGAACT GGTAGCCAGT
GCCATCCACT ACAACGGCAG CAATGCCGAA GGGCCCTTCG TCAAGTTCAA CTGCGCCGCC
CTGCCGGAGA GCGTCATCGA GAGCGAGCTG TTCGGCCATG AGTACGGCGC CTTCACCGGA
GCGGCGACCC CGCGGCGCGG GCGCTTCGAG GCGGCCGACG GCGGCACCAT CTTCCTCGAC
GAGGTGGGCG AGCTGTCCCC GGCCATGCAG GCCAAGCTGC TGCGGGTGCT GCAGGAGAAG
AGCTTCGAAC GGGTCGGCGG CAACGTCACC CACCAGGTCG ACCTGCGCAT CCTCGCCGCC
ACCAGTCGCG ACCTGCGGGC GATGGTCGAG CAGAGCCGCT TCCGCGGGGA CCTCTACTAC
CGGCTCAACG TCTTCCCCAT CAGCGTGCCG CCGCTGCGCG AACGCGACTC CGACATCGCC
ATCCTGGCGG AGCATTTCGT CGCGCGCCTC TCCAGCAGGA TGGGCATCCC GGCGAAGCGC
ATTTCCACAC CGGCAATGGG CATGCTGATG TGCTACCACT GGCCGGGCAA CGTGCGCGAG
CTGGAGAACG TCATCGAGCG GGCGGTCCTT CTCTGCGAGG GTGCAGTCAT CGAACCGCAC
CATCTGCCGC CTTCGCTGCA GACCCCGGCG GTTTCCGAGA GCCCGTCCGC CGGCGGCATT
CTCGACGTCC GGCTGGGACA GGCCGAGTGT GAACTGATCG CCGAGGCGCT CAAGCGGCAC
AAGGGCAACA TGACCGAGTC CGCCGCCCAC CTGGGCCTGA CCCGGCGCGT CCTCGGCCTG
CGCATGGCCC GGCACAACCT GAACCACAAG GAGTTTCGGT GA
 
Protein sequence
MGECHTDLLP LLGELGRIPS EGSDLAGILR VLLELMQRHL KMARGMVTLR DPESGRIFVQ 
QGCGLSEEEE ASGYLALGED IVAQAVDSGR TVVLPGEAGN QSLLCVPIRR DRKVLGTIVA
ERHYANRRML ELDAEILAIL AALTAQAVEL HLQEHVRKVA LEDENHRLHS ALQSRFKPSS
IIGSSRPLQE VHRLIEKVAR SRATVLILGE SGVGKELVAS AIHYNGSNAE GPFVKFNCAA
LPESVIESEL FGHEYGAFTG AATPRRGRFE AADGGTIFLD EVGELSPAMQ AKLLRVLQEK
SFERVGGNVT HQVDLRILAA TSRDLRAMVE QSRFRGDLYY RLNVFPISVP PLRERDSDIA
ILAEHFVARL SSRMGIPAKR ISTPAMGMLM CYHWPGNVRE LENVIERAVL LCEGAVIEPH
HLPPSLQTPA VSESPSAGGI LDVRLGQAEC ELIAEALKRH KGNMTESAAH LGLTRRVLGL
RMARHNLNHK EFR
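
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the database's own pipeline. It builds the codon table from the NCBI amino-acid string for translation table 11, which assigns the same amino acids to codons as the standard code. The helper name `translate` is hypothetical, and the two input fragments are the first 30 and last 12 bases of the gene sequence above with the spacing removed.

```python
from itertools import product

# NCBI amino-acid string for translation table 11, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Hypothetical helper for illustration: spaces are ignored and any
    trailing bases that do not fill a whole codon are dropped.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases and last 12 bases of the gene sequence listed above.
first_30 = "ATGGGCGAAT GCCATACCGA TCTGTTGCCG"
last_12 = "GAGTTTCGGT GA"

print(translate(first_30))
print(translate(last_12))
```

The first fragment should correspond to the opening residues of the protein sequence, and the last fragment to its final three residues followed by the stop codon.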