Gene Information |
Locus tag | Avin_33440 |
Symbol | vnfA2 |
ID | 7762240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3419317 |
End bp | 3420798 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643806208 |
Product | sigma54-dependent activator protein |
Protein accession | YP_002800472 |
Protein GI | 226945399 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
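The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3419317
end_bp = 3420798
gene_length_bp = 1482
protein_length_aa = 493

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the last one is assumed to be the stop codon.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, remainder, residues)
```

Under these assumptions the computed span matches the listed gene length, and the residue count matches the listed protein length.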
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGAAT GCCATACCGA TCTGTTGCCG CTGCTCGGCG AACTGGGGCG CATCCCCAGC GAGGGCAGCG ACCTGGCCGG CATCCTGCGG GTGCTGCTGG AGCTGATGCA GCGCCATTTG AAGATGGCGC GTGGCATGGT GACCCTGCGC GATCCGGAGT CGGGGCGGAT CTTCGTCCAG CAGGGCTGCG GCCTGAGCGA GGAGGAGGAG GCCTCGGGCT ACCTCGCTCT CGGCGAGGAT ATCGTCGCCC AAGCGGTGGA CAGCGGGCGG ACGGTGGTAC TGCCCGGCGA AGCGGGCAAC CAGTCCCTCC TCTGCGTGCC GATCCGCCGC GACCGCAAGG TGCTGGGCAC CATCGTCGCC GAGCGCCATT ACGCCAACCG CCGGATGCTC GAGCTGGACG CGGAGATTCT CGCCATTCTC GCCGCCCTCA CCGCCCAGGC GGTGGAGCTG CACCTGCAGG AGCACGTGCG CAAGGTCGCC CTGGAGGACG AGAACCACCG TCTGCATTCA GCCTTGCAGA GCCGTTTCAA GCCCAGCAGC ATCATCGGCA GCTCGCGGCC GCTGCAGGAA GTCCATAGGC TGATCGAGAA GGTCGCCCGC TCGCGGGCCA CGGTGCTGAT CCTCGGCGAG AGCGGCGTGG GCAAGGAACT GGTAGCCAGT GCCATCCACT ACAACGGCAG CAATGCCGAA GGGCCCTTCG TCAAGTTCAA CTGCGCCGCC CTGCCGGAGA GCGTCATCGA GAGCGAGCTG TTCGGCCATG AGTACGGCGC CTTCACCGGA GCGGCGACCC CGCGGCGCGG GCGCTTCGAG GCGGCCGACG GCGGCACCAT CTTCCTCGAC GAGGTGGGCG AGCTGTCCCC GGCCATGCAG GCCAAGCTGC TGCGGGTGCT GCAGGAGAAG AGCTTCGAAC GGGTCGGCGG CAACGTCACC CACCAGGTCG ACCTGCGCAT CCTCGCCGCC ACCAGTCGCG ACCTGCGGGC GATGGTCGAG CAGAGCCGCT TCCGCGGGGA CCTCTACTAC CGGCTCAACG TCTTCCCCAT CAGCGTGCCG CCGCTGCGCG AACGCGACTC CGACATCGCC ATCCTGGCGG AGCATTTCGT CGCGCGCCTC TCCAGCAGGA TGGGCATCCC GGCGAAGCGC ATTTCCACAC CGGCAATGGG CATGCTGATG TGCTACCACT GGCCGGGCAA CGTGCGCGAG CTGGAGAACG TCATCGAGCG GGCGGTCCTT CTCTGCGAGG GTGCAGTCAT CGAACCGCAC CATCTGCCGC CTTCGCTGCA GACCCCGGCG GTTTCCGAGA GCCCGTCCGC CGGCGGCATT CTCGACGTCC GGCTGGGACA GGCCGAGTGT GAACTGATCG CCGAGGCGCT CAAGCGGCAC AAGGGCAACA TGACCGAGTC CGCCGCCCAC CTGGGCCTGA CCCGGCGCGT CCTCGGCCTG CGCATGGCCC GGCACAACCT GAACCACAAG GAGTTTCGGT GA
|
Protein sequence | MGECHTDLLP LLGELGRIPS EGSDLAGILR VLLELMQRHL KMARGMVTLR DPESGRIFVQ QGCGLSEEEE ASGYLALGED IVAQAVDSGR TVVLPGEAGN QSLLCVPIRR DRKVLGTIVA ERHYANRRML ELDAEILAIL AALTAQAVEL HLQEHVRKVA LEDENHRLHS ALQSRFKPSS IIGSSRPLQE VHRLIEKVAR SRATVLILGE SGVGKELVAS AIHYNGSNAE GPFVKFNCAA LPESVIESEL FGHEYGAFTG AATPRRGRFE AADGGTIFLD EVGELSPAMQ AKLLRVLQEK SFERVGGNVT HQVDLRILAA TSRDLRAMVE QSRFRGDLYY RLNVFPISVP PLRERDSDIA ILAEHFVARL SSRMGIPAKR ISTPAMGMLM CYHWPGNVRE LENVIERAVL LCEGAVIEPH HLPPSLQTPA VSESPSAGGI LDVRLGQAEC ELIAEALKRH KGNMTESAAH LGLTRRVLGL RMARHNLNHK EFR
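Both sequences are displayed in space-separated blocks of 10 letters, so the spaces have to be removed before any analysis. The sketch below shows one way to do that and to check the first codons of the gene sequence against the start of the protein sequence. The helper names (`clean`, `split_codons`, `gc_fraction`) and the `PARTIAL_CODE` lookup are hypothetical and written for this illustration only; the lookup holds just the six codons needed here, which read the same in translation table 11 as in the standard code.

```python
def clean(seq: str) -> str:
    """Remove the spaces that split the displayed sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def split_codons(dna: str) -> list[str]:
    """Split a DNA string into complete codons, dropping any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return [dna[i:i + 3] for i in range(0, usable, 3)]


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# Partial codon lookup (assumption: only the entries needed for this prefix).
PARTIAL_CODE = {"ATG": "M", "GGC": "G", "GAA": "E", "TGC": "C", "CAT": "H", "ACC": "T"}

# First two displayed blocks of the gene sequence.
prefix = clean("ATGGGCGAAT GCCATACCGA")
peptide = "".join(PARTIAL_CODE[c] for c in split_codons(prefix))

print(peptide)
```

The resulting peptide can be compared with the first six residues of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field; a full translation would need the complete table 11 codon map, for example from an established bioinformatics library.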