Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_51000 |
Symbol | nifA |
ID | 7763948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 5170162 |
End bp | 5171730 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643807928 |
Product | Nif-specific sigma54-dependent transcriptional activator protein, NifA |
Protein accession | YP_002802162 |
Protein GI | 226947089 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.519777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCAA CCATCCCTCA GCGCTCGGCC AAACAGAACC CGGTCGAACT CTATGACCTG CAATTGCAGG CCCTGGCGAG CATCGCCCGC ACGCTCAGCC GCGAACAACA GATCGACGAA CTGCTCGAAC AGGTCCTGGC CGTACTGCAC AATGACCTCG GCCTGCTGCA TGGCCTGGTG ACCATTTCCG ACCCGGAACA CGGCGCCCTG CAGATCGGCG CCATCCACAC CGACTCGGAA GCGGTGGCCC AGGCCTGCGA AGGCGTGCGC TACAGAAGCG GCGAAGGCGT GATCGGCAAC GTGCTCAAGC ACGGCAACAG CGTGGTGCTC GGGCGCATCT CCGCCGACCC GCGCTTTCTC GACCGCCTGG CGCTGTACGA CCTGGAAATG CCGTTCATCG CCGTGCCGAT CAAGAACCCC GAGGGCAACA CCATCGGCGT GCTGGCGGCC CAGCCGGACT GCCGCGCCGA CGAGCACATG CCCGCGCGCA CGCGCCTTCT GGAGATCGTC GCCAACCTGC TGGCGCAGAC CGTGCGCCTG GTGGTGAACA TCGAGGACGG CCGCGAGGCG GCCGACGAGC GCGACGAACT GCGTCGCGAG GTGCGCGGCA AGTACGGCTT CGAGAACATG GTGGTGGGCC ACACCCCCAC CATGCGCCGG GTGTTCGATC AGATCCGCCG GGTCGCCAAG TGGAACAGCA CCGTACTGGT CCTCGGCGAG TCCGGTACCG GCAAGGAACT GATCGCCAGC GCCATCCACT ACAACTCGCC GCGCGCGCAC CGCCCCTTCG TGCGCCTGAA CTGCGCCGCG CTGCCGGAAA CCCTGCTCGA GTCCGAACTC TTCGGCCACG AGAAGGGCGC CTTCACCGGC GCGGTGAAGC AGCGCAAGGG GCGTTTCGAG CAGGCCGACG GCGGCACCCT GTTCCTCGAC GAGATCGGCG AGATCTCGCC GATGTTCCAG GCCAAGCTGC TGCGCGTGCT GCAGGAAGGC GAGTTCGAGC GGGTCGGCGG CAACCAGACG GTGCGGGTCA ACGTGCGCAT CGTCGCCGCC ACCAACCGCG ACCTGGAAAG CGAGGTGGAA AAGGGCAAGT TCCGCGAGGA CCTCTACTAC CGCCTGAACG TCATGGCCAT CCGCATTCCG CCGCTGCGCG AGCGTACCGC CGACATTCCC GAACTGGCGG AATTCCTGCT CGGCAAGATC GGCCGCCAGC AGGGCCGCCC GCTGACCGTC ACCGACAGCG CCATCCGCCT GCTGATGAGC CACCGCTGGC CGGGCAACGT GCGCGAACTG GAGAACTGCC TGGAGCGCTC GGCGATCATG AGCGAGGACG GCACCATCAC CCGCGACGTG GTCTCGCTGA CCGGGGTCGA CAACGAGAGC CCGCCGCTCG CCGCGCCGCT GCCCGAGGTC AACCTGGCCG ACGAGACCCT GGACGACCGC GAACGGGTGA TCGCCGCCCT CGAACAGGCC GGCTGGGTGC AGGCCAAGGC CGCGCGGCTG CTGGGCATGA CGCCGCGGCA GATCGCCTAC CGCATCCAGA CCCTCAACAT CCACATGCGC AAGATCTGA
|
Protein sequence | MNATIPQRSA KQNPVELYDL QLQALASIAR TLSREQQIDE LLEQVLAVLH NDLGLLHGLV TISDPEHGAL QIGAIHTDSE AVAQACEGVR YRSGEGVIGN VLKHGNSVVL GRISADPRFL DRLALYDLEM PFIAVPIKNP EGNTIGVLAA QPDCRADEHM PARTRLLEIV ANLLAQTVRL VVNIEDGREA ADERDELRRE VRGKYGFENM VVGHTPTMRR VFDQIRRVAK WNSTVLVLGE SGTGKELIAS AIHYNSPRAH RPFVRLNCAA LPETLLESEL FGHEKGAFTG AVKQRKGRFE QADGGTLFLD EIGEISPMFQ AKLLRVLQEG EFERVGGNQT VRVNVRIVAA TNRDLESEVE KGKFREDLYY RLNVMAIRIP PLRERTADIP ELAEFLLGKI GRQQGRPLTV TDSAIRLLMS HRWPGNVREL ENCLERSAIM SEDGTITRDV VSLTGVDNES PPLAAPLPEV NLADETLDDR ERVIAALEQA GWVQAKAARL LGMTPRQIAY RIQTLNIHMR KI
|
| |