Gene Information |
Locus tag | Avin_49020 |
Symbol | anfA |
ID | 7763760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4961738 |
End bp | 4963351 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643807741 |
Product | sigma54-dependent transcriptional activator for the iron only nitrogenase, AnfA |
Protein accession | YP_002801976 |
Protein GI | 226946903 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
|
|
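The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, which the record does not state but which reproduces the listed Gene Length. It also assumes that the CDS ends in a single untranslated stop codon.

```python
# Values copied from the Gene Information table.
start_bp = 4961738
end_bp = 4963351

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1614 537
```

Both results match the Gene Length (1614 bp) and Protein Length (537 aa) fields.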
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATC AGGCAACCTT CGAGTTCGAC ACGGATTACT TCGTCGAGGA GTTCAGCCAC TGCTTCACCG GCGAGTGCCG GGTGAAGATG CTCCCCATCC TGTACAAGAT CAGCCAGATC ATTACCGGCA ATGCCGATCT GGCCGATGCC CTGTCCATCG TTCTGGGCGT CATGCAGCAG CACCTGAAGA TGCAGCGCGG CATCGTCACG CTCTACGACA TGCGTGCCGA AACCATCTTC ATCCACGATA GCTTCGGCTT GACGGAAGAG GAGAAGAAAC GCGGCATCTA CGCTGTCGGC GAGGGCATCA CCGGCAAGGT GGTGGAAACG GGCAAGGCCA TTGTCGCCCG GCGTTTGCAG GAGCATCCGG ACTTTCTGGG GCGCACCCGG GTATCCCGCA ACGGCAAGGC CAAGGCAGCG TTTTTCTGTG TGCCCATCAT GCGTGCCCAG AAAGTGCTGG GCACCATCGC TGCGGAGCGG GTTTACATGA ATCCGCGCCT GCTCAAGCAG GATGTCGAAC TGCTGACCAT GATCGCCACC ATGATCGCTC CCCTGGTCGA GTTGTACCTG ATCGAGAACA TCGAAAGGGT GCGTCTGGAA AACGAAAACC GTCGCCTGAA GCATGCGTTG AAGGAGCGTT TCAAGCCTTC CAACATCATC GGCAATTCCA AGCCCATGCA GGAGGTGTAC GAACTGATTC ACAAGGTGGC CTCCACCAAG GCCACGGTGC TGATCCTGGG GGAAAGCGGA GTCGGCAAGG AACTGGTGGC CAACGCCATC CACTACAACA GCCCCAATGC CGAGGCTGCT CTCGTCACCT CCAACTGCGC GGCGCTGCCG GAGAATCTGG CGGAAAGCGA ACTCTTCGGC CATGAAAAGG GCTCCTTTAC CGGAGCCTTG ACCATGCACA AGGGCTGTTT CGAGCAGGCC GATGGCGGCA CCATCTTCCT GGACGAGGTC GGCGAACTGA GCCCGACGGT CCAGGCCAAG CTGCTGCGGG TGCTGCAGAA CCGTACCTTC GAGCGGGTAG GGGGCAGCAA GCCGGTCAAG GTGGATGTGC GCATCATCGC GGCGACCAAC CGCAACCTGG TGGAAATGGT GGAGCAGGGC ACTTTTCGTG AAGACCTGTA TTACCGCCTC AATGTCTTTC CCATTACCGT CCCGCCGTTA CGTGAGCGGG GCAGCGACGT CATCGCCCTG GCAGATCACT TTGTCAGCGC CTTTTCCAGG GAGAACGGCA AGAACGTCAA GCGCATCTCC ACGCCGGCGC TCAACATGTT GATGAGCTAT CACTGGCCGG GCAATGTCCG CGAACTGGAA AACGTCATGG AGCGTGCGGT CATTCTGTCC GATGACGATG TGATCCATAG CTACAATCTG CCCCCCTCCC TGCAGACCTC CAAAGAGAGC GGTACGGCCT TCGGGCTCAC CCTGGAGGAA AAGATAAAGG CTGTCGAGTG CGAGATGATT GTGGAGGCCT TGAAAAACTC CAGCGGTCAC ATAGGGGAGG CGGCCAAGGA GCTGGGTTTG ACGCGGCGCA TGCTTGGCGT ACGGATGGAG CGTTATGGCA TCAGTTATAA AAGTTTTCGC GGTACGCATG ATACCGATGA GTGA
|
Protein sequence | MSDQATFEFD TDYFVEEFSH CFTGECRVKM LPILYKISQI ITGNADLADA LSIVLGVMQQ HLKMQRGIVT LYDMRAETIF IHDSFGLTEE EKKRGIYAVG EGITGKVVET GKAIVARRLQ EHPDFLGRTR VSRNGKAKAA FFCVPIMRAQ KVLGTIAAER VYMNPRLLKQ DVELLTMIAT MIAPLVELYL IENIERVRLE NENRRLKHAL KERFKPSNII GNSKPMQEVY ELIHKVASTK ATVLILGESG VGKELVANAI HYNSPNAEAA LVTSNCAALP ENLAESELFG HEKGSFTGAL TMHKGCFEQA DGGTIFLDEV GELSPTVQAK LLRVLQNRTF ERVGGSKPVK VDVRIIAATN RNLVEMVEQG TFREDLYYRL NVFPITVPPL RERGSDVIAL ADHFVSAFSR ENGKNVKRIS TPALNMLMSY HWPGNVRELE NVMERAVILS DDDVIHSYNL PPSLQTSKES GTAFGLTLEE KIKAVECEMI VEALKNSSGH IGEAAKELGL TRRMLGVRME RYGISYKSFR GTHDTDE
|
| |
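The gene and protein sequences above can be related by translating a few codons from each end. The following is a minimal sketch in Python, not part of the original record. It assumes the listed gene sequence is already in the coding (5' to 3') orientation even though the gene lies on the minus strand, as its leading ATG suggests. The helper `translate` is hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for all non-start codons.

```python
# Compact codon table: bases in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 0; '*' marks a stop codon.

    Whitespace is ignored so the 10-base blocks can be pasted as shown.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three and last three blocks of the gene sequence in the record.
first_blocks = "ATGAGCGATC AGGCAACCTT CGAGTTCGAC"
last_blocks = "GGTACGCATG ATACCGATGA GTGA"

n_terminus = translate(first_blocks)
c_terminus = translate(last_blocks)
print(n_terminus)  # prints: MSDQATFEFD
print(c_terminus)  # prints: GTHDTDE*
```

The two fragments agree with the start (MSDQATFEFD) and end (GTHDTDE) of the listed protein sequence, with the trailing TGA read as the stop codon.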