Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_01620 |
Symbol | nifU |
ID | 7759125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 153425 |
End bp | 154363 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643803084 |
Product | Nitrogen fixation Fe-S cluster scaffold protein |
Protein accession | YP_002797400 |
Protein GI | 226942327 |
COG category | [C] Energy production and conversion |
COG ID | [COG0822] NifU homolog involved in Fe-S cluster formation |
TIGRFAM ID | [TIGR02000] Fe-S cluster assembly protein NifU |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGGATT ATTCGGAAAA AGTCAAAGAG CACTTCTACA ACCCCAAGAA TGCTGGAGCC GTGGAAGGCG CCAACGCCAT CGGCGACGTC GGATCGCTGA GTTGCGGTGA TGCGCTGCGC CTGACCCTGA AGGTGGACCC GGAAACCGAC GTGATTCTGG ATGCCGGCTT CCAGACCTTC GGCTGTGGTT CCGCCATCGC TTCCTCCTCG GCGCTGACCG AGATGGTCAA GGGCCTGACC CTGGACGAGG CGCTGAAGAT CAGTAACCAG GACATCGCCG ACTACCTCGA TGGCCTGCCG CCGGAGAAGA TGCACTGCTC GGTGATGGGC CGCGAAGCCC TGCAGGCCGC GGTGGCCAAC TACCGTGGCG AGACGATCGA GGACGACCAC GAAGAGGGCG CGCTGATCTG CAAGTGCTTC GCCGTCGACG AGGTGATGGT CCGCGATACC ATCCGTGCCA ACAAGCTGTC TACCGTTGAG GACGTGACCA ATTACACCAA GGCCGGCGGT GGCTGCTCCG CCTGCCACGA GGCTATCGAG CGCGTGCTGA CCGAAGAGCT GGCCGCTCGC GGTGAAGTCT TCGTCGCGGC CCCGATAAAG GCCAAGAAGA AGGTCAAGGT GCTCGCCCCC GAGCCGGCTC CCGCCCCGGT GGCCGAAGCC CCGGCGGCTG CCCCGAAGCT GAGCAACCTG CAGCGCATCC GTCGCATCGA GACCGTGCTG GCGGCGATCC GTCCGACCTT GCAGCGCGAC AAGGGCGACG TCGAACTGAT CGATGTCGAC GGCAAGAACG TTTATGTCAA GCTCACCGGC GCCTGCACCG GCTGCCAGAT GGCCAGCATG ACCCTCGGCG GCATCCAGCA GCGCCTGATC GAGGAGCTCG GCGAGTTCGT CAAGGTGATT CCGGTCAGCG CTGCGGCTCA CGCGCAGATG GAGGTCTGA
|
Protein sequence | MWDYSEKVKE HFYNPKNAGA VEGANAIGDV GSLSCGDALR LTLKVDPETD VILDAGFQTF GCGSAIASSS ALTEMVKGLT LDEALKISNQ DIADYLDGLP PEKMHCSVMG REALQAAVAN YRGETIEDDH EEGALICKCF AVDEVMVRDT IRANKLSTVE DVTNYTKAGG GCSACHEAIE RVLTEELAAR GEVFVAAPIK AKKKVKVLAP EPAPAPVAEA PAAAPKLSNL QRIRRIETVL AAIRPTLQRD KGDVELIDVD GKNVYVKLTG ACTGCQMASM TLGGIQQRLI EELGEFVKVI PVSAAAHAQM EV
|
| |