Gene Information |
Locus tag | Avin_51010 |
Symbol | nifB |
ID | 7763949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 5172015 |
End bp | 5173526 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643807929 |
Product | Nitrogenase cofactor biosynthesis protein |
Protein accession | YP_002802163 |
Protein GI | 226947090 |
COG category | [R] General function prediction only |
COG ID | [COG0535] Predicted Fe-S oxidoreductases |
TIGRFAM ID | [TIGR01290] nitrogenase cofactor biosynthesis protein NifB |
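The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the stop codon; both are assumptions, though they agree with the values listed in this record. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three and,
    by default, whose last codon is a stop codon.
    """
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length_bp = gene_length(5172015, 5173526)
print(length_bp)                  # 1512
print(protein_length(length_bp))  # 503
```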
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0231468 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTGA GCGTACTTGG GCAAAACAAT GGGGGACAGC ACAGCGCTGG CGGCTGTTCC TCAAGTAGCT GCGGCAGCAC GCACGATCAG CTCTCCCACC TGCCGGAAAA CATTCGTGCG AAGGTGCAGA ACCATCCGTG CTATTCGGAA GAGGCGCACC ACTATTTCGC GCGCATGCAC GTGGCGGTGG CGCCTGCCTG CAACATCCAG TGCCACTACT GCAACCGCAA GTACGACTGC GCCAACGAGT CGCGTCCGGG CGTGGTGTCC GAAGTGCTGA CCCCCGAGCA GGCGGTCAAG AAGGTCAAGG CCGTGGCCGC CGCCATCCCG CAGATGAGCG TGCTCGGCAT CGCCGGCCCC GGCGACCCCT TGGCCAACCC GAAGCGCACC CTCGACACCT TCCGCATGCT CAGCGAGCAG GCCCCGGACA TCAAGCTGTG CGTTTCCACC AACGGCCTGG CCCTGCCGGA GTGCGTCGAG GAACTGGCCA AGCACAACAT CGACCACGTC ACCATCACCA TCAACTGCGT GGACCCGGAG ATCGGCGCCA AGATCTATCC GTGGATCTAC TGGAACAACA AGCGCATCCG CGGCGTCAAG GCCGCCAAGA TCCTCATCGA GCAGCAGCAG AAGGGTCTGG AGATGCTGGT GGCGCGCGGC ATCCTGGTGA AGGTCAATTC GGTGATGATC CCCGGCGTCA ACGACGAGCA CCTGAAGGAA GTCAGCAAGA TCGTCAAGGC CAAGGGCGCC TTCCTGCACA ACGTCATGCC GCTGATCGCC GAGCCCGAGC ACGGCACCTT CTACGGCGTG ATGGGCCAGC GCAGCCCCGA GCCGGAAGAA CTGCAGGACC TGCAGGACGC CTGCGCCGGC GACATGAACA TGATGCGCCA CTGCCGCCAG TGCCGCGCCG ACGCGGTCGG CATGCTCGGC GAGGACCGCG GCGACGAGTT CACCCTGGAC AAGATCGAGT CGATGGAGAT CGATTACGAG GCGGCGATGG TCAAGCGCGC CGCCATCCAT GCGGCGATCA AGGAAGAGCT GGACGAGAAG GCGGCGAAGA AGGAACGGCT GGCTGGCCTG TCCGTTGCAT CCGTCCAGAA CGGCACGAGC GGTCGCTACC GTCCGGTGCT GATGGCCGTG GCCACCAGCG GCGGCGGCCT GATCAACCAG CACTTCGGGC ACGCCACCGA GTTCCTGGTG TACGAAGCCT CGCCGTCCGG GGTGCGCTTC ATCGGCCATC GCCGGGTCGA CCAGTACTGC GTCGGCAACG ACACCTGCGG CGAGAAGGAA AGTGCACTCG CCGGCAGCAT CCGTGCCCTG AAGGGATGCG AGGCGGTGCT CTGCTCGAAG ATCGGTTTCG AACCCTGGAG CGACCTGGAG ACCGCCGGCA TCCAGCCCAA TGGCGAGCAC GCCATGGAGC CCATCGAGGA AGCGGTCATG GCGGTCTACC GGGAAATGAT CGAGTCGGGT CGGCTGGAGA ATGACGGAGC CCTGCTGCAG GCCAAGGCCT GA
|
Protein sequence | MELSVLGQNN GGQHSAGGCS SSSCGSTHDQ LSHLPENIRA KVQNHPCYSE EAHHYFARMH VAVAPACNIQ CHYCNRKYDC ANESRPGVVS EVLTPEQAVK KVKAVAAAIP QMSVLGIAGP GDPLANPKRT LDTFRMLSEQ APDIKLCVST NGLALPECVE ELAKHNIDHV TITINCVDPE IGAKIYPWIY WNNKRIRGVK AAKILIEQQQ KGLEMLVARG ILVKVNSVMI PGVNDEHLKE VSKIVKAKGA FLHNVMPLIA EPEHGTFYGV MGQRSPEPEE LQDLQDACAG DMNMMRHCRQ CRADAVGMLG EDRGDEFTLD KIESMEIDYE AAMVKRAAIH AAIKEELDEK AAKKERLAGL SVASVQNGTS GRYRPVLMAV ATSGGGLINQ HFGHATEFLV YEASPSGVRF IGHRRVDQYC VGNDTCGEKE SALAGSIRAL KGCEAVLCSK IGFEPWSDLE TAGIQPNGEH AMEPIEEAVM AVYREMIESG RLENDGALLQ AKA
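As a rough consistency check on the two sequences above, the gene sequence can be stripped of its 10-character block spacing, translated, and compared with the listed protein. The sketch below is a minimal illustration, not the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in its permitted start codons). All function names are hypothetical, and only the first ten codons and the final four codons of the gene are used for brevity.

```python
from itertools import product

# Standard codon assignments in NCBI TCAG order; table 11 uses the same ones.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last blocks of the gene sequence shown above.
head = "ATGGAACTGA GCGTACTTGG GCAAAACAAT"
tail = "GCCAAGGCCT GA"
print(translate(head))  # MELSVLGQNN
print(translate(tail))  # AKA
```

Running `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content field in the record.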