Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_01640 |
Symbol | nifV |
ID | 7759127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 155646 |
End bp | 156800 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643803086 |
Product | nitrogen fixation homocitrate synthase |
Protein accession | YP_002797402 |
Protein GI | 226942329 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR02660] homocitrate synthase NifV |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAGCG TGATCATCGA CGACACTACC CTGCGTGACG GTGAACAGAG TGCCGGGGTC GCCTTCAATG CCGACGAGAA GATCGCTATC GCCCGCGCGC TCGCCGAACT GGGCGTGCCG GAGTTGGAGA TCGGCATTCC CAGCATGGGC GAGGAAGAGC GCGAGGTGAT GCACGCCATC GCCGGTCTCG GCCTGTCGTC TCGCCTGCTG GCCTGGTGCC GGCTATGCGA CGTCGATCTC GCGGCGGCGC GCTCCACCGG GGTGACCATG GTCGACCTTT CGCTGCCGGT CTCCGACCTG ATGCTGCACC ACAAGCTCAA TCGCGATCGC GACTGGGCCT TGCGCGAAGT GGCCAGGCTG GTCGGCGAAG CGCGCATGGC CGGGCTCGAG GTGTGCCTGG GCTGCGAGGA CGCCTCGCGG GCGGATCTGG AGTTCGTCGT GCAGGTGGGC GAAGTGGCGC AGGCCGCCGG CGCCCGTCGG CTGCGCTTCG CCGACACCGT CGGGGTCATG GAGCCCTTCG GCATGCTCGA CCGCTTCCGT TTCCTCAGCC GGCGCCTGGA CATGGAGCTG GAAGTGCACG CCCACGATGA TTTCGGGCTG GCCACGGCCA ACACCCTGGC CGCGGTGATG GGCGGGGCGA CTCATATCAA CACCACGGTC AACGGGCTCG GCGAGCGTGC CGGCAACGCC GCGCTGGAAG AGTGCGTGCT GGCGCTCAAG AACCTCCACG GTATCGACAC CGGTATCGAT ACCCGCGGCA TCCCGGCCAT CTCCGCGCTG GTCGAGCGGG CCTCGGGGCG CCAGGTGGCC TGGCAGAAGA GCGTGGTCGG CGCCGGGGTG TTCACTCACG AGGCCGGTAT CCACGTCGAC GGACTGCTCA AGCATCGGCG CAACTACGAG GGGCTGAATC CCGACGAACT CGGTCGCAGC CACAGTCTGG TGCTGGGCAA GCATTCCGGG GCGCACATGG TGCGCAACAC GTACCGCGAT CTGGGTATCG AGCTGGCGGA CTGGCAGAGC CAAGCGCTGC TCGGCCGCAT CCGTGCCTTC TCCACCAGGA CCAAGCGCAG CCCGCAGCCT GCCGAGCTGC AGGATTTCTA TCGGCAGTTG TGCGAGCAAG GCAATCCCGA ACTGGCCGCA GGAGGAATGG CATGA
|
Protein sequence | MASVIIDDTT LRDGEQSAGV AFNADEKIAI ARALAELGVP ELEIGIPSMG EEEREVMHAI AGLGLSSRLL AWCRLCDVDL AAARSTGVTM VDLSLPVSDL MLHHKLNRDR DWALREVARL VGEARMAGLE VCLGCEDASR ADLEFVVQVG EVAQAAGARR LRFADTVGVM EPFGMLDRFR FLSRRLDMEL EVHAHDDFGL ATANTLAAVM GGATHINTTV NGLGERAGNA ALEECVLALK NLHGIDTGID TRGIPAISAL VERASGRQVA WQKSVVGAGV FTHEAGIHVD GLLKHRRNYE GLNPDELGRS HSLVLGKHSG AHMVRNTYRD LGIELADWQS QALLGRIRAF STRTKRSPQP AELQDFYRQL CEQGNPELAA GGMA
|
| |