Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4026 |
Symbol | |
ID | 3681258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5011312 |
End bp | 5013075 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637719378 |
Product | nitrogenase vanadium-iron protein, alpha chain |
Protein accession | YP_324526 |
Protein GI | 75910230 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01284] nitrogenase alpha chain [TIGR01860] nitrogenase vanadium-iron protein, alpha chain [TIGR01862] nitrogenase component I, alpha chain [TIGR02930] V-containing nitrogenase, delta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACTAA AACTATTAAA GTGCGACGAA ACAATCCCAG AAAGGGAGAA GCACGTATAC ATCAAAGAAA AAGGCGAAGA CACAACCCAA TTTCTGCCCC TATCCAACAT CGAAACCATC CCAGGTTCAC TATCAGAACG TGGATGTAGC TACTGCGGAG CCAAACTGGT AATTGGTGGC GTACTCAAAG ACACCATCCA AATGATTCAC GGCCCCATAG GTTGCGCCTA CGACACCTGG CACACCAAAC GTTATCCTAG CGACAACGGT AACTTTCAAC TAAAATACGT CTGGTCATCA GACATGAAAG AATCCCACGT CGTCTTCGGT GGAGAAAAAC AACTAGGCAA ATCCATCCGA GAAGCCTTCA AAGAATTTCC CGACATCAAA CGGATGATAG TCTACACCAC CTGCGCCACA GCCTTGATTG GTGACGACAT CAAAGCCGTA GTCAAAAGCG CACAGCAAGA ACTCGGTGAC GTAGACATCT TCTGCGTAGA ATGTCCCGGA TTTGCAGGTG TGAGCCAATC CAAAGGACAC CACGTCCTCA ACATCGCTTG GATTAACGAA AAAGTCGGCA CATTAGAGCC AGAAATCACC TCACCCTACA CCATCAACGT CATCGGAGAC TACAACATCC AAGGTGACAC CTTCGTACTA GAGAAGTACA TGGAGAAAAT GGGCATTCAA ATCATCGCCC ATTTTACCGG AAATGGTACT TATGACTCCT TACGAGGAAT GCACAGAGCG CAACTCAACG TTACCAACTG TGCGCGTTCA GCCGGATACA TCGCTAACGA ACTCAAAAAG AGATATGGTA TTCCTCGTCT AGACGTAGAC ACCTGGGGTT TTGACTATTG CCAAGAAGCA TTACGCAAAA TCGGCGCATT CTTCGGGATT GAAGACAGAG CCGAGGCTGT AATTGCCGAA GAAATCGCCA AATACCAAGA GAAAATGGAT TGGTACAAGG AAAGACTCTC AGGCAAAAAA GTCTGTATTT GGACAGGTGG TCCGAGACTA TGGCACTGGA CAAAAGCTCT GGAAGATGAC TTAGGAATGC AGGTAGTATC CATGTCTTCT AAGTTCGGAC ACCAAGAAGA CTTTGAAAAG GTCATTGCCA GAGGTCAAGA AGGAACTATC TATATTGATG ATGGTAATGA ATTGGAATTC TTTGAAGTTC TGGAAATGAT TCGTCCTGAT GTGGTGTTAA CTGGGCCACG GGTGGGTGCG TTAGTTAAGA AACTGCACTT ACCTTATGTT AATGGTCACG GTTATCACAA TGGCCCATAC ATGGGCTTTG AAGGTGCGGT GAATATGGCG CGTGACTTGT ACAATGCCAT TTATTCTCCT TTGATGCAGC TTGCTGCTAT TGATGTTCGT GATGATGCTC CTAAAGCTCC TGCCAAGACA AAAGAAATTG AACACTTGAA TGAAAAAGTG ACCAATATCA CAACCTACAT TCAAGAACGT TGTTTGTGGC AATTCCACTC CCGTGCATGG GATAGAGAAG AGAACATCAA TGGTGTAATC AAGAAAGCTG CCGAACTCTT GAGTGGAGAA AGATCAGTTC AGGAAACGCT GACTGACAAG CTGCATTATG CAGATGCGAA GATTCTGGTG TCTGAACTCA AACGCAACTT ACCTTGGATC AAAGAGTTGG ATAAGGCGCA AGTTAAGTCT GTACTAGAGT CAGTTAAGCA AAATCTAGTC GGTATTGCGA TCGCTGGTTC TCTCAACGGT GAATTGCACC ACTCTCTCTA TTAA
|
Protein sequence | MPLKLLKCDE TIPEREKHVY IKEKGEDTTQ FLPLSNIETI PGSLSERGCS YCGAKLVIGG VLKDTIQMIH GPIGCAYDTW HTKRYPSDNG NFQLKYVWSS DMKESHVVFG GEKQLGKSIR EAFKEFPDIK RMIVYTTCAT ALIGDDIKAV VKSAQQELGD VDIFCVECPG FAGVSQSKGH HVLNIAWINE KVGTLEPEIT SPYTINVIGD YNIQGDTFVL EKYMEKMGIQ IIAHFTGNGT YDSLRGMHRA QLNVTNCARS AGYIANELKK RYGIPRLDVD TWGFDYCQEA LRKIGAFFGI EDRAEAVIAE EIAKYQEKMD WYKERLSGKK VCIWTGGPRL WHWTKALEDD LGMQVVSMSS KFGHQEDFEK VIARGQEGTI YIDDGNELEF FEVLEMIRPD VVLTGPRVGA LVKKLHLPYV NGHGYHNGPY MGFEGAVNMA RDLYNAIYSP LMQLAAIDVR DDAPKAPAKT KEIEHLNEKV TNITTYIQER CLWQFHSRAW DREENINGVI KKAAELLSGE RSVQETLTDK LHYADAKILV SELKRNLPWI KELDKAQVKS VLESVKQNLV GIAIAGSLNG ELHHSLY
|
| |