Gene Ava_4026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4026 
Symbol 
ID3681258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5011312 
End bp5013075 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content45% 
IMG OID637719378 
Productnitrogenase vanadium-iron protein, alpha chain 
Protein accessionYP_324526 
Protein GI75910230 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01284] nitrogenase alpha chain
[TIGR01860] nitrogenase vanadium-iron protein, alpha chain
[TIGR01862] nitrogenase component I, alpha chain
[TIGR02930] V-containing nitrogenase, delta subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACTAA AACTATTAAA GTGCGACGAA ACAATCCCAG AAAGGGAGAA GCACGTATAC 
ATCAAAGAAA AAGGCGAAGA CACAACCCAA TTTCTGCCCC TATCCAACAT CGAAACCATC
CCAGGTTCAC TATCAGAACG TGGATGTAGC TACTGCGGAG CCAAACTGGT AATTGGTGGC
GTACTCAAAG ACACCATCCA AATGATTCAC GGCCCCATAG GTTGCGCCTA CGACACCTGG
CACACCAAAC GTTATCCTAG CGACAACGGT AACTTTCAAC TAAAATACGT CTGGTCATCA
GACATGAAAG AATCCCACGT CGTCTTCGGT GGAGAAAAAC AACTAGGCAA ATCCATCCGA
GAAGCCTTCA AAGAATTTCC CGACATCAAA CGGATGATAG TCTACACCAC CTGCGCCACA
GCCTTGATTG GTGACGACAT CAAAGCCGTA GTCAAAAGCG CACAGCAAGA ACTCGGTGAC
GTAGACATCT TCTGCGTAGA ATGTCCCGGA TTTGCAGGTG TGAGCCAATC CAAAGGACAC
CACGTCCTCA ACATCGCTTG GATTAACGAA AAAGTCGGCA CATTAGAGCC AGAAATCACC
TCACCCTACA CCATCAACGT CATCGGAGAC TACAACATCC AAGGTGACAC CTTCGTACTA
GAGAAGTACA TGGAGAAAAT GGGCATTCAA ATCATCGCCC ATTTTACCGG AAATGGTACT
TATGACTCCT TACGAGGAAT GCACAGAGCG CAACTCAACG TTACCAACTG TGCGCGTTCA
GCCGGATACA TCGCTAACGA ACTCAAAAAG AGATATGGTA TTCCTCGTCT AGACGTAGAC
ACCTGGGGTT TTGACTATTG CCAAGAAGCA TTACGCAAAA TCGGCGCATT CTTCGGGATT
GAAGACAGAG CCGAGGCTGT AATTGCCGAA GAAATCGCCA AATACCAAGA GAAAATGGAT
TGGTACAAGG AAAGACTCTC AGGCAAAAAA GTCTGTATTT GGACAGGTGG TCCGAGACTA
TGGCACTGGA CAAAAGCTCT GGAAGATGAC TTAGGAATGC AGGTAGTATC CATGTCTTCT
AAGTTCGGAC ACCAAGAAGA CTTTGAAAAG GTCATTGCCA GAGGTCAAGA AGGAACTATC
TATATTGATG ATGGTAATGA ATTGGAATTC TTTGAAGTTC TGGAAATGAT TCGTCCTGAT
GTGGTGTTAA CTGGGCCACG GGTGGGTGCG TTAGTTAAGA AACTGCACTT ACCTTATGTT
AATGGTCACG GTTATCACAA TGGCCCATAC ATGGGCTTTG AAGGTGCGGT GAATATGGCG
CGTGACTTGT ACAATGCCAT TTATTCTCCT TTGATGCAGC TTGCTGCTAT TGATGTTCGT
GATGATGCTC CTAAAGCTCC TGCCAAGACA AAAGAAATTG AACACTTGAA TGAAAAAGTG
ACCAATATCA CAACCTACAT TCAAGAACGT TGTTTGTGGC AATTCCACTC CCGTGCATGG
GATAGAGAAG AGAACATCAA TGGTGTAATC AAGAAAGCTG CCGAACTCTT GAGTGGAGAA
AGATCAGTTC AGGAAACGCT GACTGACAAG CTGCATTATG CAGATGCGAA GATTCTGGTG
TCTGAACTCA AACGCAACTT ACCTTGGATC AAAGAGTTGG ATAAGGCGCA AGTTAAGTCT
GTACTAGAGT CAGTTAAGCA AAATCTAGTC GGTATTGCGA TCGCTGGTTC TCTCAACGGT
GAATTGCACC ACTCTCTCTA TTAA
 
Protein sequence
MPLKLLKCDE TIPEREKHVY IKEKGEDTTQ FLPLSNIETI PGSLSERGCS YCGAKLVIGG 
VLKDTIQMIH GPIGCAYDTW HTKRYPSDNG NFQLKYVWSS DMKESHVVFG GEKQLGKSIR
EAFKEFPDIK RMIVYTTCAT ALIGDDIKAV VKSAQQELGD VDIFCVECPG FAGVSQSKGH
HVLNIAWINE KVGTLEPEIT SPYTINVIGD YNIQGDTFVL EKYMEKMGIQ IIAHFTGNGT
YDSLRGMHRA QLNVTNCARS AGYIANELKK RYGIPRLDVD TWGFDYCQEA LRKIGAFFGI
EDRAEAVIAE EIAKYQEKMD WYKERLSGKK VCIWTGGPRL WHWTKALEDD LGMQVVSMSS
KFGHQEDFEK VIARGQEGTI YIDDGNELEF FEVLEMIRPD VVLTGPRVGA LVKKLHLPYV
NGHGYHNGPY MGFEGAVNMA RDLYNAIYSP LMQLAAIDVR DDAPKAPAKT KEIEHLNEKV
TNITTYIQER CLWQFHSRAW DREENINGVI KKAAELLSGE RSVQETLTDK LHYADAKILV
SELKRNLPWI KELDKAQVKS VLESVKQNLV GIAIAGSLNG ELHHSLY