Gene Ava_3933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3933 
Symbol 
ID3682978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4887596 
End bp4888930 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content46% 
IMG OID637719285 
Productnitrogenase molybdenum-cofactor biosynthesis protein NifN 
Protein accessionYP_324433 
Protein GI75910137 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0348274 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATCG TTACGCTCCC CAATAAATCA GTTGCGGTCA ATCCTCTCAA GCAAAGTCAA 
GCCCTGGGCG CTTCTTTAGC CTTCTTGGGA TTGAAAGGGA TGATTCCTCT GTTTCATGGT
TCCCAAGGTT GTACAGCTTT TGCCAAAGTG GTGTTAGTCC GGCATTTTCG GGAAGCCATA
CCCCTGGCGA CAACGGCGAT GACGGAAGTA ACTACCATTT TGGGTGGTGA AGATAATATT
GAGCAAGCAA TTCTCACTTT GGTGGAGAAG TCCAGCCCAG AAATTATTGG TCTGTGTAGC
ACTGGATTAA CAGAAACCAG AGGCGATGAT ATTGAACGCT TTCTTAAGGA TATCCGCGAT
CGCCATCCGG AAATATCTCA CCTACCAATT GTATTCGCGC CTACACCAGA TTTTAAAGGG
GCGTTGCAAG ATGGATTTGC GGCGGCTGTG GAAAGCATCG TCCAAGAAAT TCCCCAACCC
GGTACAACCA GAAGCGAACA AGTCACAATT TTGGCTGGTT CTGCCTTCAC CCCCGGAGAT
TTGCAAGAAA TCAAAGAGAT TGTCACCGCT TTTGGTTTAG TACCTATCTT TGTTCCTGAT
ATTGGTGCTT CCTTGGATGG ACACTTAGAT GAGGAATATA GTTCGGTAAC AACCAGTGGA
ACAACCGTCA CACAACTAAA AGAAGTCGGT CGTTCCGCCT TCACCATCGC CTTGGGTGAA
AGTATGCGGG GTGCGGCGAG GATTTTGGAA GACAGATTTA ACATTCCCTA CGAAGTCTTT
AGCGAACTCA CTGGCTTAGA ACCCGTAGAC GAATTTATCC AAGCCTTAGC AATTCTCAGC
AGCAACCCAG TACCAGAAAA GTATTGTCGT CAACGTCGCC AACTACAAGA TGCAATGTTA
GACACACACT TTTACTTTGG TGCAAAACGC ATCTCCTTGG CGCTAGAACC AGACCTGCTG
TGGTCAATGG TCAAGTTTCT GCAATCAATG GGGACACAAA TTCACGCCGC CGTTACTACC
ACACGCTCAC CCTTATTAGA ACAACTCCCC ATCAAGAGCG TAACCATCGG TGATTTAGAA
GACTTTGAAG AACTGGCAGT AGAATCTGAC TTGCTAATTG GTAATTCTAA CTTAGCAGCG
ATCGCCAAAC GTCTTTCCAT CCCTCACTAT CGTCTTGGTA TTCCCATTTA TGACCGCTTA
GGTAATGGTC ATTTCACGAA AGTCGGCTAT CGCGGCTCAA TGGAAGTCTT GTTTGGCATC
GGTAACCTAT TTATAGATGC AGAAGAAGCA AGAGTTAAGA ACTTTGATGA GAATTTTGTC
ATGGGTAATA GGTAA
 
Protein sequence
MAIVTLPNKS VAVNPLKQSQ ALGASLAFLG LKGMIPLFHG SQGCTAFAKV VLVRHFREAI 
PLATTAMTEV TTILGGEDNI EQAILTLVEK SSPEIIGLCS TGLTETRGDD IERFLKDIRD
RHPEISHLPI VFAPTPDFKG ALQDGFAAAV ESIVQEIPQP GTTRSEQVTI LAGSAFTPGD
LQEIKEIVTA FGLVPIFVPD IGASLDGHLD EEYSSVTTSG TTVTQLKEVG RSAFTIALGE
SMRGAARILE DRFNIPYEVF SELTGLEPVD EFIQALAILS SNPVPEKYCR QRRQLQDAML
DTHFYFGAKR ISLALEPDLL WSMVKFLQSM GTQIHAAVTT TRSPLLEQLP IKSVTIGDLE
DFEELAVESD LLIGNSNLAA IAKRLSIPHY RLGIPIYDRL GNGHFTKVGY RGSMEVLFGI
GNLFIDAEEA RVKNFDENFV MGNR