Gene Ava_4249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4249 
Symbol 
ID3680897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5329452 
End bp5330990 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content46% 
IMG OID637719597 
Productnitrogenase molybdenum-iron protein beta chain 
Protein accessionYP_324743 
Protein GI75910447 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01286] nitrogenase molybdenum-iron protein beta chain 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAAA ATGATGTAAA CAAGATTAAG GATCACGCCG AACTTTTCCA ACAGCAAGAA 
TATCAAGAAT TATTTCAAGC CAAGAAAGAG TTTGAATGTG GACACAACCC CGAAGAAGTT
ACCCGCATTG CTGAGTGGAC AAAAACCTGG GAATACCGCG AAAAGAACTT TGCTCGTCAA
GCTTTGACCG TTAACCCTGC GAAAGCTTGT CAACCTCTAG GCTCAATTTT GGCAGCCGTG
GGTTTTGAAG GTACACTACC TTTCGTACAT GGTTCTCAAG GTTGCGTTGC TTACTTCCGT
AGTCACTTAA CCCGCCACTT CAAAGAACCG TTTTCTGCTG TATCCTCCTC TATGACAGAA
GATGCAGCCG TATTCGGTGG TCTAAAAAAC CTGGTTGAAG GTTTAGCAGT TTCCTACAAC
CTATATAAGC CCAAGATGAT TGCTGTCTGC ACTACCTGCA TGGCAGAGGT AATAGGAGAT
GACCTGCAAG CTTTTATCAG AACTGCAAAA GAGGAAGGTG CTGTGCCTGA AGACTTCCCA
GTTCCTTATG CTCACACCCC CAGCTTTGTA GGTTCCCATA TTTTGGGTTA CGACAATATG
ATGAAGGGGA TTCTGTCGAA TTTGACAGCC AGGAAGAAGA AAGAAACCAC GAACGGCAAA
ATCAACTTTA TTCCAGGATT TGAAACCTAT ATTGGCAACC TGCGCGAAAT TAAACGGATT
GCCAATGTGA TGGGAATTAA CTACACCTTG TTGGCAGATA ACTCAGAATA CCTGGATTCT
CCAAATAACG GGGAATACAA TATGTATCCT GGCGGAACTA AGCTAGAGGA TGCGGCTGAT
TCCATCAATG CAGAAGCGAC CATTGCCCTG CAAGCTTACG CCACTACTAA AACTCGTGAG
TACATCGAAC ACGAATGGCA GCACAAGACT TATGTTAACC GTCCAGTCGG AATTCGTGGC
ACTGATGAAT TTCTGATGAA ACTGTCGGCG TTAACTGGCA AGCCAATTCC CCAAGAATTA
GAAGATGAAC GCGGACGCGC TGTGGATGCC TTGACCGATT CTCAAGCATG GTTGCACGGT
AAGCGCGTGG CGATGTATGG CGATCCTGAT TTGGTGATTG GTTTGACACA ATTCCTGTTA
GAAGTCGGTG CAGAACCAGT TCACATCGTT GTCAGCAATA GTAACGAACA TTTTGAAGCA
GAACTGCGGG CATTACTGGA CTCTAGCCCC TTTGGTCAAG GTGCTACTAT CCACGGTGGG
CGTGACCTGT GGCACATGAG GTCTTTGCTA TTCACCGAAC CTGTAGACTT GCTCATCGGT
AACTCCTACG GTAAATATCT CTGGCGCGAC ACCAAAACCC CATTTGTACG GATTGGCTAC
CCCATCTTTG ACCGCCACCA CCTACACCGC TACGCCACCT ACGGCTACCA AGGCACAATC
AACCTACTCA ACTGGATAGT TAACACCATT CTGGATGAGT TGGATCGTAA CACCATCATT
CCATCCAAAA CCGATATTTC CTACGACTTG ATTCGATAG
 
Protein sequence
MAQNDVNKIK DHAELFQQQE YQELFQAKKE FECGHNPEEV TRIAEWTKTW EYREKNFARQ 
ALTVNPAKAC QPLGSILAAV GFEGTLPFVH GSQGCVAYFR SHLTRHFKEP FSAVSSSMTE
DAAVFGGLKN LVEGLAVSYN LYKPKMIAVC TTCMAEVIGD DLQAFIRTAK EEGAVPEDFP
VPYAHTPSFV GSHILGYDNM MKGILSNLTA RKKKETTNGK INFIPGFETY IGNLREIKRI
ANVMGINYTL LADNSEYLDS PNNGEYNMYP GGTKLEDAAD SINAEATIAL QAYATTKTRE
YIEHEWQHKT YVNRPVGIRG TDEFLMKLSA LTGKPIPQEL EDERGRAVDA LTDSQAWLHG
KRVAMYGDPD LVIGLTQFLL EVGAEPVHIV VSNSNEHFEA ELRALLDSSP FGQGATIHGG
RDLWHMRSLL FTEPVDLLIG NSYGKYLWRD TKTPFVRIGY PIFDRHHLHR YATYGYQGTI
NLLNWIVNTI LDELDRNTII PSKTDISYDL IR