Gene Ava_4028 details

Gene Information

Locus tag: Ava_4028
Symbol:
ID: 3682157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5014923
End bp: 5016452
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 44%
IMG OID: 637719380
Product: oxidoreductase/nitrogenase, component 1
Protein accession: YP_324528
Protein GI: 75910232
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID:
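
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5014923
END_BP = 5016452
GENE_LENGTH_BP = 1530
PROTEIN_LENGTH_AA = 509


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming an unspliced CDS whose final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # compare with Gene Length
    print(coding_codons(GENE_LENGTH_BP))   # compare with Protein Length
```

Under those assumptions, both computed values agree with the Gene Length and Protein Length fields listed here.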


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.764483
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACAA ATATGGATGC GAATGTCGTA TTTTACGGAC ATTTAAGCGA ACTGTACCAA 
CTGGCAAAGG AAGGTAAGAT TAAAACCACT TTACAAGGTA GTCATACTCG ACCCTGTAAA
TTTTGGGCTG CAACCAAGAT TTTAAGTGGG ATTAAAGATG CGATCGTCAT TTCTCACGGG
CCGAGTGGCT GTGCTTATGG GGTGAAGCGG GCGTATAAAC TCACCAATAG CCGTAACAGT
GGTTCTCCTT ATGAACCCGT TGTCACCACC AACATGAGTG AGAAAGCGGT AATTTTTGGG
GGTGAAAAAG AATTACGCGG TGCAATTCTG GAAGTAGATC AAAAATATCA TCCTGATGCG
ATCGTTGTTG CTACCAGTTG CGCGTCGGGG ATTATTGGCG ATTGTGTAGA TGAAGTAGTG
GGTAAAGCGC GTAGCGAAAT CGACGCAGAA ATCATGACGA TACACTGCGA AGGCTTCGCC
GGAGAGTATC GTAGTGGCTT TGATATTGTC TTCCGCCAAA TTGTAGACTT CATGGAGCCA
CCCACCCCAG AACGCCAAGC ACAACTAGCT GATTCTGTCA ATATTGTGGG CGCAAAGATG
GGGCCAGAAA GGACGGAAGT AGAAGGAGAT GTCAAGGAAC TGAAACGACT GATCAAAGGG
ATGGGTGCAA GAGTTCACAG CGTGATTGCA GGGGACTGCA CATTAGAAGA ACTCAAACAA
GCACCAAGCG CAGCCGTTAA CTGTACCTTA TGTTTGGATC TGGGCTATAC CATCGGTAAA
GCTATGTCAG ATAGATTTGG TACACCTTTA AATTCGACAA TTCTGCCCTA TGGTATCAGT
GCTACAGAAA AATGGTTGCG CGGGGCGGCA AAATATTTAA AGATGGAAGC ACAAGCAGAA
GCCCTGATGG AACGGGAATA TGCAGCCATC AAAACAGAAT TTGAAGCAGC CAAGAAATAT
ATCGAAGGTA AGTTAGCCAT TATCGAAGGA CATGACGCTA TTAAGTGCTT GTCTATCGCC
CATATGTTGG AACGCGATTT TGGGATGCGT GCTGTTATTT ATAACTTCCA TCCCTGGAGT
ACGGAAGCAC GAGAAACCAG TGTAGATTAT TTGCTAGAAA CAGGCTTAGA CCCAGAAATT
TTAATTACCA AAGGGACTTT AGCTTTTGGT AAGTACGAAT CAATGAAACA AACTGAAGAT
GAATTACTAG AATTTATCGG TGGTTTAGAT GCTGATTCTG TAGTTTACTT CGGTTCTTCC
TTGAGTTTCC CCCACATTCC CGTTGTGGAT TTAAACGCCA TCTTAAATCG TCCCAGATTT
GGCTATCGCG GAGCTTTAAA GGTCGCTAAG TGTATTAAAA CCGCCCTTGA ATATGGTTTT
AGACCCCGCA GTTCCCTAAC CAAGCAAATG GTATTCCCGA AAAACTCAGG ATTAGCATCT
GCTCAATCAC TCACCGGAAA ATTAGCCCAA GACTTGCCAG ATTGCACCGT ATACGCAGGT
AAGAAACGGG GCAAATGTTT TAACGAGTAA
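
The gene sequence is laid out in blocks of 10 bases, so it needs to be joined before its length or GC content can be computed. The following is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First line of the gene sequence above, used as sample input.
first_line = "ATGGCAACAA ATATGGATGC GAATGTCGTA TTTTACGGAC ATTTAAGCGA ACTGTACCAA"
sample = clean_sequence(first_line)

if __name__ == "__main__":
    print(len(sample))
    print(round(gc_percent(sample), 1))
```

Running the same two functions over all lines of the gene sequence should give a figure comparable to the GC content field of the record, subject to how the database rounds it.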
 
Protein sequence
MATNMDANVV FYGHLSELYQ LAKEGKIKTT LQGSHTRPCK FWAATKILSG IKDAIVISHG 
PSGCAYGVKR AYKLTNSRNS GSPYEPVVTT NMSEKAVIFG GEKELRGAIL EVDQKYHPDA
IVVATSCASG IIGDCVDEVV GKARSEIDAE IMTIHCEGFA GEYRSGFDIV FRQIVDFMEP
PTPERQAQLA DSVNIVGAKM GPERTEVEGD VKELKRLIKG MGARVHSVIA GDCTLEELKQ
APSAAVNCTL CLDLGYTIGK AMSDRFGTPL NSTILPYGIS ATEKWLRGAA KYLKMEAQAE
ALMEREYAAI KTEFEAAKKY IEGKLAIIEG HDAIKCLSIA HMLERDFGMR AVIYNFHPWS
TEARETSVDY LLETGLDPEI LITKGTLAFG KYESMKQTED ELLEFIGGLD ADSVVYFGSS
LSFPHIPVVD LNAILNRPRF GYRGALKVAK CIKTALEYGF RPRSSLTKQM VFPKNSGLAS
AQSLTGKLAQ DLPDCTVYAG KKRGKCFNE
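
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on an external library. Translation table 11 shares its amino-acid assignments with the standard code and differs in the set of permitted start codons, which does not matter here because this gene starts with ATG. The `translate` function is an illustrative helper, not a database tool, and it does not handle ambiguous bases such as N.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (third base varies fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGGCAACAA ATATGGATGC GAATGTCGTA"))  # MATNMDANVV
```

The printed residues match the first block of the protein sequence, and translating the last line of the gene sequence the same way yields the final residues, KKRGKCFNE, followed by the TAA stop codon.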