Gene Ava_4029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4029 
Symbol 
ID3682158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5016498 
End bp5017898 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content43% 
IMG OID637719381 
Productoxidoreductase/nitrogenase, component 1 
Protein accessionYP_324529 
Protein GI75910233 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.522703 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGTCAG CAATAACAAG TGAAAACTCC CAGAAACTCA ATGTCATACC CTCCCACACC 
GAGCCACTAA CCTTTGATAA TTGCGACCAC AGCAAAGACC CAATTGTCGG CTGTGCGTTG
GAAGGTATCG CTAACATGGT GGCGGGGATT AAAGATGTCA GTATTGTGAT TCACTCTCCC
CAAGGTTGCG CCTCTACGGT GGCGGCTGGC TATGATAACC ATGAGGTTGA TTTCACTAAG
CGCAAAGTCG GTTGTACGCG CCTGTTTGAG TCAGATATTG TCATGGGTGC GTCTGAGAAA
CTCAAAGGTT TGATTAAAGA GGCGGATCAA TCTTTCAAAG CTAAAGTGAT GTTTGTCGTT
GGTACTTGCG CCGCAGATAT TATCGGTGAA GATATTCAAG GATTATGCAA TAGTATCCAA
CCAGAAATCA ACGCCAAGCT TGTACCTCTG CTGGCTGGTG GTTTTCGCGG TAATGCTTAC
GATGGCTTGG AAATGGGCTT AGAAGCTTTA CTTCCTTTCA TCCAAAAAAG ACAGAAGCGG
CGCGGCGGTA AAAAACCCAG AATTGTGAAT ATTATTGCAC CCCAAGCAAA TGTCAATCCT
ACTTGGTGGG CTGATTTGCA ATGGGTAACG CAGATGTTAA AATCTTTAAG GATTAAAGTA
CAGACCGTAA TTTCTCATGG TACTTCCTTT GAAGAACTGG AAAAAGCGGG TAATGCAACT
GCCAATATTC TTCTCAGCCA TGATGTGGGG TATAAGTTTG CGCGGAAAAT GCAAGAAACC
CATAATATCC CCCTAATTCT GGATGATATA CCTTTACCCA TTGGTGTGCA GAATACTACA
CGCTGGTTGA AGGCTTTAGC AGCACATTTC AAAATAGACG AAAAAAGAGT AGAACCTCTC
ATCAACGAAG GTGAAAACAG GGTTGTAGAA ACTCTTCGCA AACGCGCATT GATGATTATT
CCCCGATATC GTAACTGTAG AATTGCTGTG TCTGCGGATG GGACAATGGG TATTGGGTTG
GTGAGAATGC TGTTTGAAGA ATTGGAAATG ATTCCAGAAG TGTTGTTGTT CCGTTCAGGA
ATGCGCGAAT CTCGTTCAAT TCTAGAGCGA GAATTACAAA GTATGGGGAT TTCTCCCCGT
GTAGTGTTTT CTGCTGATGG GTATCAGATT AAACAAGCCT TAGCTGATGT TGATACTGAT
GCTGTCATTG GCTCGGCTTG GGAAAAATAC ATGGCGGAAG AATTGGGAAT TAAAATTGCT
TTTGATGTAT TTAGTCCGAC AAACAGAGAG ACTTACCTTG ATAGACCATA TTTTGGCTAT
GAAGGTATGA TTAACATGAT GGAAGTTGTT GCTAACGATT GGGAAAGAGC ATTTCGTTCC
AAACATATTC ATTGGACTTA G
 
Protein sequence
MLSAITSENS QKLNVIPSHT EPLTFDNCDH SKDPIVGCAL EGIANMVAGI KDVSIVIHSP 
QGCASTVAAG YDNHEVDFTK RKVGCTRLFE SDIVMGASEK LKGLIKEADQ SFKAKVMFVV
GTCAADIIGE DIQGLCNSIQ PEINAKLVPL LAGGFRGNAY DGLEMGLEAL LPFIQKRQKR
RGGKKPRIVN IIAPQANVNP TWWADLQWVT QMLKSLRIKV QTVISHGTSF EELEKAGNAT
ANILLSHDVG YKFARKMQET HNIPLILDDI PLPIGVQNTT RWLKALAAHF KIDEKRVEPL
INEGENRVVE TLRKRALMII PRYRNCRIAV SADGTMGIGL VRMLFEELEM IPEVLLFRSG
MRESRSILER ELQSMGISPR VVFSADGYQI KQALADVDTD AVIGSAWEKY MAEELGIKIA
FDVFSPTNRE TYLDRPYFGY EGMINMMEVV ANDWERAFRS KHIHWT