Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4029 |
Symbol | |
ID | 3682158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5016498 |
End bp | 5017898 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637719381 |
Product | oxidoreductase/nitrogenase, component 1 |
Protein accession | YP_324529 |
Protein GI | 75910233 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.522703 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTCAG CAATAACAAG TGAAAACTCC CAGAAACTCA ATGTCATACC CTCCCACACC GAGCCACTAA CCTTTGATAA TTGCGACCAC AGCAAAGACC CAATTGTCGG CTGTGCGTTG GAAGGTATCG CTAACATGGT GGCGGGGATT AAAGATGTCA GTATTGTGAT TCACTCTCCC CAAGGTTGCG CCTCTACGGT GGCGGCTGGC TATGATAACC ATGAGGTTGA TTTCACTAAG CGCAAAGTCG GTTGTACGCG CCTGTTTGAG TCAGATATTG TCATGGGTGC GTCTGAGAAA CTCAAAGGTT TGATTAAAGA GGCGGATCAA TCTTTCAAAG CTAAAGTGAT GTTTGTCGTT GGTACTTGCG CCGCAGATAT TATCGGTGAA GATATTCAAG GATTATGCAA TAGTATCCAA CCAGAAATCA ACGCCAAGCT TGTACCTCTG CTGGCTGGTG GTTTTCGCGG TAATGCTTAC GATGGCTTGG AAATGGGCTT AGAAGCTTTA CTTCCTTTCA TCCAAAAAAG ACAGAAGCGG CGCGGCGGTA AAAAACCCAG AATTGTGAAT ATTATTGCAC CCCAAGCAAA TGTCAATCCT ACTTGGTGGG CTGATTTGCA ATGGGTAACG CAGATGTTAA AATCTTTAAG GATTAAAGTA CAGACCGTAA TTTCTCATGG TACTTCCTTT GAAGAACTGG AAAAAGCGGG TAATGCAACT GCCAATATTC TTCTCAGCCA TGATGTGGGG TATAAGTTTG CGCGGAAAAT GCAAGAAACC CATAATATCC CCCTAATTCT GGATGATATA CCTTTACCCA TTGGTGTGCA GAATACTACA CGCTGGTTGA AGGCTTTAGC AGCACATTTC AAAATAGACG AAAAAAGAGT AGAACCTCTC ATCAACGAAG GTGAAAACAG GGTTGTAGAA ACTCTTCGCA AACGCGCATT GATGATTATT CCCCGATATC GTAACTGTAG AATTGCTGTG TCTGCGGATG GGACAATGGG TATTGGGTTG GTGAGAATGC TGTTTGAAGA ATTGGAAATG ATTCCAGAAG TGTTGTTGTT CCGTTCAGGA ATGCGCGAAT CTCGTTCAAT TCTAGAGCGA GAATTACAAA GTATGGGGAT TTCTCCCCGT GTAGTGTTTT CTGCTGATGG GTATCAGATT AAACAAGCCT TAGCTGATGT TGATACTGAT GCTGTCATTG GCTCGGCTTG GGAAAAATAC ATGGCGGAAG AATTGGGAAT TAAAATTGCT TTTGATGTAT TTAGTCCGAC AAACAGAGAG ACTTACCTTG ATAGACCATA TTTTGGCTAT GAAGGTATGA TTAACATGAT GGAAGTTGTT GCTAACGATT GGGAAAGAGC ATTTCGTTCC AAACATATTC ATTGGACTTA G
|
Protein sequence | MLSAITSENS QKLNVIPSHT EPLTFDNCDH SKDPIVGCAL EGIANMVAGI KDVSIVIHSP QGCASTVAAG YDNHEVDFTK RKVGCTRLFE SDIVMGASEK LKGLIKEADQ SFKAKVMFVV GTCAADIIGE DIQGLCNSIQ PEINAKLVPL LAGGFRGNAY DGLEMGLEAL LPFIQKRQKR RGGKKPRIVN IIAPQANVNP TWWADLQWVT QMLKSLRIKV QTVISHGTSF EELEKAGNAT ANILLSHDVG YKFARKMQET HNIPLILDDI PLPIGVQNTT RWLKALAAHF KIDEKRVEPL INEGENRVVE TLRKRALMII PRYRNCRIAV SADGTMGIGL VRMLFEELEM IPEVLLFRSG MRESRSILER ELQSMGISPR VVFSADGYQI KQALADVDTD AVIGSAWEKY MAEELGIKIA FDVFSPTNRE TYLDRPYFGY EGMINMMEVV ANDWERAFRS KHIHWT
|
| |