Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4054 |
Symbol | |
ID | 3681675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5039282 |
End bp | 5040256 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637719405 |
Product | nitrogenase iron protein subunit NifH |
Protein accession | YP_324553 |
Protein GI | 75910257 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1348] Nitrogenase subunit NifH (ATPase) |
TIGRFAM ID | [TIGR01287] nitrogenase iron protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00458685 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.287563 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGACG AAAACATTAG ACAGATAGCT TTTTACGGTA AAGGCGGTAT CGGTAAATCC ACCACTTCCC AAAATACCCT CGCAGCAATG GCAGAAGTGG GACAACGTAT TCTGATTGTC GGATGTGACC CCAAAGCAGA CTCTACCCGC TTGATTCTCC ACACCAAAGC ACAAACTACC GTACTTCACT TAGCTGCTGA ACGCGGTGCA GTGGAAGACT TAGAACTTGA TGAAGTAGTA CTCAAAGGCT TCCGCGATAT CAAATGCGTA GAATCTGGTG GTCCAGAACC CGGTGTAGGT TGCGCTGGTC GTGGTATTAT CACCGCTATT AACTTCCTCG AAGAAAACGG TGCATATCAA GACGTAGATT TCGTATCTTA CGACGTATTA GGTGACGTTG TATGCGGTGG TTTCGCCATG CCAATTCGGG AAGGGAAAGC GCAAGAAATC TACATCGTTA CTTCCGGTGA AATGATGGCG ATGTACGCTG CAAACAACAT CGCTCGCGGT ATTTTGAAAT ATGCTCACTC CGGCGGTGTA CGCCTGGGTG GTCTAATTTG TAACAGCCGT AAAACTGACC GGGAAGACGA ACTGATTACC ACACTCGCAA ACCGATTAAG TACCCAGATG ATTCACTTCG TTCCCCGCGA CAACATCGTG CAGCACGCAG AGTTACGCCG GATGACTGTG AACGAATACG CACCTGATAG CAATCAAGCT AATGAATACC GCACATTAGC CGACAAGATT ATCAACAATC AAAATATGGC TGTTCCTACA CCCATCGAAA TGGATGAGTT AGAAGCATTG TTGATTGAGT TCGGTATCCT TGAAAGCGAC GAAGACAAGG AAAAATTGGT CGGTATGAGC AAAGCTGAAG AAGAAGCTCT CAAGAAGCAA GAAGAACTCA AAGCTCAAGC ACTGGAAGCT GTGCAGAAAG GCAACGTTGA AGTTGTTTCC CGTAACAATA AATAG
|
Protein sequence | MSDENIRQIA FYGKGGIGKS TTSQNTLAAM AEVGQRILIV GCDPKADSTR LILHTKAQTT VLHLAAERGA VEDLELDEVV LKGFRDIKCV ESGGPEPGVG CAGRGIITAI NFLEENGAYQ DVDFVSYDVL GDVVCGGFAM PIREGKAQEI YIVTSGEMMA MYAANNIARG ILKYAHSGGV RLGGLICNSR KTDREDELIT TLANRLSTQM IHFVPRDNIV QHAELRRMTV NEYAPDSNQA NEYRTLADKI INNQNMAVPT PIEMDELEAL LIEFGILESD EDKEKLVGMS KAEEEALKKQ EELKAQALEA VQKGNVEVVS RNNK
|
| |