Gene Information |
Locus tag | Ava_4247 |
Symbol | |
ID | 3680895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5326926 |
End bp | 5327816 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637719595 |
Product | nitrogenase iron protein subunit NifH |
Protein accession | YP_324741 |
Protein GI | 75910445 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1348] Nitrogenase subunit NifH (ATPase) |
TIGRFAM ID | [TIGR01287] nitrogenase iron protein |
|
|
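The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Ava_4247.
length = gene_length_bp(5326926, 5327816)
print(length, protein_length_aa(length))
```

With the values listed for Ava_4247, this reproduces the reported gene length of 891 bp and protein length of 296 aa.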
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.252394 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGACA ACATTAGACA AATCGCTTTC TACGGTAAAG GTGGTATCGG TAAATCCACA ACCTCGCAAA ACACGATCGC TGGTCTTGCA GAAATGGGCG AACGCATCAT GATTGTCGGA TGCGACCCCA AAGCAGACTC CACTCGTTTG ATGTTGCACA GCAAAGCTCA AACTACCATT CTCCACTTAG CTGCGGAACG CGGTGCAGTC GAAGATTTAG AACTGGACGA AGTATTATTG ACTGGTTATC AAGGCGTTAA GTGTGTCGAG TCTGGCGGCC CCGAACCCGG TGTGGGTTGC GCTGGACGTG GTATCATCAC CGCCATTAAC TTCTTAGAAG AAAACGGCGC TTACGAAGAC CTCGATTTCG TTTCTTACGA CGTATTAGGC GACGTTGTTT GTGGTGGTTT CGCCATGCCA ATTCGGGAGG GGAAAGCACA AGAAATCTAT ATCGTTTGCT CCGGTGAGAT GATGGCGATG TATGCGGCAA ACAACATTGC TCGCGGTATT TTGAAATATG CTCACTCCGG TGGTGTACGC TTGGGTGGTC TCATCTGCAA CAGCCGTAAT GTTGACCGAG AAGTCGAATT GATTGAAGCA TTGGCTGAAA GACTCAATAC CCAAATGATT CACTTTGTCC CTCGCAACAA CGTAGTACAA CACGCAGAAC TGCGCCGGAT GACAGTAATT GAATACGCTA CCGAACATCC CCAAGCCAAC GAATACCGCA CCTTGGCGAA GAAAATCAAA GAAAATACCA AACTGACCAT CCCCACCCCC ATCTCAATGG ACGAATTGGA AGAACTGCTG GTGGAGTTCG GTATTCTTGG CGGCGAAGAA GAATATCAAA AAGCGATCGC TCAAGACGCA GGCAAAGCAG TAGTAGTCTA A
|
Protein sequence | MIDNIRQIAF YGKGGIGKST TSQNTIAGLA EMGERIMIVG CDPKADSTRL MLHSKAQTTI LHLAAERGAV EDLELDEVLL TGYQGVKCVE SGGPEPGVGC AGRGIITAIN FLEENGAYED LDFVSYDVLG DVVCGGFAMP IREGKAQEIY IVCSGEMMAM YAANNIARGI LKYAHSGGVR LGGLICNSRN VDREVELIEA LAERLNTQMI HFVPRNNVVQ HAELRRMTVI EYATEHPQAN EYRTLAKKIK ENTKLTIPTP ISMDELEELL VEFGILGGEE EYQKAIAQDA GKAVVV
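The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a sketch under stated assumptions, not the database's own method: the helper names are hypothetical, whitespace in the 10-character blocks is ignored, and the codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the genetic code table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalize case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 21 bases of the gene sequence, copied from the record.
prefix = "ATGATTGACA ACATTAGACA A"
print(translate(prefix))
```

Applied to the first seven codons of the gene sequence, `translate` yields `MIDNIRQ`, which matches the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the protein sequence and GC content fields to be checked in the same way.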