Gene Information |
Locus tag | PCC8801_1787 |
Symbol | |
ID | 7101849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 1873451 |
End bp | 1874986 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643474855 |
Product | nitrogenase molybdenum-iron protein beta chain |
Protein accession | YP_002371989 |
Protein GI | 218246618 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01286] nitrogenase molybdenum-iron protein beta chain |
|
|
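The length fields above can be cross-checked against the coordinates. A minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state the convention) and that the unspliced CDS ends with a single stop codon; the function names are illustrative only:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature on a closed, 1-based interval (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the final codon is the stop, not a residue


length = gene_length_bp(1873451, 1874986)
print(length, protein_length_aa(length))  # prints: 1536 511
```

Both results agree with the Gene Length (1536 bp) and Protein Length (511 aa) fields.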
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCAGA ACATTGATAA AATCCAAGAC CACGTTGAGT TATTCCACCA ACCAGAGTAC CAAGAGCTAT TTGAAAACAA GAAAGCTCTC CAAGGAATGG CTTCTGATGA GAAAGTCGCT GAAATAGCCG AATGGACCAA AACCTGGGAA TATCGGGAAA AGAACTTCGC TCGTGAAGCT CTGACCATCA ACCCCGCTAA AGCTTGTCAA CCTTTGGGTG CTATCTTAGC TGCGGTTGGT TTTGAAGGAA CCCTCCCCTT TGTGCATGGA TCACAAGGTT GTGTGGCTTA CTTCCGTACC CACTTTACCC GTCACTTCAA AGAGCCTTTC AGTGGTGTTT CTTCTTCCAT GACTGAAGAT GCAGCCGTCT TCGGTGGACT GAAAAACATG ATCGAAGGGT TACAGAATGC TTATAGTCTC TATCAACCCA AAATGATTGC TGTCTGTACA ACTTGTATGG CAGAAGTTAT TGGGGATGAC TTAGGTTCCT TCATTGGCAA TGCTAAGGCT GACGGTTCTG TTCCTCAAGA TTTCCCCGTT CCCTTTGCTC ACACTCCTTC TTTCGTGGGT TCTCATATCA CGGGATATGA CAACATGATG AAAGCCATCC TGTTGAACTT AACCGACGGC AAGAAACCTA CCACCAGCAA CGGTAAAGTT AACTTTATTC CTGGGTTTGA AACCTATGTT GGTAACCTAC GCGAACTGAA GCATTTAACC AGTGCTATGG GGGTTGATGC TACCATTTTA GGAGACAACG AACTCTATTT AGATTCTCCT AACGATGGCG AGTTCAAAAT GTACCAAGGT GGTACTACTC TAGAAGAAGG TGCTGATGCT ATCAATGCAA CCAAAACCAT CGCACTGCAA ACCTATCCCA CCGTTAAAAC CCTCGAATAC ATCGAGAAAG AATGGCAGCA ACCCACCGCT ACCTATCGTC CTTGGGGTAT TAAAGGAACG GATGAGTTCG TCATGGCTTT ATCTGAACTC ACTGGGAATC CTGTTCCTCC CGAATTGGAA CTAGAACGGG GACGCGCAGT GGATGCTATG ACCGATAGTC ATGCTTGGTT ACATGGTAAA AAAGCGGCTA TCTATGGCGA CCCTGACTTA GTCATGGGAA TGCTGCAATT CATGTTAGAG ATGGGTGTTG AACCTGTTCA CGTTTTGGTT CACAACTCTA CCACTGAATT TGAAGAAGAA GCCAAAGCTC TCTTAGCTTC TAGTCCTTAT GGTCAAAAAG CCACCGTTTG GGGCGGTAAA GACCTCTGGC ACCTCCGTTC CTTACTGTTT ACTGAACCTG TTGACTTCTT AATCGGGAAT TCCTACGGTA AATACCTCTG GCGTGATACC AAGATTCCTT TAATCCGCAT CGGGTATCCT ATCTTTGATC GCCACCACTT ACATCGCTAT TCTACCATTG GTTACAATGG CGCGATTAAC CTGCTCAATT GGATTGTTAA TGGTCTGTTT GAAGAAATCG ACCGCAACAC CAATATCCCC TCGAAGACCG ACATTTCCTT CGATTTAGTT CGTTAA
|
Protein sequence | MSQNIDKIQD HVELFHQPEY QELFENKKAL QGMASDEKVA EIAEWTKTWE YREKNFAREA LTINPAKACQ PLGAILAAVG FEGTLPFVHG SQGCVAYFRT HFTRHFKEPF SGVSSSMTED AAVFGGLKNM IEGLQNAYSL YQPKMIAVCT TCMAEVIGDD LGSFIGNAKA DGSVPQDFPV PFAHTPSFVG SHITGYDNMM KAILLNLTDG KKPTTSNGKV NFIPGFETYV GNLRELKHLT SAMGVDATIL GDNELYLDSP NDGEFKMYQG GTTLEEGADA INATKTIALQ TYPTVKTLEY IEKEWQQPTA TYRPWGIKGT DEFVMALSEL TGNPVPPELE LERGRAVDAM TDSHAWLHGK KAAIYGDPDL VMGMLQFMLE MGVEPVHVLV HNSTTEFEEE AKALLASSPY GQKATVWGGK DLWHLRSLLF TEPVDFLIGN SYGKYLWRDT KIPLIRIGYP IFDRHHLHRY STIGYNGAIN LLNWIVNGLF EEIDRNTNIP SKTDISFDLV R
|
| |
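The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that, not the database's own pipeline. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code, and it does not handle alternative start codons, which this gene (starting with ATG) does not need. The `translate` helper is hypothetical:

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
# Translation table 11 shares these assignments with the standard code;
# the two tables differ only in which codons may serve as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, then its last three codons.
print(translate("ATGTCTCAGA ACATTGATAA AATCCAAGAC"))  # prints: MSQNIDKIQD
print(translate("GTTCGTTAA"))                         # prints: VR
```

The two fragments match the start (MSQNIDKIQD) and the end (...VR) of the protein sequence in the record. Passing the full gene sequence to `translate` should yield the whole 511-residue protein.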