Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_1060 |
Symbol | |
ID | 9338856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 1132942 |
End bp | 1134327 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | |
Product | photosystem II 44 kDa subunit reaction center protein |
Protein accession | YP_003720540 |
Protein GI | 298490363 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000366993 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTAACGC TCTCTAATAG ATCAGTTATA GGAGGCGGAC GTGATCAAGA ATCAAGCGGC TTTGCTTGGT GGTCTGGAAA CGCTCGTTTA ATTAACCTAT CTGGTAAACT GCTTGGCGCT CACGTTGCCC ATGCTGGTTT AATCGTCTTC TGGGCTGGAG CAATGACTTT ATTTGAAGTT GCTCACTTTA TCCCAGAAAA GCCCATGTAC GAACAGGGCT TGATTCTTCT GCCTCACATT GCTACATTAG GTTGGGGCGT TGGTGCTGGT GGTGAAGTAA TTGACACCTT CCCCTACTTT GTTGCTGGTG TACTGCACCT GATTTCCTCT GCTGTACTGG GTTTTGGTGG CATCTATCAT GCCGTTCGTG GCCCAGAAAC ATTAGAAGAA TATTCTTCCT TTTTCGGTTA CGACTGGAAA GACAAGAACA AAATGACCAA CATCATCGGC TTCCACCTTA TCATCTTGGG ATGTGGTGCG CTGCTGTTGG TGTTAAAGGC TATGTTTTTC GGCGGTGTCT ATGATACCTG GGCACCCGGT GGTGGTGATG TGCGTGTTAT TACTAACCCC ACACTCAATC CAGCGATCAT CTTTGGTTAT CTAATTAAAG CTCCCTTCGG TGGCGAAGGC TGGATTGTTA GCGTTGATAA CATGGAAGAT GTTATCGGTG GTCACATTTG GATTGCTTTA ATTTGTATTT CCGGTGGTAT TTGGCACATC TTCACTAAGC CTTTTGCTTG GGCGCGTCGC GCTTTCATCT GGTCTGGTGA AGCTTACCTT TCCTACAGCT TGGGCGCTCT TTCCTTGATG GGCTTTATCG CTTCCATCAT GGTTTGGTAC AACAACACTG TTTACCCCAG CGAATTCTTC GGTCCTACTG GTCCTGAAGC TTCTCAAGCA CAAGCTTTAA CCTTCTTGAT TCGTGACCAA CGCTTAGGTG CTAACGTTGG TTCTGCTCAA GGGCCTACTG GTCTAGGTAA ATACTTGATG CGTTCTCCTA CTGGTGAAAT CATCTTCGGT GGTGAAACCA TGCGCTTCTG GGATTTCCGT GGTCCTTGGT TAGAGCCTCT CCGTGGTCCT AACGGTCTTG ACCTCGAGAA AATCAAGAAT GATATTCAGC CTTGGCAAGC TCGTCGTGCT GCTGAATACA TGACTCACGC TCCTCTAGGT TCTTTGAACT CTGTAGGTGG TGTAGCTACT GAAATCAACT CTTTCAACTA TGTATCTCCT CGTGCGTGGT TGTCTACCTC TCACTTCGTA TTAGGTTTCT TCTTCCTTAT CGGTCACTTG TGGCATGCAG GACGCGCTCG TGCGGCTGCT GGTGGTTTTG AGAGAGGTAT TGACCGTGAG AACGAACCAG CATACAGCAT GAAGGATCTT GACTAG
|
Protein sequence | MVTLSNRSVI GGGRDQESSG FAWWSGNARL INLSGKLLGA HVAHAGLIVF WAGAMTLFEV AHFIPEKPMY EQGLILLPHI ATLGWGVGAG GEVIDTFPYF VAGVLHLISS AVLGFGGIYH AVRGPETLEE YSSFFGYDWK DKNKMTNIIG FHLIILGCGA LLLVLKAMFF GGVYDTWAPG GGDVRVITNP TLNPAIIFGY LIKAPFGGEG WIVSVDNMED VIGGHIWIAL ICISGGIWHI FTKPFAWARR AFIWSGEAYL SYSLGALSLM GFIASIMVWY NNTVYPSEFF GPTGPEASQA QALTFLIRDQ RLGANVGSAQ GPTGLGKYLM RSPTGEIIFG GETMRFWDFR GPWLEPLRGP NGLDLEKIKN DIQPWQARRA AEYMTHAPLG SLNSVGGVAT EINSFNYVSP RAWLSTSHFV LGFFFLIGHL WHAGRARAAA GGFERGIDRE NEPAYSMKDL D
|
| |