Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_1059 |
Symbol | |
ID | 9338855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 1131903 |
End bp | 1132958 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | |
Product | photosystem II D2 protein |
Protein accession | YP_003720539 |
Protein GI | 298490362 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00303262 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATCG CAGTTGGACG CGCCCAGTCA AGAGGGTGGT TTGACGTACT AGACGACTGG TTGAAGCGCG ATCGCTTCGT ATTCGTAGGT TGGTCAGGAA TATTACTATT CCCCTGCGCC TTCCTAGCAC TAGGTGGCTG GCTAACCGGT ACAACCTTCG TCACCTCCTG GTACACCCAC GGATTAGCAT CCTCCTATCT AGAAGGAGCA AACTTCCTCA CAGTAGCAGT ATCCACACCC GCAGACAGCA TGGGACACTC CCTACTGTTA CTCTGGGGAC CTGAAGCTCA AGGTAACTTA ACTCGTTGGT TTCAACTAGG TGGCCTATGG CCATTCGTTG CTCTACACGG AGCATTCGGA CTAATCGGCT TCATGTTGCG CCAATTTGAA ATTGCCAGAC TAGTAGGGAT TCGTCCTTAC AACGCTCTCG CCTTCTCAGG CCCCATCGCC GTATTCGTCA GCGTCTTCTT GATGTATCCC TTGGGACAAT CTAGCTGGTT CTTTGCACCC AGCTTTGGTG TAGCAGCAAT CTTCCGATTC TTGTTATTCC TACAAGGTTT CCACAACTGG ACACTCAACC CCTTCCACAT GATGGGTGTA GCCGGAATAT TAGGTGGTGC ATTGCTATCT GCCATTCATG GTGCAACAGT AGAAAACACC CTGTTTGAAG ATGGCGAAGG CTCCAATACC TTCCCCGGAT TTAATCCCAC CCAGGCGGAA GAAACCTACT CCATGGTGAC AGCAAACCGA TTCTGGTCAC AGATTTTCGG GATTGCTTTC TCTAACAAAC GTTGGTTACA CTTCTTCATG TTGTTTGTCC CAGTCACAGG CTTGTGGATG AGTGCAGTAG GAATTGTGGG TTTAGCATTA AACCTACGGG CTTATGACTT CGTTTCCCAA GAATTACGGG CAGCAGAAGA CCCAGAGTTT GAAACTTTCT ATACCAAAAA TATTTTGCTG AACGAGGGTA TCCGCGCTTG GATGGCTCCT CAAGATCAAC CCCACGAACA ATTTGTATTC CCTGAGGAGG TATTACCTCG TGGTAACGCT CTCTAA
|
Protein sequence | MTIAVGRAQS RGWFDVLDDW LKRDRFVFVG WSGILLFPCA FLALGGWLTG TTFVTSWYTH GLASSYLEGA NFLTVAVSTP ADSMGHSLLL LWGPEAQGNL TRWFQLGGLW PFVALHGAFG LIGFMLRQFE IARLVGIRPY NALAFSGPIA VFVSVFLMYP LGQSSWFFAP SFGVAAIFRF LLFLQGFHNW TLNPFHMMGV AGILGGALLS AIHGATVENT LFEDGEGSNT FPGFNPTQAE ETYSMVTANR FWSQIFGIAF SNKRWLHFFM LFVPVTGLWM SAVGIVGLAL NLRAYDFVSQ ELRAAEDPEF ETFYTKNILL NEGIRAWMAP QDQPHEQFVF PEEVLPRGNA L
|
| |