Gene Information |
Locus tag | Aazo_4113 |
Symbol | |
ID | 9341918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 4181207 |
End bp | 4182736 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | |
Product | photosystem II chlorophyll-binding protein CP47 |
Protein accession | YP_003722680 |
Protein GI | 298492503 |
COG category | |
COG ID | |
TIGRFAM ID | |
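The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4181207
END_BP = 4182736
GENE_LENGTH_BP = 1530
PROTEIN_LENGTH_AA = 509


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1530
print(coding_codons(GENE_LENGTH_BP))   # 509
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields in the record.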
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACTAC CTTGGTACCG AGTACATACA GTTGTTCTGA ACGATCCAGG TCGACTGATT TCTGTACACC TGATGCACAC AGCACTAGTC GCCGGCTGGG CTGGTTCAAT GGCATTATAC GAACTAGCTG TATACGACCC CAGTGATCCA GTTCTCAACC CCATGTGGCG ACAAGGGATG TTTGTTCTTC CTTTCATGTC ACGTTTGGGC GTAATCAAAT CCTGGGGCGG TTGGAGTGTT ACTGGTGGCA CAGCAGTAGA TCCTGGCTTC TGGTCATTTG AAGGCGTTGC TGCTGCTCAC ATTGTTCTTT CCGGTTTGTT ATTCCTAGCA GCCGTTTGGC ACTGGGTTTA CTGGGATTTG GAACTCTTCA GAGATCCTCG TACCGGCGAA CCTGCCTTAG ACTTGCCAAA AATGTTTGGC ATCCACTTAT TCTTATCTGG TTTACTTTGC TTCAGCTTCG GTGCTTTCCA CCTCACCGGA CTATTTGGTC CTGGGATGTG GGTATCCGAT GCCTTTGGTG TCACTGGCAG CATCCAACCA GTAGCACCAG AATGGGGACC AGCCGGGTTT AACCCCTATA ACCCTGGTGG CATTGTCGCT CACCACATTG CAGCTGGTGT AGTTGGTATT ATCGCTGGTT TATTCCACCT CACAGTCAGA CCCCCCGAAA GGCTCTACAA AGCACTACGG ATGGGTAACA TTGAAACCGT ACTTTCCAGC AGTATTGCTG CGGTGTTCTT CGCTGCTTTC GTAGTAGCTG GTACTATGTG GTACGGTAAC GCTGCTACCC CCATCGAATT GTTTGGACCT ACCCGTTACC AGTGGGATCA AGGCTACTTC CGTCAAGAAA TTGAGCGCCG TGTGCAAACC AGTGTTGCTC AAGGCACAAG TCTAAGTGAA GCTTGGTCAC AAATCCCCGA AAAATTGGCC TTCTACGATT ACGTAGGTAA TAGCCCCGCT AAAGGTGGTC TATTCCGTAC AGGTCCAATG GTTAAGGGTG ATGGTATTGC CCAATCTTGG CAAGGCCACG CAGTATTCAC AGATGCAGAA GGACGTGAGT TAACTGTACG TCGTCTGCCT AACTTCTTTG AAACCTTCCC AGTAATTTTG ACCGATAAAG ATGGAATTGT CCGCGCTGAC ATTCCTTTCC GTCGGGCAGA ATCTAAATAT AGCTTTGAGC AAACAGGCGT TACTGTTAGC TTCTACGGCG GCAATCTCAA CGGCAATACC TTTACAGATC CTGCTGACGT GAAGAAATAC GCTCGTAAAG CTCAAGGTGG AGAAATATTT GAATTTGACC GCGAAACCTT AAACTCTGAT GGTGTATTCC GTACATCTCC CAGAGGTTGG TTTACCTTTG GTCACGCGGT ATTTGCCCTA CTGTTCTTCT TTGGACACCT CTGGCATGGT TCTCGGACAA TCTACCGTGA CGTCTTTGCC GGTGTAGAAG CGGATCTGGA AGAGCAAGTT GAGTGGGGTC TGTTCCAGAA AGTTGGTGAC AAGACCACTC GTACGCGTAA AGAAGCTTAA
|
Protein sequence | MGLPWYRVHT VVLNDPGRLI SVHLMHTALV AGWAGSMALY ELAVYDPSDP VLNPMWRQGM FVLPFMSRLG VIKSWGGWSV TGGTAVDPGF WSFEGVAAAH IVLSGLLFLA AVWHWVYWDL ELFRDPRTGE PALDLPKMFG IHLFLSGLLC FSFGAFHLTG LFGPGMWVSD AFGVTGSIQP VAPEWGPAGF NPYNPGGIVA HHIAAGVVGI IAGLFHLTVR PPERLYKALR MGNIETVLSS SIAAVFFAAF VVAGTMWYGN AATPIELFGP TRYQWDQGYF RQEIERRVQT SVAQGTSLSE AWSQIPEKLA FYDYVGNSPA KGGLFRTGPM VKGDGIAQSW QGHAVFTDAE GRELTVRRLP NFFETFPVIL TDKDGIVRAD IPFRRAESKY SFEQTGVTVS FYGGNLNGNT FTDPADVKKY ARKAQGGEIF EFDRETLNSD GVFRTSPRGW FTFGHAVFAL LFFFGHLWHG SRTIYRDVFA GVEADLEEQV EWGLFQKVGD KTTRTRKEA
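As a rough consistency check between the two sequence fields, the gene sequence can be translated and compared with the protein sequence. The sketch below assumes the gene sequence is given 5' to 3' in coding orientation, as it begins with ATG and ends with TAA. It uses the codon assignments of translation table 11, which for sense and stop codons match the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first and last 30 bases of the record are used as examples.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate an in-frame sequence and drop any trailing stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s), 3))
    return protein.rstrip("*")


# First and last 30 bases of the gene sequence above.
print(translate("ATGGGACTAC CTTGGTACCG AGTACATACA"))  # MGLPWYRVHT
print(translate("AAGACCACTC GTACGCGTAA AGAAGCTTAA"))  # KTTRTRKEA
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison with the Protein sequence and GC content fields. This sketch does not handle alternative start codons, which table 11 permits but which do not apply to a gene starting with ATG.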