Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_3985 |
Symbol | |
ID | 9341789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 4049087 |
End bp | 4050256 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | RpoD subfamily RNA polymerase sigma 70 subunit |
Protein accession | YP_003722597 |
Protein GI | 298492420 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACCAAA CAAAGCAACA ATCCCGAAAG GAAACTATGA ATCTTGCTGA ATTGGGAACA ATGGAAATAC TAGAGACTGC TGCTGATCAT GAAGAACCAT CACTTGATAG TTTAGAAGCA GTAGTATTTG AAGACTCTTC AATCATAGAA AATTTGGAGT TAGATGAACG CGATGGCGAT GAAATGGCCG CGGCTCGTCC TTCCGGATAC AATAAAACCG AACATGACGA TGCTGTAGGC GCTTTTTTCA AAGAAATGGC GCGTTATCCC CTCCTTAAAC CTGATGAAGA AGTAGAATTA GCACGACGAG TTAGGTTTTT AGAAGAAGTA AAAGACTTAC AAGCGGCTTT AGAAGAAGAA CTAGGACAGC AACCAAGCAG AAGCGAAGTA GCTGCTAAGT TTGAGATGAC AGAAAAACAA CTAGAAAGCC GCTTATATCA AGGACGGGTA GCCAAGCGAA AAATGATTCG CTCCAATTTA AGGCTAGTAG TATCTATTGC TAAACGATAT CTTAACCGGG GAGTTCCTTT TCTAGATTTG ATTCAGGAAG GAGCAATGGG TTTAAACCGC GCTACAGAAA AGTTTGACCC CGATAAAGGA TATAAATTCT CAACCTACGC TTATTGGTGG ATTAGACAGG CAATTACAAG AGCGATCGCT AATGATGCCC GCACCATTCG CTTACCGATA CATATTGTTG AAAAACTTAA CAAACTCAAA AAAGCTCAAC GCGAACTAAA GCAAAAACTA GCTCGTAACC CCTCGGAAGC AGAAATGGCC ACAGCCTTAG AAATTAGCAT CCAACAACTG CGTCAACTCC AACAACTGCG TCGTCAAGCA CTCTCCCTTA ACCACCGTGT CGGTAAAGAA GAAGACACCG AATTAATGGA CTTACTAGAA GACGAAGATA ACCAATCTCC AGAAGCAAAA ATGAACGAAA ACATGATGCG TCAGGAGATT TGGGAAGTGT TAGGAGATGT CCTCACCCCA CGAGAAAAAG ACGTAATCTC TCTGCGCTAT GGACTAACAA CCAGCGAACC CTGCACCCTA GAAGAAGTTG GTAATATGTT CAACCTTTCC CGTGAACGAG TACGCCAAAT TCAAAGTAAA GCCATGCGAA AATTACGCCG TCCCCACATA GCTAAACGTT TAAAAGGGTG GTTGATATGA
|
Protein sequence | MYQTKQQSRK ETMNLAELGT MEILETAADH EEPSLDSLEA VVFEDSSIIE NLELDERDGD EMAAARPSGY NKTEHDDAVG AFFKEMARYP LLKPDEEVEL ARRVRFLEEV KDLQAALEEE LGQQPSRSEV AAKFEMTEKQ LESRLYQGRV AKRKMIRSNL RLVVSIAKRY LNRGVPFLDL IQEGAMGLNR ATEKFDPDKG YKFSTYAYWW IRQAITRAIA NDARTIRLPI HIVEKLNKLK KAQRELKQKL ARNPSEAEMA TALEISIQQL RQLQQLRRQA LSLNHRVGKE EDTELMDLLE DEDNQSPEAK MNENMMRQEI WEVLGDVLTP REKDVISLRY GLTTSEPCTL EEVGNMFNLS RERVRQIQSK AMRKLRRPHI AKRLKGWLI
|
| |