Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_2817 |
Symbol | |
ID | 9340617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 2901901 |
End bp | 2903073 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | RpoD subfamily RNA polymerase sigma 70 subunit |
Protein accession | YP_003721786 |
Protein GI | 298491609 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00156659 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCAGG CTAACAACGT ACTCGACAGC ATTTATCAGC CTGACCTAGA AATGATAAAT CCGCCTGAGA TCGAAGAAGA ACTCTTATTG ATTGAGGATG AAGAGGACTT ATTGCTTACC GATGATGGCG AAATTGATGA TTTTTTAGAG CCTCAGTCTG ATGAGGACGA CGCAAAGTCT GGAAAAGCCG CTAAATCGCG TCGTCGGACA CAAAGCAAGA AAAAGCACTA TACTGAAGAC TCGATTCGTC TTTATCTGCA AGAAATTGGT CGTATTCGTC TATTACGAGC AGATGAAGAA ATAGAATTGG CGCGGAAAAT TGCGGATTTA CTGGAATTGG AAAGGGTGCG GGATCGACTG TACGAACAGT TAGAACGCGA ACCCCAATTT AAGGAATGGG CAGAAGCGGT ACAATTGCCA TTACCTACCT TCCGTTATCG CCTGCACGTT GGTCGCAGAG CGAAAGATAA AATGGTACAG TCGAACCTAC GACTTGTGGT TTCAATTGCC AAAAAATATA TGAATCGTGG CTTGTCTTTC CAAGACTTGA TTCAAGAAGG TAGTCTGGGC TTGATTCGGG CTGCTGAAAA GTTCGACCAC GAAAAAGGTT ATAAGTTTTC TACCTACGCT ACATGGTGGA TTCGGCAGGC AATTACTAGA GCGATCGCTG ATCAATCTCG CACTATTCGT CTACCTGTTC ATCTCTACGA GACGATATCA CGAATCAAGA AAACTACCAA ACTTTTATCT CAAGAAATGG GTCGCAAACC CACAGAAGAA GAAATTGCAA CTCGTATGGA AATGACCATC GAGAAACTGC GGTTTATTGC TAAATCTGCC CAGTTGCCAA TTTCATTAGA AACACCTATT GGTAAAGAAG AAGATTCCCG TTTGGGTGAT TTTATCGAAT CTGATGGAGA AACACCAGAA GATCAAGTTT CTAAAAATCT TCTCCGCGAA GACCTAGAAA AAGTCCTTGA TAGTCTCAGC CCCCGTGAAC GAGATGTTCT TAGACTCCGC TATGGTCTAG ATGACGGTCG AATGAAAACC CTCGAAGAAA TCGGTCAGAT TTTCAACGTT ACCCGCGAAA GAATTCGCCA AATTGAGGCT AAGGCACTAC GCAAATTACG CCACCCCAAT CGTAACAGCG TCCTCAAAGA ATACATTCGT TAA
|
Protein sequence | MNQANNVLDS IYQPDLEMIN PPEIEEELLL IEDEEDLLLT DDGEIDDFLE PQSDEDDAKS GKAAKSRRRT QSKKKHYTED SIRLYLQEIG RIRLLRADEE IELARKIADL LELERVRDRL YEQLEREPQF KEWAEAVQLP LPTFRYRLHV GRRAKDKMVQ SNLRLVVSIA KKYMNRGLSF QDLIQEGSLG LIRAAEKFDH EKGYKFSTYA TWWIRQAITR AIADQSRTIR LPVHLYETIS RIKKTTKLLS QEMGRKPTEE EIATRMEMTI EKLRFIAKSA QLPISLETPI GKEEDSRLGD FIESDGETPE DQVSKNLLRE DLEKVLDSLS PRERDVLRLR YGLDDGRMKT LEEIGQIFNV TRERIRQIEA KALRKLRHPN RNSVLKEYIR
|
| |