Gene Information |
Locus tag | Aazo_4771 |
Symbol | |
ID | 9342578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4871120 |
End bp | 4872220 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | |
Product | PfpI family intracellular protease |
Protein accession | YP_003723072 |
Protein GI | 298492895 |
COG category | |
COG ID | |
TIGRFAM ID | |
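The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `lengths_from_coords` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon, neither of which the record states explicitly.

```python
from typing import Tuple


def lengths_from_coords(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last
    codon is a stop codon that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the terminal stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
print(lengths_from_coords(4871120, 4872220))  # (1101, 366)
```

Under these assumptions the result agrees with the listed Gene Length (1101 bp) and Protein Length (366 aa).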
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.102673 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAATA ATAATCATAT TATTGGAGAT CAAAAAGTTG CTATCCTGAT TGAACAAGGA GTAGAAGACG TAGAATTTAT TGTTCCTTTT AATGGTTTGA AACAAGCAGG AATAGAGGTA ATAGTGCTTG GTTCACGGAT GAATGAAAAA TATAAAGGTA AACGAGGCAA ACTCAGCATC CAAGCTGATG CAACGACAAC AGAAGTTGTG GCTGATGAAT TTGCAGCAGT GGTAATTCCT GGTGGTATGG CTCCTGATAA AATGCGCCGC AATTGTAATA CAGTTTGGTT TGTAATGGAG GCTATGAAGC AAGGTAAATT AATAGCCGCA GTATGCCACG GTCCACAGGT TTTAATTGAA GGTGATTTAC TGAAAGGTAA ACAAGTAACA GGATTTGCTG CTATTTGCAA AGACATAACT AATGCTGGTG CCAATTATCT AGATGAACCA GTAGTTGTGG ATGGTAATTT GATTACATCT CGTGAACCTG GAGACTTGGC AATTTTTACA ACGGTACTGT TAAATCGTTT AGGTTATGGT GGTAAAGATG CTGTTTTACC TAATGAAAAA GATACTGGTG CTGAATGGTG GAAATTAGCT GATGCTTGGG GTGGTTCAAC AAAAAACGAA ATTGTTAAAG GTTTGAATAC TGCTTTAGGT GGTGAGCGTT ATTCGTTAGA AGCTTTAGAA AAATATTTAG AGAAAGAATC AGATGAGGAA GTGAAAAATC TGTTTCAAGA GATGATAACT AATAAAAATC AGCACATTAA AAAGCTGGAA AGTTATCTTC ATCGTTTCCA TGAAAAACCG TCTTTGACTG CAAATATTGC TAATCAATAT GCTAAGCTTA AAACAGCTTT AACGGGTAGT GAGAGTATCT ATCAAATTCG TTGTGCATTG GGTGATATAC AAACAGCAAT TGGTGATATT ACCAACTTGT CTGCAATGCT TACTGACCCA GTAGCAACGG CAATTTTTAA ACAAATTCAC AACGATTTGG GTAAATATGA ACAGCGATTG ATTGAGCTTT ATCGAGGGCG GATTGCTGCT GGTGTGAGGC CTCCTAAACC AACTTCTAGG GCGGCTGTAA CTCAAGTTTA A
|
Protein sequence | MRNNNHIIGD QKVAILIEQG VEDVEFIVPF NGLKQAGIEV IVLGSRMNEK YKGKRGKLSI QADATTTEVV ADEFAAVVIP GGMAPDKMRR NCNTVWFVME AMKQGKLIAA VCHGPQVLIE GDLLKGKQVT GFAAICKDIT NAGANYLDEP VVVDGNLITS REPGDLAIFT TVLLNRLGYG GKDAVLPNEK DTGAEWWKLA DAWGGSTKNE IVKGLNTALG GERYSLEALE KYLEKESDEE VKNLFQEMIT NKNQHIKKLE SYLHRFHEKP SLTANIANQY AKLKTALTGS ESIYQIRCAL GDIQTAIGDI TNLSAMLTDP VATAIFKQIH NDLGKYEQRL IELYRGRIAA GVRPPKPTSR AAVTQV
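The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus two basic checks. All function names are hypothetical, and the stop codon set (TAA, TAG, TGA) is an assumption based on the standard stops used with translation table 11.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop.

    Assumption: `stops` lists the stop codons of the genetic code in use.
    """
    return len(seq) % 3 == 0 and seq[-3:] in stops


# Example with the first two display blocks of the gene sequence.
fragment = clean_sequence("ATGAGAAATA ATAATCATAT")
print(fragment, len(fragment))
```

Running `gc_fraction` on the full cleaned gene sequence gives a value that can be compared with the GC content field, and `ends_with_stop` offers a quick check that the pasted sequence was not truncated.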
|
| |