Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_0500 |
Symbol | |
ID | 9338286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 514514 |
End bp | 516619 |
Gene Length | 2106 bp |
Protein Length | 701 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | Oligopeptidase A |
Protein accession | YP_003720142 |
Protein GI | 298489965 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.452158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCAA CTGCTATTAT TTCTCAAAAT CCTCTACTCC AAGGCTTTGG TTTACCTGCA TTTGCAGAGA TTACACCAGA ACAAGTAGAA CCAGCTTTCA GGCATCTGTT AGCAGATTTG GAACAACAGC TAAATATTTT AGAAGCTAAT GTCCAACCTA CTTGGGATGG TTTAGTAGAA CCTCTAGAAA AGCTAACAGA AAGGCTGCAT TGGAGTTGGG GCATCCTGAA CCATCTAATG GGTGTTCAAA ATAGCCCAGA GCTACGGATA GCTTATCAAA AGGTTCAGCC ACTAGTAGTA CAGTTTATTA ATACCCTTGG ACAAAGTAAA CCTATATACA AGGCTTTTAA AGCACTCCGC GCCAGCGATA CTTGGGAAAC CTTAGAATCA GCCCAGCAGC GGATTATTGA AGCGGCTATT CGAGATGCAA AATTGTCTGG TGTCGGCTTA GAAGGACAAG CGCGAGAGCG TTTTAATGCT ATTCAGATGG AGTTAGCGGA ACTGGCTACC AAGTTTTCTA ATCATCTTTT GGATGCGACT ACAGCTTTTA GCTTAATTCT CACTACCAAA GCAGAAATAG AGGGTTTACC TAGCAGTTTA CTAAGTTTAG CTGCCCAAGC TGCTCGCATT GCTGGAGAAG AATATGCGAC TCCAGAAACA GGTCCTTGGC ATATTACGTT GGATTTTCCC AGTTATTTCC CGTTTATGCA GCACAGCACT AGGCGGGATT TGCGCGAAAA ACTTTACAAG GCTTATATTA CCCGTGCTTC TTTCGGGGAA TTGGATAATA ATCCTTTGAT TGAACGTATT TTAGAACTGC GGCAAGAATT GTCAGAGTTA CTTAGCTTTG ATAATTTTGC TGAATTGAGT TTGGCTAGTA AGATGGCGAA GAATGTCCCC GCAGTGGAAA AGCTGTTAGA AGAACTACGC CAAGCTAGTT ATGATGCGGC TGTTAAAGAT TTAGAAGCAC TTAAAGCTTT TGCTCAATCC AAGGGAGCAG CAGAAGCAGA TAATTTACAA CATTGGGATA TCAGCTTTTG GGCTGAACGT CAAAGAGAAG AAAAATTTGC TTTTACTGAA GAGGAATTAC GTCCTTATTT CCCTCTTCCC CAGGTGCTAG ATGGTTTATT TGGGCTAATT AAACGCCTGT TTGGTGTAAC GGTGACACCA GCTGATGGTC AAGCCCCTGT TTGGCATGAA GATGTACGCT ATTTTAAAAT CTCTGATAAA TATGCTAATG CGATCGCCTA CTTTTATCTA GACCCCTACA GCCGTCCCGC TGAAAAACGT GGTGGTGCTT GGATGGATGC CTGTATTCAT CGCCGCAAAA TCACAGAACT TGGTATAACT AGCATCCGCT TACCTGTAGC TTATTTGATT TGCAACCAAA CTCCTCCAGT TGATCACAAG CCTAGTTTAA TGACTTTTGA TGAAGTTGAA ACCTTATTCC ACGAATTTGG ACATGGTTTA CACCACATGC TCACCAAGGT TAATTATACT GGAGCTGCAG GCATCAATAA CGTTGAATGG GATGCAGTAG AATTACCTAG TCAATTCATG GAAAACTGGT GTTATGACCG TCCCACTTTG TTTGATATGG CTAAACATTA TGAAACAGGT AAACCTCTAC CAGAACATTA TTATCAAAAG CTCTTAGCAT CACGTAATTA TATGAGTGGT TCGGCAATGT TGCGTCAAAT TCACCTCAGC AGCGTGGACT TAGAACTACA CTACCGCTAT CGTCCTAGTG GTGACGAAAC TCCCATAGAT GTGCGTCAAC GCATTGCTAA AACCACTACC GTTTTACCGC CACTACCTGA AGATGGTTTT CTATGTGCAT TCGGTCACAT TTTTGAAGGA GGTTATGCAG CCGGTTACTA CAGCTACAAA TGGGCTGAAG TTCTCAGTGC GGATGCTTTT GCTGCTTTTG AAGAAGCTGG ACTAGAAGAT GAAGAAGCAA TTTATGTTAC TGGTAGACTT TACAGAGACA CAGTATTAGC ACTGGGCGGT AGTATGCACC CAATGGAAGT TTTTCAAGCC TTCCGACATC GAGAACCGAG TACCACGGCT TTACTCAAAC ATAATGGTTT GTTGCCTACT ATCTAA
|
Protein sequence | MSATAIISQN PLLQGFGLPA FAEITPEQVE PAFRHLLADL EQQLNILEAN VQPTWDGLVE PLEKLTERLH WSWGILNHLM GVQNSPELRI AYQKVQPLVV QFINTLGQSK PIYKAFKALR ASDTWETLES AQQRIIEAAI RDAKLSGVGL EGQARERFNA IQMELAELAT KFSNHLLDAT TAFSLILTTK AEIEGLPSSL LSLAAQAARI AGEEYATPET GPWHITLDFP SYFPFMQHST RRDLREKLYK AYITRASFGE LDNNPLIERI LELRQELSEL LSFDNFAELS LASKMAKNVP AVEKLLEELR QASYDAAVKD LEALKAFAQS KGAAEADNLQ HWDISFWAER QREEKFAFTE EELRPYFPLP QVLDGLFGLI KRLFGVTVTP ADGQAPVWHE DVRYFKISDK YANAIAYFYL DPYSRPAEKR GGAWMDACIH RRKITELGIT SIRLPVAYLI CNQTPPVDHK PSLMTFDEVE TLFHEFGHGL HHMLTKVNYT GAAGINNVEW DAVELPSQFM ENWCYDRPTL FDMAKHYETG KPLPEHYYQK LLASRNYMSG SAMLRQIHLS SVDLELHYRY RPSGDETPID VRQRIAKTTT VLPPLPEDGF LCAFGHIFEG GYAAGYYSYK WAEVLSADAF AAFEEAGLED EEAIYVTGRL YRDTVLALGG SMHPMEVFQA FRHREPSTTA LLKHNGLLPT I
|
| |