Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_3381 |
Symbol | |
ID | 9341186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 3444918 |
End bp | 3446108 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | peptidase M50 |
Protein accession | YP_003722159 |
Protein GI | 298491982 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000039902 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAAA ATTGGCAAAT TGGATCTTTC TTTGGCATTC CGCTGTTTCT AGACCCTTTA TGGTTTGTAG TTTTGGGGTT GGCAACTTTA AATTTTGGGG TGGCTTATCA ACAGTGGGGA AATGTCACGG CTTGGAGTGC TGGACTCGTA ATGGCACTGC TGTTATTTGC TTCTGTGCTG TTACATGAGT TGGGTCACAG CTTGGTGGCA CAGTTACAAG GGATTAAAGT TAACTCTATT ACGCTATTTT TCTTTGGTGG GATTGCTGCA ATAGAGGAGG AATCAAAAAC ACCTGGTAAA GCCTTTCAAG TAGCGATCGC AGGTCCTTTG GTAAGTATAG TGCTATTTTT GCTGCTAAGT CTAGTTTCTA GTGTCATACC TGATACCACT CTACTAAGTT TTATGGTCAG GGATTTAGCA AGAGTTAACT TGATTGTGGC TTTATTTAAC TTGATTCCTG GCTTACCTCT AGATGGGGGA CAGGTGTTAA AGGCAGCACT ATGGAAACTT ACAGGCGATC GCTTTCAAGC AGTACATTGG GCAGCCCGGT CTGGACAGAT TTTAGGTTAT AGTGCGATCG CTATAGGATT TGCTTTAGAT TTCTTTACCA GGGAATTGGT GACAGGTCTG TGGATCGTGT TATTAGGTTG GTTCGGGATT CGCAATGCTA ACAGCTACGA CCGTGTAACC ACATTACAAG AAACTCTACT CAAGCTAGTA GGTAGTGATG CTATGACTCG TGATTTTCGC GTAGTGGATG CAGACCAAAC ATTGCGGGAA TTTGCTGACT TATATCTTTT AGAATCATCT TCCCCTCATG TTTACTTTGC TGCTGCTGAT GGACGTTACC GAGGTTTAGT AAAAGTTGAT GATTTGCGAA CAACGGAAAG AAGTCAGTGG GAAACCCAAA CTCTACAAAG CATTGTCCAT CCCCTCACAA CAATACCCAC AGTTAGTGAA TCCACTTCCT TAGCAGAGGT AATTAACAAA CTAGAAAACG AACAGCTACC TCAAATTACG GTACTTTCTC CCGCAGGTGC TGTAGCTGGT GTGATTGATA GGGGAGATAC TGTCAAATGT TTAGCACAAA AGTTGAATTT GCGAATTACC GATGCTGAAA TCAAGCAAAT TAAAGAAGAA GGTAGTTTTC CCCCAGGGTT ACAACTAGGA GTCATAGCGA AGTCTGTTTA G
|
Protein sequence | MQKNWQIGSF FGIPLFLDPL WFVVLGLATL NFGVAYQQWG NVTAWSAGLV MALLLFASVL LHELGHSLVA QLQGIKVNSI TLFFFGGIAA IEEESKTPGK AFQVAIAGPL VSIVLFLLLS LVSSVIPDTT LLSFMVRDLA RVNLIVALFN LIPGLPLDGG QVLKAALWKL TGDRFQAVHW AARSGQILGY SAIAIGFALD FFTRELVTGL WIVLLGWFGI RNANSYDRVT TLQETLLKLV GSDAMTRDFR VVDADQTLRE FADLYLLESS SPHVYFAAAD GRYRGLVKVD DLRTTERSQW ETQTLQSIVH PLTTIPTVSE STSLAEVINK LENEQLPQIT VLSPAGAVAG VIDRGDTVKC LAQKLNLRIT DAEIKQIKEE GSFPPGLQLG VIAKSV
|
| |