Gene Information |
Locus tag | Aazo_4860 |
Symbol | |
ID | 9342667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4971924 |
End bp | 4973159 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003723129 |
Protein GI | 298492952 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0351089 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACATC AACAAGGCAG GACGAATGGA AGCACAAATT CACAACTTCA GGGGGGAAGT AAAACCAAAT TTCCCCGCTT CTATTCTGAT GATAATTTAA AATTACCACA AAATTGGTCT TACCTGCCTT TGACCTTATT TTTGGTCATT GGTTTGTTTG CTTGTGGTAA TCCTTCTCCT CATAAACCAA ATAGAGGGGA TTCATCTAAC CAGGATACCA ATAGTAAGTT AACTTTTTTT GGCGTTGCTT TAGAACAATT TGATGAAGTA GGTAGACCTA TTTGGAAAGT CAAAGCTAAA AAGGCAAAAT ATACTAAAGA AAAAGAAATT GGTGAAGCAC AAAATCCCGA TGGTGAACTC TACCAAGATG GTAAAGTAGT TTACACAATT AGAGGTGAAA CAGCTGATAT TCAGCAGGAT GGAAAACAGC TATTCCTTAA AGGTAAGATT ATTGCTACCG ATCCCCATAA TGGTACTATC TTAAAAGGTA ATGAATTAGA ATGGCGACAT AAAGAAGATT TATTAATTGT TCGTAATCAA TTAAATGGGA CTCATAAAGA ACTACAAGCA ACCGCTCAAG AAGTAAGAGT AAAAACCCGG GAACAACGAA TAGAATTTGC TGGTAAAGTA GTTGCGATAT CTGCTGATCC TCAGTTGCAA ATGCGAACTG AAAGGTTAAT TTGGCAGATT AAAGAAGGAA AATTAATTAG CGAGTGCCCC ATTCAAATTG ACCGCTATAA AGATAATAAA ATCACTGATC GTGGTCAAGG AAATGCTGCA GAAATTAACT TAAAAACCAA AATTGCTACT ATTGGACCTA AAGCCAAACT AGAGTTAATA GAACCACCTA TGCAGATAGT TAGTAACTCT ATGACCTGGA ATATCAATCA AGAAACTGTT AAGGCAAATT CCCCTGTGCG TGTTTTTCAC CAAGCTGAAA ATGTGACTGT AACTGGCAAT AAAGCAGAAG TAAAGATTCT ACAAAAAAGT GTTTATTTAA CAGGCAATGT GAATGCTGTA GGACAACACA AGCAATCTTT AAAATCAAAT CTACTTACTT GGTATTTAGA AAGAAAATTA CTAGAAGCCC AGGGGAATGT GGTTTATCTT CAAGTTGATC CACCGTTAAA TCTTCAAGGT GCAACCGCAC TGGCTAATCT ACAAACAGAC AATATTGTTG TTAAAGGTGG CAGTTATAAC GACAGAGTGG TAACAGAAAT TATTCCGCAG GAATGA
|
Protein sequence | MQHQQGRTNG STNSQLQGGS KTKFPRFYSD DNLKLPQNWS YLPLTLFLVI GLFACGNPSP HKPNRGDSSN QDTNSKLTFF GVALEQFDEV GRPIWKVKAK KAKYTKEKEI GEAQNPDGEL YQDGKVVYTI RGETADIQQD GKQLFLKGKI IATDPHNGTI LKGNELEWRH KEDLLIVRNQ LNGTHKELQA TAQEVRVKTR EQRIEFAGKV VAISADPQLQ MRTERLIWQI KEGKLISECP IQIDRYKDNK ITDRGQGNAA EINLKTKIAT IGPKAKLELI EPPMQIVSNS MTWNINQETV KANSPVRVFH QAENVTVTGN KAEVKILQKS VYLTGNVNAV GQHKQSLKSN LLTWYLERKL LEAQGNVVYL QVDPPLNLQG ATALANLQTD NIVVKGGSYN DRVVTEIIPQ E
|
| |
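The coordinate, length, and sequence fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the helper names (`gene_length`, `protein_length`, `gc_content`, `translate`) are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so the standard assignments are used here.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments with the
# standard genetic code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Strip the whitespace used to group the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    length = gene_length(4971924, 4973159)
    print(length, protein_length(length))
    # First three 10-letter blocks of the gene sequence listed above.
    print(translate("ATGCAACATC AACAAGGCAG GACGAATGGA"))
```

Passing the full gene sequence to `gc_content` and `translate` allows a comparison with the listed GC content and protein sequence.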