Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_0049 |
Symbol | |
ID | 9337832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 42970 |
End bp | 44670 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003719826 |
Protein GI | 298489649 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTAC CTATTGTGTC TGAGAAGAAA AAAAAGAAGC GGTCTTTAGT ACTGACATTA TCAGCTGCGG CGTTATTGAT TGGTGTAGGC AGTTTTGCTC ATTGGTTTTT TACCCAAGGA CGACCGTTTT CTAGAAGTTT GCCAGTGGGT GCAAATATTA TTCCTCAAGA TGCCTTGTTT GCAGTTTCTT TAACAACAGA TACTAAACAA TGGCAAAAGT TGAGGGAATT TGGGACACCA GAAACTCAAA AGGAACTGGA TAAAAATTTG GTGCAACTGC GTGATCGCTG GTTAACCAAT AATGGCTACA ATTTCGAGAA GGATATTCAA CCTTGGGTGG GGGATGAGGT CACCATTGCT GTTTTGCCTC CCCCAGTAGT CAAGCCAGTG CTGAAACCAG TAGCTACTGA GGCTAATATT CCTTATGAAC AGTCAATGGT GATGGTGCTG CCAATCAAAA ACCCAGCAAT TGCTAAAAAG ATGTTGGCAC AATCTCAAAC CCTTAAACAA GGTAAATCGA CTGAGCTTAC TTATCGGGGA ATCGCAATTA AGCAAACTGA AGGAAAAGCT GGAGAAAAGC TGTCAGCAAC ATTAATAAAT CAGCAATTGC TTGTAATTAC AGATAATGCT AAAGCCACAG AAAAAACAAT TGATGCTTAT AAAAATCAAA CATCTGTAGC AACATTAGTA GGCTTTGCAG AAAATTTCTC AAAAACCTCT AGTTATCAAC CGTTTGCTCA ATTTTATATA AATGTACCCT TGGCTGCAAA AATAGCGGCG ACAGCCCCTA ATCGACGTTT ACCTGTTCAG GTTCTTGCCC AACTTCAGAA TAACCAAGGT TTAGCGGGAA CTCTGAACTT AGAATCTGAA GGAGTGCGTT TAAAGGGTAT TTCTTGGTTA AGCCCTAATA GTCAAAGAGT TTTGGCGGTA GAAAATAAAG CTGGAAGTAT GCAAAATCGC CTCCCCAGCG AAACCTTAAT GATGTTGTCT GGTAGTAACT TAAAGCGGTT GTGGGCAGAC TATGTTTCTA CTTCTGGGGG AAATCCGTTG GCACCAATCA AACCCGAAGA ACTGCGACGG GGTGTAAAAT CTTTAACAAA TTTGGATTTA GATCAAGATT TACTGAGTTG GATGAAGGGG GAATTTTCAG TTTCGGTAAT TCCTAATACT TCACGAGATG GTTCACCGGA TAACTTTCGG GCTGGTTTAG TATTTATGGT TAAGGTAAGT GATGGTAAAG CTGGGGTACG GCAATCGGCT GAAACTGCTT TACAAAATCT TGATGATGTG CTGAAAAATC AATACCAGTT TAAAGTTGAA TCAGCTACTG TTGGGGGTAA ACCCGTTGTT AACTGGATTT CACCTTTTGG GACTTTAACG GCTACTCATG GTTGGTTAGA TGACAATGTG GTTTTTTTCG GCTTCGGCGC TCCCATCAGC GATAAAATTG TTCCTAAACC CAACAATACT CTAGCCAACA CTCTACGTTT TCAACAAACC GTTCCCAAGG AATTAAATCC AGCCAAGGGT CAATTTTTCT TGGATATGGA ACGGACTGTT AAAAGTTTTC CTCTCAATCT TGAATCTCCT GGTCAACAAG CATTACTTTC TGCTATACAA ACTATAGGTA TAACAACTGC TGTCAACGAT AATCGTAGTC AGGAATATGA CATTTTTGTG GAACTGAAAA AAGGTAAATA G
|
Protein sequence | MTLPIVSEKK KKKRSLVLTL SAAALLIGVG SFAHWFFTQG RPFSRSLPVG ANIIPQDALF AVSLTTDTKQ WQKLREFGTP ETQKELDKNL VQLRDRWLTN NGYNFEKDIQ PWVGDEVTIA VLPPPVVKPV LKPVATEANI PYEQSMVMVL PIKNPAIAKK MLAQSQTLKQ GKSTELTYRG IAIKQTEGKA GEKLSATLIN QQLLVITDNA KATEKTIDAY KNQTSVATLV GFAENFSKTS SYQPFAQFYI NVPLAAKIAA TAPNRRLPVQ VLAQLQNNQG LAGTLNLESE GVRLKGISWL SPNSQRVLAV ENKAGSMQNR LPSETLMMLS GSNLKRLWAD YVSTSGGNPL APIKPEELRR GVKSLTNLDL DQDLLSWMKG EFSVSVIPNT SRDGSPDNFR AGLVFMVKVS DGKAGVRQSA ETALQNLDDV LKNQYQFKVE SATVGGKPVV NWISPFGTLT ATHGWLDDNV VFFGFGAPIS DKIVPKPNNT LANTLRFQQT VPKELNPAKG QFFLDMERTV KSFPLNLESP GQQALLSAIQ TIGITTAVND NRSQEYDIFV ELKKGK
|
| |