Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_1067 |
Symbol | |
ID | 9338863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 1140613 |
End bp | 1142103 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003720547 |
Protein GI | 298490370 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.363484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCACCG GGTACATCCT GATAGCAGCT ATTTTGATTT TGGGAGGTGT GATTGCTACA GTGGGCGATC GTATCGGCAC ACGAGTTGGC AAAGCACGCC TCTCACTATT TAAGCTGCGT CCCAAAAATA CGGCAGTGCT GGTAACTATT TTTACTGGTG GTCTAATTTC TGCCTCAACC CTAGCAATTT TATTTGCTGC TGATGAAGGA TTGCGGAAGG GCGTCTTTGA GTTAGAGGAT ATTCAAAAAG ACCTGAGAAA CAAGCGAGAA CAACTTAAAA CCGCAGAAAC GGAAAAAAGC CAAGTAGAAA AACAGCTGAC CGAAGCTAGA AAAGAACAAA CTCAGGCACA ACAAGATTTA CAAAACATTA ATCAGTCTTT ACAAGCTGCC AATGCTAAAC AACGCATAAC ACAAGCTCAA CTCAACCGCA CCATTAGTCA ACAAGCTAAA ACCCAAGCTC AACTGCAAGG TACTCAAAGC CGACTTGGTG AAGTAGTGAT ACAGTATAAA CAAGCTAGAA CTGAACTACA AACCCTTTAT AATCAACGTC AGGCATTGCA AACAGCAGTT GAAGAATTAA AGACAGAACG GCAGCGACTA TATGCAGAAG CGAAAAAAGC GATTGACGAA GCAAAAACAG CTATTGAAAA ACGCGATCAG GAACTTGCTA ATCGCCAAAA AGCTATTGAA AAACGTGATC AGAAAATTGC TAAATTAGAT CAACTAATTC AAAATCGTAA TCTAGAGATT AAAAAACGGG AGCAAGTAAT TGCTACTAGG GAATCTCGTC TCAAAGAATT GGAACAACAG CAAGATTATT TAGAACAAGA AGTAGCAAGG CTGGAAAAAT ATTACCAGTC ATATCGTGAC CTGCGTTTAG GTAAATTGGC TTTAGTTCGT GGACAAGTTT TAGCTGCTGG TGTGGTACAA GTTAATCAAC CTACTGCGGC TCGTCAGGTA CTAGTCCAAA TTTTGCAGGA AGCTAACCGC AATGCCAACA TTGAATTAAG CGAACCTGGT TCTAATCCTG GGAATGCAGA ACTATTGCGT GTTACTCAAG ATAGGGTTGA GCAACTAATC AATCAAATCG ACGATGGAAG AGAATATGTA GTGCGAATCT TCTCTGCGGG TAATTACGTT AGGGGAGAAA AGCAGATAGA ATTTTTTGCT GATGCTACGC GCAATCAATT GGTATTTTCC ACAGGTCAAA TCCTGGCTAC AACTGCGGCT GATATGAAAA ACATGACATC ATATCAATTA AGGCAACGGC TGGACTTGCT GATTTCTGCT TCCCAATTTC GCGCTCGCAA TGCAGGAATT ATCGAAACTG TACAAGTAGA GGGTACTTTT CTGCGCTTTT TCGCCCAATT GCAACAGTCT AATCAACCAT TAGAAATTAA AGCTATAGCT GCGGAGGATA CCTATACCGC TGGACCTTTA AGAGTGAAAT TAGTGGCAAT TTTCAATGGT CAGGTTATTT TCAGCACTTA A
|
Protein sequence | MATGYILIAA ILILGGVIAT VGDRIGTRVG KARLSLFKLR PKNTAVLVTI FTGGLISAST LAILFAADEG LRKGVFELED IQKDLRNKRE QLKTAETEKS QVEKQLTEAR KEQTQAQQDL QNINQSLQAA NAKQRITQAQ LNRTISQQAK TQAQLQGTQS RLGEVVIQYK QARTELQTLY NQRQALQTAV EELKTERQRL YAEAKKAIDE AKTAIEKRDQ ELANRQKAIE KRDQKIAKLD QLIQNRNLEI KKREQVIATR ESRLKELEQQ QDYLEQEVAR LEKYYQSYRD LRLGKLALVR GQVLAAGVVQ VNQPTAARQV LVQILQEANR NANIELSEPG SNPGNAELLR VTQDRVEQLI NQIDDGREYV VRIFSAGNYV RGEKQIEFFA DATRNQLVFS TGQILATTAA DMKNMTSYQL RQRLDLLISA SQFRARNAGI IETVQVEGTF LRFFAQLQQS NQPLEIKAIA AEDTYTAGPL RVKLVAIFNG QVIFST
|
| |