Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_0812 |
Symbol | |
ID | 9338600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 858188 |
End bp | 859228 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | |
Product | thiamine-monophosphate kinase |
Protein accession | YP_003720358 |
Protein GI | 298490181 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATAGTG ATTTATCTTA CCTACAAGTT CGAGATATTG GTGAACAAGG TATTTTAGAA AGATTACAAC GCTTTTGTCC ACCAGAAATT ATTGGTGATG ATGCGGCGGT TTTGGCTATA ACAGCACAAG AATCTTTGGT TGTTACCACA GATATGCTAG TTGATGGCGT GCATTTTAGT GATATTACCA CCTCACCAGA AGATACTGGT TGGCGTGCAT CTGCCGCAAA TTTATCAGAT TTAGCTGCTA TGGGTGCTTT TCCCTTGGGT ATCACCGTCG GGTTAGGGTT ACCTGGAGAT TTAGCTGTGA GTTGGGTGGA AAGACTATAT CAGGGAATGA CAGAATGCTT GCAAAAGTAC AATGTGCCTA TCCTTGGTGG TGATATTGTA CGATCGCCCA TCACCACTTT AGCAATTACC GCCTTTGGTC AAGCCAACCC CAATTTCATT ATCCGTCGTT CAACGGCACA GGTGGGAGAT GCAATCATCG TCACAGGTAT ACACGGAGCC TCCCGTGCAG GCTTAGAATT ACTCCTCAAT CCCCAAATAT GTCAAAACCT CGAAAGTGAA GAAAAAGCAG CTTTAATCAA AGCACACCAG CGCCCCAACC CCCGTTTGGA TGTCATACCC ATCCTTCAAG AAATTTTCAC ATCCCCCTGC CCTATTTCCA TCTCTGGTAT GGACAGTAGC GATGGTTTAG CAGACGCTGT ATTACAAATC TGCCGTGCTA GTAGAGTAGG TGCGATTTTA GAACAGAGTA AAATTCCTCT ACCATCAGCT TTTCATAAGT GGCTAACACC AGGGCAATGG CTTCACCCGC CGCAGGCATG GCTAAATTAT GCCCTATACG GTGGCGAAGA TTTTGAATTA GTCCTGTGTT TACCACCACC AGCCGCATTA ACATTAGTCC AAAAATTAGG TGCAGGTGCA TCCATAATAG GCACAATTAC ACCAGGATTA ACAGTCATAT TACACCATGA AAACGAAAGA ATCCCCGACC AAGCCCTAAG TCTCAGTCAG GGATTTCAAC ACTTTAGTTA G
|
Protein sequence | MNSDLSYLQV RDIGEQGILE RLQRFCPPEI IGDDAAVLAI TAQESLVVTT DMLVDGVHFS DITTSPEDTG WRASAANLSD LAAMGAFPLG ITVGLGLPGD LAVSWVERLY QGMTECLQKY NVPILGGDIV RSPITTLAIT AFGQANPNFI IRRSTAQVGD AIIVTGIHGA SRAGLELLLN PQICQNLESE EKAALIKAHQ RPNPRLDVIP ILQEIFTSPC PISISGMDSS DGLADAVLQI CRASRVGAIL EQSKIPLPSA FHKWLTPGQW LHPPQAWLNY ALYGGEDFEL VLCLPPPAAL TLVQKLGAGA SIIGTITPGL TVILHHENER IPDQALSLSQ GFQHFS
|
| |