Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_2042 |
Symbol | |
ID | 9339835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 2121779 |
End bp | 2123287 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | UbiD family decarboxylase |
Protein accession | YP_003721222 |
Protein GI | 298491045 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.527905 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGAG ATTTACGGGG ATTTATCAAA ATCCTAGAAC AAAGAGGACA ATTAAAGCGA ATTTCAGCTT TAGTTGACCC AAATATGGAA ATTGCCGAAA TTTCCAACCG GATGCTACAA AAAGGTGGGA CAGGATTAAT CTTTGAAAAT GTCAAAGGTG CATCTTTCCC CGTTGCTGTC AATTTAATGG GGACACTGGA AAGGATATGC TGGGCGATGA ACATGGAAAA ACCAGAGGAA TTGGAAACCT TGGGAAAGAA ACTGAGTATG CTTCAGCAAC CCAAACCACC TAAAAAGATT TCCCAAGCGA TAGACTTTGG GAAAGTGCTG TTTGATGTAG TGAAAGCGAA ACCAGGAAGG GATTTTTTTC CTGCTTGTCA ACAGGTGGTA GTGGAAGGGG ATAATGTAGA TTTAAATAAG TTGCCGTTGA TACGTCCTTA TCAGAGAGAT GCCGGAAAAA TTATTACGCT AGGATTGGTA ATTACCAAGG ATTATGACAC AGGAACGCCC AATGTTGGTG TATATCGGCT ACAACTGCAA TCTAAGAACA CCATGACAGT ACACTGGTTA TCAGTGCGGG GTGGTGCGAG ACATTTACGC AAAGCAGCGG AATTTGGTAA AAAATTAGAA ATTGCGATCG CACTTGGTGT TGATCCTTTA ATTATTATGG CAGCAGCCAC ACCCATTCCT CTAGACTTAT CAGAATGGTT ATTTGCAGGG CTTTATGGTG GTTCAGGGGT ACAATTAGCC AAGTGTAAAA CAGTAGATTT AGAAGTTCCC GCAGATTCAG AATTTGTCTT AGAAGGAACA ATTACACCAG GGGAAGTTTT ACCCGATGGA CCCTTTGGCG ATCACATGGG ATATTATGGT GGCGTGGAAG ATTCGCCATT GGTACGCTTC CAGTGTATGA CTCACCGCAA AGATCCAATT TATCTGACTA CATTTAGCGG TCGTCCACCC AAAGAAGAAG CTATGATGGC GATTGCACTC AACCGCATCT ATACCCCTAT ATTACGGCAA CAAGTATCAG AAATAGTCGA TTTCTTCCTA CCCATGGAAG CATTAAGTTA CAAAGCTGCG ATTATTTCTA TAGATAAAGC TTACCCTGGA CAAGCAAGAA GAGCAGCCTT AGCTTTTTGG AGTGCATTAC CCCAATTCAC ATACACTAAA TTTGTGATTG TTGTTGATAA ACATATCAAC ATTCGTGATC CACGTCAGGT TGTCTGGGCA ATTAGTTCTA AAGTAGACCC TTCACGGGAT GTATTCATAT TACCAAAGAC ACCCTTTGAC ACCTTAGACT TTGCTAGTGA AAAACTCGGG TTAGGGGGCA GAATGGGCAT AGATGCAACT ACCAAAATTC CCCCAGAAAC TGAACATGAA TGGGGTGAAC CATTAGAATC AGATCCTGAT ATTGCTGCAA TGGTAGAAAG ACGCTGGGCA GAATATGGTT TAGCAGATTT AAAACTAGGA GAAGTAGACC CCAATTTGTT TGGTTATGAT ATGAAGTAA
|
Protein sequence | MARDLRGFIK ILEQRGQLKR ISALVDPNME IAEISNRMLQ KGGTGLIFEN VKGASFPVAV NLMGTLERIC WAMNMEKPEE LETLGKKLSM LQQPKPPKKI SQAIDFGKVL FDVVKAKPGR DFFPACQQVV VEGDNVDLNK LPLIRPYQRD AGKIITLGLV ITKDYDTGTP NVGVYRLQLQ SKNTMTVHWL SVRGGARHLR KAAEFGKKLE IAIALGVDPL IIMAAATPIP LDLSEWLFAG LYGGSGVQLA KCKTVDLEVP ADSEFVLEGT ITPGEVLPDG PFGDHMGYYG GVEDSPLVRF QCMTHRKDPI YLTTFSGRPP KEEAMMAIAL NRIYTPILRQ QVSEIVDFFL PMEALSYKAA IISIDKAYPG QARRAALAFW SALPQFTYTK FVIVVDKHIN IRDPRQVVWA ISSKVDPSRD VFILPKTPFD TLDFASEKLG LGGRMGIDAT TKIPPETEHE WGEPLESDPD IAAMVERRWA EYGLADLKLG EVDPNLFGYD MK
|
| |