Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_1356 |
Symbol | |
ID | 9339151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 1428889 |
End bp | 1430097 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | |
Product | cysteine desulfurase NifS |
Protein accession | YP_003720735 |
Protein GI | 298490558 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAAAG ATTGCATCTA TCTGGATAAT AATGCTACCA CTAAGGTAGC TCCCCAGGTG ATTGAGGCAA TTATGCCTTT TCTGACTGAC TATTACGCTA ATCCTTCTAG TATGCACACC TTTGGTGGGC AATTAGCTAA AAGTGTGAAG GTAGCTAGAG AACAAATTGC AGCTTTAATC GGTGCTGAAG AATCGGAAAT AGTCTTTACT AGCTGTGGAA CTGAAGGTGA TAATACAGCT ATTCGTGCTG CTTTATTAGC TCAACCAGAA AAACGTCATA TCATCACTTC CCAAGTAGAA CATCCCGCAG TTTTAAATGT CTGCAAACAA TTAGAAACTC AAGGTTATCA AGTTACCTAT TTGTCAGTTA ATGATAAGGG ACAAATTGAC TTAAATGAGT TAGAAGCTTC TCTAACTGGC AACACTGCTT TAGTGACAGT GATGTATGCC AATAATGAAA TCGGCACTAT TTTTCCAATT GAAGAAATTG GTGCTAGAGT TAAAGAATAT GGAGCAATCT TTCATGTTGA TGCAGTACAA GCGGTAGGTA AAGTACCCTT GAATATGAAA ACCAGCACCA TAGATATGTT AACTATGTCT GGTCACAAAA TTCATGCGCC CAAAGGTATT GGTGCTTTGT ATGTGAAACG TGGTGTAAGA TTCCGTCCCT TCTTAATAGG TGGACATCAA GAAAGAGGAC GGAGGGCAGG TACAGAAAAT GTTCCCGGAA TTATCGCTTT AGGTAAGGCA GGGGAACTGG AAATGCTACT CTTGGAAGAG GCGACTAAGA GAGAAAGAAA ACTGCGCGAT CGCCTCGAAC AAACTTTACT TGCTAACATT CCCGATTGTG TAGTCAATGG TGATGTGAAA AATAGATTAC CAAATACTAG TAACATCGGT TTCAAATATA TCGAAGGTGA AGCAATTCTC CTACTTCTGA ATAAACACGG TATTTGTGCT TCCTCTGGTT CTGCTTGTAC TTCTGGTTCA CTTGAACCCT CCCATGTTTT GAGAGCAATG GGTTTGCCCT ACACTACTTT ACATGGTTCA ATTCGCTTCA GTCTTTCTCG CTACACCACC GAAGCAGAAA TTGATCAAGT AATTGCAATC ATGCCAGAAA TTGTTGAACG TCTCCGCGCC TTATCTCCCT TCAAAAATGA TGACGCGCGT TGGTTACAAC AGCAAGAGCA CACTTTAGTA AATCGGTAG
|
Protein sequence | MLKDCIYLDN NATTKVAPQV IEAIMPFLTD YYANPSSMHT FGGQLAKSVK VAREQIAALI GAEESEIVFT SCGTEGDNTA IRAALLAQPE KRHIITSQVE HPAVLNVCKQ LETQGYQVTY LSVNDKGQID LNELEASLTG NTALVTVMYA NNEIGTIFPI EEIGARVKEY GAIFHVDAVQ AVGKVPLNMK TSTIDMLTMS GHKIHAPKGI GALYVKRGVR FRPFLIGGHQ ERGRRAGTEN VPGIIALGKA GELEMLLLEE ATKRERKLRD RLEQTLLANI PDCVVNGDVK NRLPNTSNIG FKYIEGEAIL LLLNKHGICA SSGSACTSGS LEPSHVLRAM GLPYTTLHGS IRFSLSRYTT EAEIDQVIAI MPEIVERLRA LSPFKNDDAR WLQQQEHTLV NR
|
| |