Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4096 |
Symbol | |
ID | 9341901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4160349 |
End bp | 4161599 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | carbohydrate kinase |
Protein accession | YP_003722667 |
Protein GI | 298492490 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.70469 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTCT ACTTGGGTAT TGATTTTGGT ACATCTGGGG CACGAGCAGT GGTGATTGAT GATGAAGCTA GGATTGTGTC CCAGATGCGT CATCCTTGGA CAAATATTGC AGATTGGGTA AGTTGCTGGA AAGAGGCTTT ATGGAGTCTT CTAGAGGCAA TTTCCCTGGA GCTCAAGGGA GAAATTCGTG CGATCGCTAT TAACGGTACT TCTTCAACAA TCTTGCTCAC TGATGCCACT GGCCAGCCTG TAGATCTCCC CTTATTGTAT AACGATGCAC GGGGATCAAT GGTGCTAAAG GATTTGAGCC ACATTGTACC ACCCAATCAC ACCGTATTAA GTGCTACTTC TAGCCTTGCC AAACTGCTGT GGATGGAACA TTTACCTTCT TTTGGGCAAG CTAGATATTT ATTACATCAA GCTGATTGGC TGGCATTTCT TCTGCATGGG CAATTAGGTA TTAGCGACTA CCATAATGCT CTCAAATTGG GTTACGACGT GGAAAAGCTT CAATATCCAG AATGGCTAGA AAAACTGCAA ATTCCAATTA CTTTGCCCCA AATTTTAGCA CCTGGCACAC CAGTAGGCCA ATTAAATCCA GAAATTGCTG CTAAATTTGG TTTTAGGCAT GATTGCTTGG TATGTGCAGG GACAACAGAC AGCATTGCAG CTTTTTTGGC TAGTGGTGCA AAATCACCAG GGCAAGCAGT AACTTCTTTG GGTTCAACCT TGGTACTTAA ACTTTTGAGC CGTACTCGCA TAGAAGATGC GCGATATGGT ATTTATAGCC ATCGCTTAGG TGATTTATGG TTAACTGGAG GAGCTTCTAA TACAGGATGT GCAGTGTTGA AGAAATTTTT TACTGATGAA GAATTAGTCA GCTTGAGTCG GGAAATTGAT GTATCGAAAG CCAGTGAATT AGATTATTAT CCATTATTGA AAGAGGGTGA TCGATTTCCC ATTAATGATC CCCACCTACC TCCCAAATTA GAACCCCGTC CAGATAATCC CAGAGAATTT TTACACGGCT TACTAGAAAG TATGGCACGA ATAGAGGCAA GAGGGTACAA GTTATTACAG GAACTAGGAG CAGATCAATT AAAGCACGTT TATACTGCTG GTGGTGGGGC GGCAAATTCC ACTTGGACTG CTATTAGAGG TAGGTATTTA AACATCCCAA TAGCTTCCTC TATCAATACA GAAGCAGCCT ATGGAACTGC GCTTTTAGCA ATGCAGCAAT TAAAAATATG A
|
Protein sequence | MTFYLGIDFG TSGARAVVID DEARIVSQMR HPWTNIADWV SCWKEALWSL LEAISLELKG EIRAIAINGT SSTILLTDAT GQPVDLPLLY NDARGSMVLK DLSHIVPPNH TVLSATSSLA KLLWMEHLPS FGQARYLLHQ ADWLAFLLHG QLGISDYHNA LKLGYDVEKL QYPEWLEKLQ IPITLPQILA PGTPVGQLNP EIAAKFGFRH DCLVCAGTTD SIAAFLASGA KSPGQAVTSL GSTLVLKLLS RTRIEDARYG IYSHRLGDLW LTGGASNTGC AVLKKFFTDE ELVSLSREID VSKASELDYY PLLKEGDRFP INDPHLPPKL EPRPDNPREF LHGLLESMAR IEARGYKLLQ ELGADQLKHV YTAGGGAANS TWTAIRGRYL NIPIASSINT EAAYGTALLA MQQLKI
|
| |