Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_3440 |
Symbol | |
ID | 9341244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 3507647 |
End bp | 3509881 |
Gene Length | 2235 bp |
Protein Length | 744 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | family 57 glycoside hydrolase |
Protein accession | YP_003722198 |
Protein GI | 298492021 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCACC CTCTATACGT CGCTTTTATC TGGCATCAAC ATCAGCCGTT ATATAAATCT CCTCGTAGTG GTGTTTCTAC ACCTGCTAGT CAGCAGTACC GCTTGCCTTG GGTTAGGTTA CATGGTACTA AGGATTATTT GGATTTAATC TTGCTGCTAG AGCAGTATCC CAAGTTACAT CAAACGGTGA ATTTAGTACC ATCCTTGATA CTGCAACTAG AAGATTATAT TGCTGGTACT GCGTTTGACC CTTATCTGAC TGCCAGCTTA ACACCTGTTG AAAAGTTAAC TCAGGAACAG AAAGAATTTA TTGTTGAACA TTTTTTTGAT GCTAATCACC ATACTTTAAT AGATCCCCAT CCCCGCTATG CCCAGTTGTA CTATCAGAGA CAGGAAAAGG GACAAGCGTG GTGTTTAGCA AATTGGCAGC CACTAGATTA CAGTGACTTA TTAGCTTGGC ATAATCTAGC TTGGATTGAT CCTCTGTTTT GGGATGACCC AGAAATTGAA GCTTGGTTAA AACAGGGACA AAATTTTACT TTAAGTGATC GCCAACGCAT TTATTCTAAA CAACGTCAAA TCCTCAGCCG CATTATTCCC CAACACAGGA AAATGCAAGA AACTGGACAG TTAGAAGTCA CCACCACCCC CTATACTCAC CCAATTTTGC CCTTGTTAGC TGATACCAAC TCCGGGCAGG TAGCAGTGCC AAACATGACA TTACCTAACA ATCATTTTCA GTGGGCGGAA GATATTCCTC GTCATTTACA GAAATCTTGG GATTTATATA AAGACAGATT TGGACAAGAA CCACGGGGTT TATGGCCTTC TGAACAGTCA GTTAGTCCAG AAATATTACC GTATATTATT AAACAGGGCT TTAATTGGAT TTGCTCAGAT GAAGCCGTCT TAGGTTGGAC CTTAAAACAC TTTTTCCATC GAGATGGGGC AGGAAATGTC CAGCAACCAG AATTACTGTA CCGTCCTTAT CTTTTGCAAA CTCCAGCAGG TGATTTATCC ATAGTTTTCC GTGACCATAG GTTGTCAGAT TTAATTGGTT TCACATACAG TTCCATGCAG CTAAAACAGG CCGTAGCGGA TTTAGTGGGA CATTTGCAAG TGATCGCTAA AATGCAAAGA GAGAAACCCA GCGAACAACC TTGGTTAGTA ACCATCGCCT TAGATGGTGA AAATTGCTGG GAATTTTATC CCCAAGACGG CAAACCATTC CTAGAAACCT TATATCAAAG CTTGAGTAAT GAACCTCATA TCAAACTGGT TACCGTCTCG GAATTCCTAG ACAAATATCC CGCCACAGCC ACTATCCCCG GAGAACAACT CCATAGTGGT TCTTGGGTAG ATGGCAGTTT TACCACCTGG ATAGGAGATC CCGCCAAAAA TCGCGCTTGG GACTACCTGA CCCAAGCCAG ACAAGTATTA GCCAATCATC CCGAAGCTAC CGAAGACAAC AACCCTGCAG CCTGGGAAGC CTTATATGCA GCCGAAGGTT CCGATTGGTT TTGGTGGTTT GGAGAAGGAC ATTCTTCAAA TCAAGATGCC ATGTTTGACC AATTATTTCG TGAACATCTC TATGGAATTT ATAAAGCCCT CAATGAACCA ATACCAGTCT ATTTAACAAA ACCAGTAGAA GTCCATGAAA CACGAGCAGA CCGTCGGCCA GAAGCCTTTA TTCACCCAGT TATTGACGGT AAAGGTGATG AACAAGACTG GGACAAAGCC GGCAGAATAG AAGTTGGTGG TGCAAGGGGA ACAATGCACA ACAGCAGCCT CATTCAACGA CTTTGGTATG GAGTAGATCA CCTGAATTTT TACTTACGGG TAGATTTTAA AAGTGGCATT GCACCAGGAA AAGAACTACC AACAGAGTTA AATTTACTTT GGTATTATCC AGATAGAACA ATGGTTAATA GCCCTGTAAC TTTAGCAGAA GTTCCAGATA TATCACCAGT TAATTATCTG TTTCACCATC ATCTAGAAAT TAATTTACTC ACACAATCAA TTCAATTTCG AGAAGCAGGA AATAACTATC AATGGTATCC CCGCGTTAGT CGCGCCCAAG CTGCTTTAAA TACTTGTTTA GAAGTGGCAA TACCTTGGGC AGATTTGCAA GTTCCCCCAG ATTATCCCCT GCGTCTGATT TTGGTACTAG CCGATGATGG GTGTTTCCAT AGCTATTTAC CAGAAAATGC TTTAATTCCT ATTGAAGTAC CTTAG
|
Protein sequence | MSHPLYVAFI WHQHQPLYKS PRSGVSTPAS QQYRLPWVRL HGTKDYLDLI LLLEQYPKLH QTVNLVPSLI LQLEDYIAGT AFDPYLTASL TPVEKLTQEQ KEFIVEHFFD ANHHTLIDPH PRYAQLYYQR QEKGQAWCLA NWQPLDYSDL LAWHNLAWID PLFWDDPEIE AWLKQGQNFT LSDRQRIYSK QRQILSRIIP QHRKMQETGQ LEVTTTPYTH PILPLLADTN SGQVAVPNMT LPNNHFQWAE DIPRHLQKSW DLYKDRFGQE PRGLWPSEQS VSPEILPYII KQGFNWICSD EAVLGWTLKH FFHRDGAGNV QQPELLYRPY LLQTPAGDLS IVFRDHRLSD LIGFTYSSMQ LKQAVADLVG HLQVIAKMQR EKPSEQPWLV TIALDGENCW EFYPQDGKPF LETLYQSLSN EPHIKLVTVS EFLDKYPATA TIPGEQLHSG SWVDGSFTTW IGDPAKNRAW DYLTQARQVL ANHPEATEDN NPAAWEALYA AEGSDWFWWF GEGHSSNQDA MFDQLFREHL YGIYKALNEP IPVYLTKPVE VHETRADRRP EAFIHPVIDG KGDEQDWDKA GRIEVGGARG TMHNSSLIQR LWYGVDHLNF YLRVDFKSGI APGKELPTEL NLLWYYPDRT MVNSPVTLAE VPDISPVNYL FHHHLEINLL TQSIQFREAG NNYQWYPRVS RAQAALNTCL EVAIPWADLQ VPPDYPLRLI LVLADDGCFH SYLPENALIP IEVP
|
| |