Gene Information |
Locus tag | Aazo_1172 |
Symbol | |
ID | 9338967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 1254711 |
End bp | 1256495 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | family 1 extracellular solute-binding protein |
Protein accession | YP_003720617 |
Protein GI | 298490440 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
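The coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The Python sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1254711
end_bp = 1256495
gene_length_bp = 1785
protein_length_aa = 594

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# One codon is three bases. Assumption: the last codon is a stop codon
# and therefore does not contribute a residue to the protein.
codons = span // 3
residues = codons - 1

print(span == gene_length_bp)         # span agrees with the reported gene length
print(span % 3 == 0)                  # the CDS is a whole number of codons
print(residues == protein_length_aa)  # residues agree with the reported protein length
```

Under those assumptions, all three checks agree with the values reported in the table.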
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATTTA CTGAAAAAAT ACTCAGTTAC ACAAAACGCT TTTTCTTGCT TATCAGCCTC CTTTCGTTTA CAGCCCTGAC AATTACTGCT TGTAATCCAA CTAAACTAAA AACTAGCGCC GCACAAGTAC CACAATTAGT AACCAGTATT CTCAGCGATC CGAAAACCTT TAACTATGCT CTTAATTCTG ATGCAAATAA TATTTTTGGA TATACTTATG AAGGATTGCT TAACCAAAAT CCTATCACTG GTAAACTTGA ACCAAATTTA GCAGAATCAT GGGAAATTTC TGAGGATAAA TTAAAGATTA CTTTCACTCT GCGCAATAAT TTAAAATGGT CTGATGGACA ACCACTCACA TCTGATGATG TTGTATTTAC TTATAATGAT ATTTATCTGA ATGAAGCCAT ACCAACAGAC GTTAGAGATA TTTTGAGAAT TGGGAAGGAT GGCAAATTTC CCAGTATTAA AAAAATTGAT AAAAGACGGG TAGAATTTAG CATACCAGAA CCTTTTAGAC CTTTTTTACA AAACTCTGGT GTACCTATTT TACCTGCTCA TGCCTTACGA GAATCTTTAC AAACTAAAGA CCAAGACGGA AAACCTAAAT TTCTGACAAC ATGGGGTATT GATACTCCAC CTGAACAAAT AATTGTTAAT GGTCCCTTCA AGCTGGAACG TTACGATACT AGTCAGCGGG TAATTCTCCG GCGTAATCCC TATTATTGGC GTAAAGATGC TCAAGGTAAA CTCCAACCTT ATATTGAACG TATTGTTTGG CAAATTGTCG AATCAACGGA TACTTCTTTA TTACAATTTC GTTCTAGTGG TTTAGATGCT GTGGGGGTAG CACCAGACTA TTTTTCTTTA TTAAAGGTAC AAGAAAAACA AGGTGATTTC CAAATTTATA ATGGAGGACC TTCTACAAGT ACAAGTTTTG TGCTTTTTAA TTTGAATCAA GGAAAAAGAA ACGGTAAATT ACTCATAGAC CCAATTAAAT CACGTTGGTT TAATAATGTA GATTTTCGCC AAGCTGTAGC TTATGCAATT GATAGACGAA CCATGATTAA TAATACATTT CGTGGTTTGG GTAAACCGCA AAATTCACCC ATTTCTGTGC AGAGTCCCTA TTACCTTTCT CCAGAAGCAG GACTAAAAGT TTATAACTAT AATCCCGAAA AATCTAAGGA ATTATTACTC AGATCTGGGT TTAAATACAA TGCTCAAAAT CAGCTAGAAG ATGCTCAAGG AAACCATGTT AGGTTTGCTT TACTTACCAA TGCTGGTAAC AAGATTCGTG AAGCAATGGG TTCACAAATT AAACAGGATT TGAGTAAAGT TGGTATCCAA GTTGATTTTA CTCCTTTAGC ATGGAATACT TTTATAGATA AGCTATCTAA TACTTTAGAT TGGGAAGCTT CTTTACTCGG TTTGACTGGT GGATTAGAAC CAAATGATGG GGCCAATGTA TGGTCTACTG AAGGTGGATT ACATATGTTT AATCAAAAAC CCCAACCAGG ACAAAAACCA ATAGAAGGGT GGAAAGTTTC ACCATGGGAA GCGAAGATTC ATGAATTTTA TATTCAAGGC GCACAGGAAC TTGATGAAGC AAAGGTGACA GAAATTTATG CAGAAGTTCA ACGATTAACA CAGGAGAATT TACCATTTAT TTACTTAGTA AATCCCTATT CTCTTTACGC AATCCGAAAC CGTTTTCAGG GAATTAGATT TTCTGCTTTG GGTGGTCCAT TTTGGAACAT TCATGAAATT AAAATTACAA AATAG
|
Protein sequence | MTFTEKILSY TKRFFLLISL LSFTALTITA CNPTKLKTSA AQVPQLVTSI LSDPKTFNYA LNSDANNIFG YTYEGLLNQN PITGKLEPNL AESWEISEDK LKITFTLRNN LKWSDGQPLT SDDVVFTYND IYLNEAIPTD VRDILRIGKD GKFPSIKKID KRRVEFSIPE PFRPFLQNSG VPILPAHALR ESLQTKDQDG KPKFLTTWGI DTPPEQIIVN GPFKLERYDT SQRVILRRNP YYWRKDAQGK LQPYIERIVW QIVESTDTSL LQFRSSGLDA VGVAPDYFSL LKVQEKQGDF QIYNGGPSTS TSFVLFNLNQ GKRNGKLLID PIKSRWFNNV DFRQAVAYAI DRRTMINNTF RGLGKPQNSP ISVQSPYYLS PEAGLKVYNY NPEKSKELLL RSGFKYNAQN QLEDAQGNHV RFALLTNAGN KIREAMGSQI KQDLSKVGIQ VDFTPLAWNT FIDKLSNTLD WEASLLGLTG GLEPNDGANV WSTEGGLHMF NQKPQPGQKP IEGWKVSPWE AKIHEFYIQG AQELDEAKVT EIYAEVQRLT QENLPFIYLV NPYSLYAIRN RFQGIRFSAL GGPFWNIHEI KITK
|
| |
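The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch in Python. Although the gene lies on the minus strand, the sequence as listed begins with ATG and ends with TAG, so it appears to be given in coding orientation, and the sketch translates it directly. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code. The function `translate` and the variables `gene_start` and `gene_end` are hypothetical names introduced for illustration. Only the first 30 and last 15 bases of the gene are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 assigns
# the same amino acids to codons as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence, copied from the record.
gene_start = "ATGACATTTA CTGAAAAAAT ACTCAGTTAC"
gene_end = "AAAATTACAA AATAG"

print(translate(gene_start))
print(translate(gene_end))
```

The two printed strings can be compared with the first ten and last four residues of the protein sequence in the record.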