Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4196 |
Symbol | |
ID | 9342001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4266435 |
End bp | 4267436 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | family 1 extracellular solute-binding protein |
Protein accession | YP_003722726 |
Protein GI | 298492549 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGTC GAGCTATTGT TAACAGATTA TCTCAACGTG CGATCGCAGC TACAGGAGTG GCCATAGTCG GTGGCTGTCA AAAAGCCCAA ACTCAAGGCG CAAAGGGACA AGCAGACTCC AGCAACTTAC CCACCATTAA ATGGCAAATG GCTACTAGTT GGCCATTATC CTTAGAAACA ATCTTTGGCG GAGCGCAGGT TTTAGCAGAT CGTGTCAAAA CCCTAACCAA CGGTAAATTT ATCATTGAAC CCCGTGCAGC AGGAGATATA GCCCCCGGTT TAGAAGTTCT CAATGTTGTT TCTCAAGGTG CAGTGCAAGC AGGACATACT GCCGCTTATT ACTACATTGG TAAAAGTCCC TCCTTAGCTT TTGGTACATC AGTACCTTTT GGACTCAATG CCCAACAACA AAATGCTTGG TTATATGAAG GGGGCGGTTT AGCCAAACTG CAAGAAATTT ATGCCCGGAA ATTTAATGTG ATTCAATTTC CGGCGGGAAA TACGGGTACA CAAATGGGAG GATGGTTTCG TAACGAAGTC AAAACACTCA ATGACCTCAA GGGTCTGAAA ATGCGTATTC CCGCTTTAGG TGGACAAGTT ATGGCCAAAC TAGGGGTAAC AGTGCAGACT CTACCAGGGG GGGAAATCTT CCAAGCCTTA CAAACAGGGG CTATTGATGC CGCTGAGTGG GTGGGACCTG ATGATGATGA AAAATTAGGC TTAAATAAAG TCGCTAAATT TTACTATTAT CCTGGTTGGT GGGAACCAGG CCCAACTCTA GAAGTACAAA TTAATTTAGA CCAATGGAAA AAGCTACCAC CTCTCTATCA AGCAGCTTTA GAAACGGCTG CATATCAGTC GAATACAACA ATGTTAGCTC GTTATGATGC CCAGAATAGT CAGGCATTAG AGAAACTATT AAAAACTGGG ATTCAAGTTC GTGCTTATAG TCAGGAAATT TTGGAAGCAG CGGAAAAAGC TTCTTTTGCT TTATATGATT AA
|
Protein sequence | MKRRAIVNRL SQRAIAATGV AIVGGCQKAQ TQGAKGQADS SNLPTIKWQM ATSWPLSLET IFGGAQVLAD RVKTLTNGKF IIEPRAAGDI APGLEVLNVV SQGAVQAGHT AAYYYIGKSP SLAFGTSVPF GLNAQQQNAW LYEGGGLAKL QEIYARKFNV IQFPAGNTGT QMGGWFRNEV KTLNDLKGLK MRIPALGGQV MAKLGVTVQT LPGGEIFQAL QTGAIDAAEW VGPDDDEKLG LNKVAKFYYY PGWWEPGPTL EVQINLDQWK KLPPLYQAAL ETAAYQSNTT MLARYDAQNS QALEKLLKTG IQVRAYSQEI LEAAEKASFA LYD
|
| |