Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_2972 |
Symbol | |
ID | 9340776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 3055446 |
End bp | 3057917 |
Gene Length | 2472 bp |
Protein Length | 823 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | filamentous hemagglutinin family outer membrane protein |
Protein accession | YP_003721898 |
Protein GI | 298491721 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTAA CTTTTTCTAT CTTTGCTTTA ATTAGCGTCG TATTAACATC CGTCACTTAT AACAGTCGCG TTCAGGCAAA AGTGACTCCC GACAGCAAAC TCAAAACTAC TGTGACTGGC AGTAATAATT ATACCATCAC CAATGGTAAC CGTGTCGGCA ATAATTTATT CCATAGCTTT AGTGAATTCT CTATTCCTAC TAACGGATCT GCATTCTTCG ATAATGCTAA CGATATCCAG AATATTTTTA GCCGCGTGAC TGGCGGCAAT CTTTCCAATA TTGATGGCTT GATTCAAGCC AATGGTAGTG CCAATTTATT CTTACTCAAC CCATCTGGGA TTATTTTTGG TGCAAATGCC CGCTTAGATA TTGGTGGTTC ATTTGTAGGA ACAACTGGCA ATAGTATTAA GTTTGCTGAT GGCACAGAAT TTAGCGCAGT CAATGCGAGT AGTTCGCCAT TGTTAACCAT GAGCGTACCG ATAGGCTTGC AAATGGGTCA AGATTCGGGA GCCATCACAG TCCAAGGGTT AGGGCATCGC ATTATACCAC CATTCTCTGT AGCACAGGAA TTGGATCTCA GCAATAATCC CACTGGGTTA CAAGTCAAGG CAGGTAATAC CCTGGCACTC ATTGGTAGTG GGCTAAATTT CGCGGGGGGC ATTGTGGCAG CAGACGGAGG TGGACATATA GAAATAGGGA GTATCAATCA TGGGCTAGTC AGACTCAATT CTACAGTGAC AGGATGGAAG GGAGATTACT CACAAGTAGA ACAGTTTAAT GATATCCATC TAGCCCAACA ATCTCTACTA GATGCTAGTG GCAGCAATGG TTCAATTCAA CTACAGGGGC AAAATATCAA CTTAACTGAA GGTTCTACTG TAGTAATACA AAACTTGGGG ACACAATCGC AAGGAATTAC TGTCCACGCC ACAGGTTCTT TGAATTTGAC AGGCTATACT CCCGACCAAA AACAGGGCAG TATAATCGCA ATCGAAAACT TGGGAACAAG TTCATCAGGA GATATTGTAG TTTCCGCCAA TCAACTTTTC GTACAAGATG GTGGACAGAT TCAGACTTTC ACTCCTACCG CAGCAGCCAG CGGGAATATT TCAATTGATG TTGAAGACTT GATTTACCTG AATGGTTTTA TCCCTACTAA CCCGACTGTG AACACCAAGA TCATAACAAT CACGGATGGC TCTGGTAAAG CTGGTGATAT TACCATTTCA TCGGGCAGCT TAAAAGTTTT CAATGGGGCT AGTCTCATTT CTGTGACAAT GGGTTATGGG GAGGGCGGAG CGATGCAGAT CAATGCCAAA GACCTCATCG AGATTGTCGG TAACAATCCC ATTATCTTAG TACCTAGTGC AATCTCTTCG GCAACGATTG GTACCGGCAA TGCAAATAGC ATATTGGTTA ACACATCCAG ATTAATCCTT AGAGATGGGG GAGTTCTGGG TTCTAACACT CTGAGTCAGG GTAGAGCGGG AAATGTAACA GTTAATGCTT CAGATTTCCT AGAGGTTAGT GGTAAAGCAC CTGGATCAAT TAGATCGAGT AACATTATGT CCTCATCTGA GATTCTCGAT CAAGTTGTTC AAGAGACTTA TGGACTACCG TCAATTCCCA CTGGTGATGC CGGTTTTCTG ACGATTAATA CCCCATCATT ACGCATTAGT GATAGTGCAT TCGTGAGTGT TAAGAATGAT GGACCTGGCA GAGCTGGAGA TTTACAAATT AACGCTAATT TGCTTTTTCT ATACAAAGAA GGTAGTATCA GTGCATCTAC TGCTTCAGGA AATGGAGGTG ATATTCAGTT AAACTTACAA GATTATCTGT TGATGTATCA AGATAGTGTT ATTTCCGCTA CTGCTCAGGG TAATGGAAAT GGCGGTAATT TGTCAATTAA CTCACCAGTA ATTGTTGGTT TAGAAAACAG TGACATCATC GCCAATGCAG TTCAAGGTCG TGGTGGCAAT ATTAATATCA GCACTCAAAA CATAATCGGT CTAGAGTTTC GCGATATCCT CACTCCCCGC ACAGTCCCAA CAAATGATAT TACTGCTAGT TCCCAGTTTA ATGTTAATGG CACAGTGCAA ATTAATAACA TCAGTATTGT CCCCAGTTCT GGTTTAGTCG AACTACCTGC AAATATTACT GACCCATCAC AGCAAATAGC TATAGGATGT GCAGATACTA GTGGCAGTAG TTTTGTCGCG ACAGGACGAG GTGGAATACC CCAAAATCCC ACTCAGGAAG TGAGGAGCGA TAAACCTTGG TCTGATGTTC GCGATCTCTC TTCATATCGC ACAACAGCAC AAGTGCAAGC ACAAGTACTT CAATCCCGAG CGAATTTTAT ACAAGCTACT TCCTGGCATC GTAATTCCCA AGGGAAAATT GAGTTAGTTG TAGATAAATC TTCTATGAGT ATGCAACCGT CATTAACCTG TGTTGCTGTT CCTAAAAGTT AA
|
Protein sequence | MKVTFSIFAL ISVVLTSVTY NSRVQAKVTP DSKLKTTVTG SNNYTITNGN RVGNNLFHSF SEFSIPTNGS AFFDNANDIQ NIFSRVTGGN LSNIDGLIQA NGSANLFLLN PSGIIFGANA RLDIGGSFVG TTGNSIKFAD GTEFSAVNAS SSPLLTMSVP IGLQMGQDSG AITVQGLGHR IIPPFSVAQE LDLSNNPTGL QVKAGNTLAL IGSGLNFAGG IVAADGGGHI EIGSINHGLV RLNSTVTGWK GDYSQVEQFN DIHLAQQSLL DASGSNGSIQ LQGQNINLTE GSTVVIQNLG TQSQGITVHA TGSLNLTGYT PDQKQGSIIA IENLGTSSSG DIVVSANQLF VQDGGQIQTF TPTAAASGNI SIDVEDLIYL NGFIPTNPTV NTKIITITDG SGKAGDITIS SGSLKVFNGA SLISVTMGYG EGGAMQINAK DLIEIVGNNP IILVPSAISS ATIGTGNANS ILVNTSRLIL RDGGVLGSNT LSQGRAGNVT VNASDFLEVS GKAPGSIRSS NIMSSSEILD QVVQETYGLP SIPTGDAGFL TINTPSLRIS DSAFVSVKND GPGRAGDLQI NANLLFLYKE GSISASTASG NGGDIQLNLQ DYLLMYQDSV ISATAQGNGN GGNLSINSPV IVGLENSDII ANAVQGRGGN INISTQNIIG LEFRDILTPR TVPTNDITAS SQFNVNGTVQ INNISIVPSS GLVELPANIT DPSQQIAIGC ADTSGSSFVA TGRGGIPQNP TQEVRSDKPW SDVRDLSSYR TTAQVQAQVL QSRANFIQAT SWHRNSQGKI ELVVDKSSMS MQPSLTCVAV PKS
|
| |