Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_2326 |
Symbol | |
ID | 9340126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 2413483 |
End bp | 2415756 |
Gene Length | 2274 bp |
Protein Length | 757 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | |
Product | FHA modulated glycosyl transferase/transpeptidase |
Protein accession | YP_003721411 |
Protein GI | 298491234 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCCC CGCAACCGCC TCAAAACCCA CAAACCTTAC TTGGTCAAGT GACACAAGCA GTAAACACAA TTCAAGCTAG GGTTGATTTT TCCAAACTGG CGCTCAAGCC CAATGCCAAA GTACCGGAAC TCTGGGTGCA AGATGCGGGG GCGGACAAAG CGGAAATATA TCCCTTATTG GGCGATCACT ATATACTGGG CCGTAGTTCC AAATCCTGTG ATATCGTCGT TCGTAACCCT GTTGTTAGCC AAATTCACCT ATCCTTGTCG CGGGATTCCA GCCAACTGAC TCCCACTTTC ACCATCAAAG ATCAAAATTC CACCAACGGC ATTTATCTAG GGAAGCGACG AGTCACTTCC CTAGAACTGC GTCATGGTGA TGTTTTCACC TTGGGACCCC CAGAACTAGC CGCTTCCGTT CGACTGCAAT ACGTAGATCC TCCTGCCTGG TACGTCAAAG CGGCAACATT GGGACTTTAT GGTGTTGGCG GTGTCAGCGC CTTATTAGCC CTAGCAATTG GGCTGGAATG GACGGAATTT GCTATCAGAC CCCTCCCGAC AGCCACACGC GCCCCTGTAG TTGTTTATGC CCGTGATGGT TCTACTCCGC TGCGAGAACC CCGAAATATA GCTCATGTAG ACCTAAAACA ACTATCAGAT TTTAGTCCCT ACCTACCTGC TGCTGTAGTC GCTTCAGAAG ACAGTCGTTA TTACTGGCAC TTTGGGATTG ACCCATTAGG AATTTTGCGA GCCGTATTAA TTAATAGCCG CACAGGTGAC GTACAGCAGG GAGCTAGTAC AGTTACTCAG CAAGTTGCCC GCAGTTTATT CCGAGAATAT GTAGGTAGAC AAGATTCCTT GGGGCGGAAA GTCCGCGAGG CTGTTGTTTC GTTGAAGTTA GAAACCTTTT ATAGCAAAGA TGATATTTTG CTGACCTACT TAAATCGAGT ATTTTTAGGG GGAGATACTT CTGGCTTTGA GGATGCTGCT AAATATTACT TTGATAAATC TGCCAAAGAA TTAACTCTTG CCGAAGCAGC AACCTTAGTA GGAATTTTAC CGGCTCCCAA CGCCTTCGAT TTTTGTGGAG ATGGACCCAA AAAACTGGGA GCAGCAGATT ACCGCAATCG TGTTGTTAAG CGGATGTTGG AAATGGGCAA AATCACCACT GAAGATGCCA ATCGAGCCAG ACGTTCCACT GTTCAAGTCA GCGCCAAAGT TTGTGAAAGA CAAGCCAACA CAATTGCTCC TTACTTTTAT AATTACGTCT TTCAAGAATT GGAATCAATT TTAGGGGAGG GAGCAGCAAG AGAAGGCAAC TATATTATCG AAACACAATT AGATCCAGCA ATTCAAGCTC AAGCAGAATC ATCACTGCGA AATTCAGTTA GTAACGCTGG TTCAAGCTTT CGGTTTTCCC AAGGTTCTCT TGTTACCTTA GATTCCAGAA CTGGAAGCAT CTTAGCAATG GTAGGGGGAA CCGATTATAA AAAAAGCCAA TTCAATCGTG CTGTGCAAGC CCAAAGACAA CCAGGTTCCA CCTTCAAGAT ATTTGCTTAC ACCGCTGCAC TTGAACAAGG AATATTATCA TCCAGAAGCT ATTCTTGCGC TCCTTTAACT TGGCAAGGTT TTACCTACAA ACCCTGTCGG TCTGGAGGCG GTGGTAGTTT AGATATAGCT ACAGGTTTAG CACTCTCAGA AAACCCCATT GCTTTAAGAG TTGCCAAAGA AGTGGGACTA AATAAAGTAG TGGATATGGC GCAGCGTTTG GGGGTCAAAT CTTCACTTGA TCCTGTTCCT GGTTTGGTTC TGGGTCAAAG TGTAGTCAAT GTTTTGGAAA TGACTGGTGC TTTTGGCGCT ATTGGCAATC GTGGAGTGTG GAATCCACCC CATGCTATTA GCAGGATTTT AGACAGTAGT GATTGTGAAG ACCGTAAGGA TTTAAAAACC TGCCGCGTTA TCTACTCCTT TGACCAAGAT CCAGATGGTA ATAAACGAGT TTTGAAAACA GATGTAGCCG ATAAAATGAT TGGTTTAATG CAGGGTGTAG TCTCCAGGGG TACTGGTCGT AGCGCGTCTA TTGGAGTGGG AGAAGAAGCG GGAAAAACTG GTACAACTGA TAAAAACGTG GACTTGTGGT TTATTGGTTT TATTCCCAGT CAGCAACTTG TAACTGGTAT TTGGTTAGGA AATGACAATA ATAACCCCAC ATCTGGTAGC AGCGCCCAAG CCGCCCAGTT ATGGGGAAAT TATATGCGGA AAATCACTAA GTAA
|
Protein sequence | MSSPQPPQNP QTLLGQVTQA VNTIQARVDF SKLALKPNAK VPELWVQDAG ADKAEIYPLL GDHYILGRSS KSCDIVVRNP VVSQIHLSLS RDSSQLTPTF TIKDQNSTNG IYLGKRRVTS LELRHGDVFT LGPPELAASV RLQYVDPPAW YVKAATLGLY GVGGVSALLA LAIGLEWTEF AIRPLPTATR APVVVYARDG STPLREPRNI AHVDLKQLSD FSPYLPAAVV ASEDSRYYWH FGIDPLGILR AVLINSRTGD VQQGASTVTQ QVARSLFREY VGRQDSLGRK VREAVVSLKL ETFYSKDDIL LTYLNRVFLG GDTSGFEDAA KYYFDKSAKE LTLAEAATLV GILPAPNAFD FCGDGPKKLG AADYRNRVVK RMLEMGKITT EDANRARRST VQVSAKVCER QANTIAPYFY NYVFQELESI LGEGAAREGN YIIETQLDPA IQAQAESSLR NSVSNAGSSF RFSQGSLVTL DSRTGSILAM VGGTDYKKSQ FNRAVQAQRQ PGSTFKIFAY TAALEQGILS SRSYSCAPLT WQGFTYKPCR SGGGGSLDIA TGLALSENPI ALRVAKEVGL NKVVDMAQRL GVKSSLDPVP GLVLGQSVVN VLEMTGAFGA IGNRGVWNPP HAISRILDSS DCEDRKDLKT CRVIYSFDQD PDGNKRVLKT DVADKMIGLM QGVVSRGTGR SASIGVGEEA GKTGTTDKNV DLWFIGFIPS QQLVTGIWLG NDNNNPTSGS SAQAAQLWGN YMRKITK
|
| |