Gene Information |
Locus tag | Aazo_3741 |
Symbol | |
ID | 9341546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 3798494 |
End bp | 3800959 |
Gene Length | 2466 bp |
Protein Length | 821 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | surface antigen (D15) |
Protein accession | YP_003722407 |
Protein GI | 298492230 |
COG category | |
COG ID | |
TIGRFAM ID | |
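The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the reported 2466 bp), and that the protein length excludes the stop codon. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Aazo_3741.
length = gene_length(3798494, 3800959)
print(length)                  # 2466
print(protein_length(length))  # 821
```

Both results agree with the Gene Length and Protein Length rows. Strand does not affect this arithmetic, since a gene on the minus strand spans the same interval.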
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATTTAT CTCCGATGTT ACTGGCAGTT GTGGCGATCA CAACCCCTTT TGATAGTTCA TTGAGTGCAA CTGCACAAAC TCCTGATTAT ACTCAGCAAG TAACAGAAGT TGCCACGGTA GGAACAAACC AACATCTGCC ACAGAAAAGC AGTCAGAATC ATCTTGATCA ACAACAAGAA TCAGTCATAG TCACGAAAAC AGAAGTTAGA GAACCCGAGT CTCCATCCTC TATAACTCCC TCCATAACTC CTATTTCAGC CTTAACCATT GACTCAACAA CAATAGCAAC ACCAGAAGTA ATTCCTCCAA ATATCAGCAC CTCATCAAAA ACAGCACAAG CTTTAAAAAA AGCAATAGCT CCTAACTCAG CAAACAGAGA AAGGAAAGCG GTAATTATCC CCACAGCACA AACCCTAAAA GCATCGCTAT TATCTACCAA TTCAGTAGGG ATAGAGACAT CACAAGCAAT TATACCCAAT ATCACCACCC CATCCAAAAC AGCACAAGAG CAAGAATCAG AAGCAACTGC TGACACAAAT TATATACAGG AAAACGTCGA ACCACCCACA GCACAACCAG CTTTCCCGAC TAATCCCGAA ACAACACCAA CAGCAGAACC TCTTGTATTA GTTTCAGAAG TAGCAGTCAA ATCTCTAACT GGTGCTATTG CAAAAGAACT AGAAGATAAA GTTTATCAAG CCATTCGTAC CCAAGCAGGA CAAACAACTA CTCGCTCCCA ACTCCAAGAA GATATTAATG CTATCTTTGC CACGGGCTTT TTCTCTAACG TGCAGGCCAT GCCAGAAGAT ACGCCCTTGG GTGTAAGGGT AAGTTTCGTT GTTAGTCCTA ACCCGGTTCT GAGTAAAGTA CAAGTAGAAG CTAATCCGGG AACTGGTGTA GCTTCAGTCA TACCTGCCAA AACTGTAGAT GAAATCTTCA GTAAACAGTA TGGACAAATC TTGAATTTGC GAGATTTACA AGAAGGAATT AAAGAACTAA CCAAAAAGTA TCAAGACCAA GGTTATGTGC TGGCCAACGT GATTGGAGCA CCTAAAGTAT CAGAGAATGG AGTTGTTACC CTACAAGTAG CAGAAGGGGT AGTAGAAAAT ATCAAAGTCC GTTTCCGCAA CAAACAGCGT GAGGAAGTAG ACGACAAAGG TAATCCCATT CGCGGACGAA CAAAAGATTA TGTAATTAAG CGAGAATCCG AATTAAAGCC TGGTCAGGTA TTTAACCGCA ACATCGTGCA GAAAGACCTG CAAAGGATAT TTGGTTTAGG ATTATTTGAA GATGTAAGTG TGTCCCTTGA TCCTGGTACA GATCCAAGTA AGGTAGATGT GGTACTCAAT GTGGCGGAAC GCAGTAGTGG TTCAATCGCT GCTGGTGCGG GTATTAGTTC TGCCACTGGG TTATTTGGTA CTGTTAGCTA TCAACAGCAA AACCTGGGGG GTAGAGCGCA GAAACTGGGG GCTGAGGTAC AGTTAGGAGA AAGAGAATTG CTGTTTGACC TGCGGTTTAC AGATCCTTGG ATTGCTGGTG ATCCTTACCG TACTTCCTAC ACAGCTAATA TTTTCCGTCG TCGTTCAATT TCTCTGATTT TCGAAGGTAA AAATGACGCT ATAGAAACCT TTGACCCTAG TGATATTACT AATGAAGATG ACCAGGATCG CCCCCGAATT ACCCGTTTAG GCGGTGGTGT ATCCTTTACC CGTCCTCTTG CTGCTAATCC TTACCAAAAT TCAGAATGGA CAGCTTCAGC AGGTTTGCAG TATCAACGAG TTTCTAGCCG TGATGCTGAT 
GGCAATCTGA GAAAAGAAGG GGCGATATTT GATGATAATG GCAATCAAAT TAGCCCTACA ATTCCTCTGA CGCAATCAGG TACAGGTGAA GACGATTTGC TATTATTGCA ACTGGCCGCA CAACGTGATC GCCGTAATAA TCCCTTACAA CCTACCAATG GTTCTTACCT CCGCGTCGGA GTTGACCAAT CTGTACCCGT GGGACAAGGC AATATTTTAC TGACTAGGCT ACGGGGTAAC TACAGCCAAT ATTTACCAGT AAAATTCATC GGCTTTGGTA AAGGCGCACA AACCCTAGCA TTTAACCTCC AAGGGGGTAC AATTCTTGGT GATGTACCTC CCTACGAAGC CTTTACCCTT GGTGGTAGTA ATTCTGTGCG GGGTTACGAT GAAGGGAGAT TAGCAACTGG ACGTAGCTAT ATACAAGCAT CTGTTGAGTA TCGTTTTCCT GTCTTTTCTG TAGTCAGTGG CGCTCTATTT TTTGATTACG GTAGTGACCT GGGAAGCAAT ACCAGAACAG CAGAAATTTT GAACAAAAAT GGTACTGGCT ATGGTTATGG TCTAGGTGTG CGTGTACAGT CACCATTAGG ACCAATTCGT ATAGACTACG GTATGAGCGA TGATGGCGAT AGCCGCATTA ACTTCGGGAT AGGGGAAAGG TTTTAA
|
Protein sequence | MHLSPMLLAV VAITTPFDSS LSATAQTPDY TQQVTEVATV GTNQHLPQKS SQNHLDQQQE SVIVTKTEVR EPESPSSITP SITPISALTI DSTTIATPEV IPPNISTSSK TAQALKKAIA PNSANRERKA VIIPTAQTLK ASLLSTNSVG IETSQAIIPN ITTPSKTAQE QESEATADTN YIQENVEPPT AQPAFPTNPE TTPTAEPLVL VSEVAVKSLT GAIAKELEDK VYQAIRTQAG QTTTRSQLQE DINAIFATGF FSNVQAMPED TPLGVRVSFV VSPNPVLSKV QVEANPGTGV ASVIPAKTVD EIFSKQYGQI LNLRDLQEGI KELTKKYQDQ GYVLANVIGA PKVSENGVVT LQVAEGVVEN IKVRFRNKQR EEVDDKGNPI RGRTKDYVIK RESELKPGQV FNRNIVQKDL QRIFGLGLFE DVSVSLDPGT DPSKVDVVLN VAERSSGSIA AGAGISSATG LFGTVSYQQQ NLGGRAQKLG AEVQLGEREL LFDLRFTDPW IAGDPYRTSY TANIFRRRSI SLIFEGKNDA IETFDPSDIT NEDDQDRPRI TRLGGGVSFT RPLAANPYQN SEWTASAGLQ YQRVSSRDAD GNLRKEGAIF DDNGNQISPT IPLTQSGTGE DDLLLLQLAA QRDRRNNPLQ PTNGSYLRVG VDQSVPVGQG NILLTRLRGN YSQYLPVKFI GFGKGAQTLA FNLQGGTILG DVPPYEAFTL GGSNSVRGYD EGRLATGRSY IQASVEYRFP VFSVVSGALF FDYGSDLGSN TRTAEILNKN GTGYGYGLGV RVQSPLGPIR IDYGMSDDGD SRINFGIGER F
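The gene and protein sequences above are displayed in space-separated blocks of ten. A minimal sketch of how one might strip that formatting, compute GC content, and translate the coding sequence follows. It assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the gene is on the minus strand) and uses the standard codon assignments, which translation table 11 shares for amino acids. The helper names are hypothetical.

```python
from itertools import product

# Standard amino-acid assignments in TCAG codon order; NCBI translation
# table 11 uses the same assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
head = "ATGCATTTAT CTCCGATGTT ACTGGCAGTT"
print(translate(head))  # MHLSPMLLAV
```

The translated prefix matches the first block of the listed protein sequence. Running `gc_fraction` on the full gene sequence should give a value close to the 43% reported in the record.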