Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0537 |
Symbol | |
ID | 3682367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 675190 |
End bp | 678069 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637715865 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_321056 |
Protein GI | 75906760 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.884465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTCA GTCTCCGGCA ATGGTTGGCA GAACGCCAGA TAGAAATCAG CGATATAAAA ACATTTGCTT CTGGGCAACT GGCAGGTATT GCCTATCGTA TCGTTCAGGA TATGGAACTT AAAAGCCTGA TGCCGTTGGA TATTTGTACG TTAGCAGAAG TGCTGCAATT ACCATTAGGC ACAGTCGAGG AAGAAATTTC TGTGCTTGCT TCTTTAAGCG AACATCTTCT CCGTCACCTC AGCCAAAAAA AAGCCCTCAA ACGTAATGAA GGTACTTGGT TAGCATTCCA AATTGCTTAC CTATTGGCGT TAGAACAAGT TTTACTCCAA GAAGAACAAT TAAAAAGACC TTGGCTAAAT CGTGCCAAAA TACCTGCACT AGCCACAATA ATTATCTCAG ATCCTCAACT CCAAGGATTA TTAAAAACTC TCTCCCCTGG TAAATTAACT GATACTCAAG CCGAACAAGC GCTGATTTCC GTGGCAGATT CATTACTAGT ACAACAAATG AATCATGCTA CTGTAGCCTG GTTAATGGCC AATGGTGCGG AGGAGTTAGA AGCTAAATTA TTAACTCAGC GTTTAGATAA TTCTCTCCCC GGATACCTGA TAAAAATCAT TGCTCAAAAC CCTGCACCTT TAGCTCAGCT ACAAAAATTT TTCCGGATAG GAACTTTAGA AGATGTTTTA AATATTGACT TATATAAAGA GAAGTATCGC GCTAGTTTGC TGCAAACTCT CAGTACACCA TTATTGATGG AATATTTTGC CCTGAAGAAT ATTTATGTCC CTTTATCTGG TGTACCTCAA GAACCTAATG ATGGACAGTC TATTGATTTA AAAATATGGG TAGAAAAACA ACTTATTGAT TTAGAAACTA TTGCTGTAAT TGAATCAGAA CCTGGTTATG GTAAAACTAG TTTTTGTCAG ATTTGGGCGG CAGAAGTAGC ATTAAAGCTT TATCCTCATT GGCTGCCCAT ACTGATTCGG TTGCGGGATA TTAAATATGG TAAAAGTTTA CTGGAAACCC TCAATTCTGG TTTTACATTC AATGTTCATA TCAACTTATC CGCTTGGTTA GAGCAAACAA ATAATCGGTG TGTTTTATTA CTAGATGGAT TAGATGAACT TCCCTCTTCT CATCAGGGAA ATAGAGATAA ACAAATTTTT ATTCAACAGT TACTCCAATT TCAATCCCAA GAACAACATA AAATTGTCTT GACTAGTTGC TCTCACACAG TAGAGGAAAT TACTTCAGAA ATCCCCCTCC AATGGCGGAC TATTAAAATT CAACCTTTAG AAGTAAACCA GTTAAAACAA TGGTTTCAAC AATGGGCATT ATTCCAGTCG TTACCCATTT CCCAAAACTT TTTTACATTC TTAAAACAAG CAGGATTATT TGCTAATAAG TCTCACTTAT CTGAACTATC TCATTTAGTA CGTCAACCAT TAATGTTGCA TTTATTGGGT GTTTTCCACC GCGACGGACT ATTAGATGAT GAAATATTGC AAAAAAATGC CAAGTTTTCT CTCCATTGGG AAATTTACTC TCGCCTAAAA CGATGGTTGC TGGGGTATCC GTTGACTGCG GGAATGAGGA CAATGCTATT ACGTCCGGGA ACTGCTCATA TTCACCGTAC ACCAGAAGCA ATCACCAATT TACTAGGAGA ATACCATCCC CAAGACCTGA TTGCACAAAT GCAGGCGATC GCTCTCAAAA TTTTACATGG CGATCGTCAT CAAGTTACCT TAACTGGAGA ATTCAATACA AACACTCTCC CGGCTTTATA TTTCCGTTAT TGTGTTAGCA GTCAGTTGCC ACAAGCTAAT GACAAAGCAC AACTAACAGT TAAAGTAGAG TTTTCCCACT CACAAATCGG TGAATATCTC TGTGCTGAAG CTGTAGCAAC CCAACTGCAA AGGCTAACCC AGTGCCAAGA AGATGTTTAC GGAACAGAAA CGTTTATTTT CAATACTCCT AGCAGTGTTG CCCAGCATCT CTACAATTTA CTAGGCTACG GCATTATTAC ACCAGAAATT GAGGGATTAG TCATCGCCGC ATTACAAACC CAGCAAAAAC CTATTTTATT ACGACGACTA GAATCTTTTT GGCGTGGTTG GTGTCAAGGA CGTTGGTTAG ATGAAAGCAT TGCCCACACA GCACTACCCT ATTTTCACAG CTTACAAAAT CTTGTGAATG TTGAGCAGGT GAATACAAAT GTGGGAATAA ATGTATTTTT ATTGCTAGCT GCAATTTGTC GAGACATTCA AGTTGCTTTT AGTCCTTGTG GTAATCCAGC AAATGTCAGT GAGTTTTACC CCCAAACAAT GATGATGTTG TTAGCTAAAG CCTCTTTTTC TGGAAGTAAC ACTCTCATAA AGCGCATCTG TTCCCAATCT TTAGCTAAAA TCAACCTTTC AGGGGCTTTT TTATCGTCAG TCGTCCTGAC TGGCGCAAAT CTTGAACAAG CAAATTTATG CGATGCTGTA TTGGTGAATG CAAATTTAGC TGGTGCTAAC TTAAATAACG CAAATTTAGC TGGTGCTAAC TTAGCCGGCG CAAATCTCAC CGATGTCAGC TTGGAAACCG TCAATCTTAC CAATGCTTGT TTATGTGATG TTCCCCTCAC GGAGGCTGAA AGAGAAATTG CCCAATTCCA CGGCGCATTA TTTTCTCTAG AACAATTTCA AGTAGTGAAA AGTTTATTAT CCAAGCAATA TTACCTTAGT ATTTCTAGTA CCAAAGAGAA AACTAAGTTT TGGAATCAAA ATAGTCTCGA TACCGGCTTA ATTGAAAGCT TGGAAGGTGA AGTAATTATG CCTACAATTT TAGATCACGA GACCTATGAT GAGACGGTTT TTGGTAAAAG TGAGTTATAA
|
Protein sequence | MNLSLRQWLA ERQIEISDIK TFASGQLAGI AYRIVQDMEL KSLMPLDICT LAEVLQLPLG TVEEEISVLA SLSEHLLRHL SQKKALKRNE GTWLAFQIAY LLALEQVLLQ EEQLKRPWLN RAKIPALATI IISDPQLQGL LKTLSPGKLT DTQAEQALIS VADSLLVQQM NHATVAWLMA NGAEELEAKL LTQRLDNSLP GYLIKIIAQN PAPLAQLQKF FRIGTLEDVL NIDLYKEKYR ASLLQTLSTP LLMEYFALKN IYVPLSGVPQ EPNDGQSIDL KIWVEKQLID LETIAVIESE PGYGKTSFCQ IWAAEVALKL YPHWLPILIR LRDIKYGKSL LETLNSGFTF NVHINLSAWL EQTNNRCVLL LDGLDELPSS HQGNRDKQIF IQQLLQFQSQ EQHKIVLTSC SHTVEEITSE IPLQWRTIKI QPLEVNQLKQ WFQQWALFQS LPISQNFFTF LKQAGLFANK SHLSELSHLV RQPLMLHLLG VFHRDGLLDD EILQKNAKFS LHWEIYSRLK RWLLGYPLTA GMRTMLLRPG TAHIHRTPEA ITNLLGEYHP QDLIAQMQAI ALKILHGDRH QVTLTGEFNT NTLPALYFRY CVSSQLPQAN DKAQLTVKVE FSHSQIGEYL CAEAVATQLQ RLTQCQEDVY GTETFIFNTP SSVAQHLYNL LGYGIITPEI EGLVIAALQT QQKPILLRRL ESFWRGWCQG RWLDESIAHT ALPYFHSLQN LVNVEQVNTN VGINVFLLLA AICRDIQVAF SPCGNPANVS EFYPQTMMML LAKASFSGSN TLIKRICSQS LAKINLSGAF LSSVVLTGAN LEQANLCDAV LVNANLAGAN LNNANLAGAN LAGANLTDVS LETVNLTNAC LCDVPLTEAE REIAQFHGAL FSLEQFQVVK SLLSKQYYLS ISSTKEKTKF WNQNSLDTGL IESLEGEVIM PTILDHETYD ETVFGKSEL
|
| |