Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4849 |
Symbol | |
ID | 3679347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 6105041 |
End bp | 6107962 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637720206 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_325341 |
Protein GI | 75911045 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.692365 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0232391 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAGC GTTTCTGGCA AACTTGGCAA GAATTTAGGC AGTCATTTTC AGTATCAGAA AGCCTAAGTA CAAGTGTCGA GACAGGTAAG GCAGTCTTGG AAGCTGCCAA TACTCTCAAA GAAGAGGGTG ATAGTATAGA AATCTTGCAA TCAGTCCTGC AAAACTCATC TTCTTTATTA GATGTGTTGT GTTCGCCAAT GGCGCAGGTT ATAGGTGCGG GGTTGCCCTT TGTACCAATT GGCATTGCTT TGTTAAAGTT TGCCCGTGAT ATAAATCAGA AAGAACCATC CTTAGAAGAC TGCTTTTTTA TAGTTAGTCA AGCGGCTTAT TTGGAAAGCA CCAAAGAAAT TTTAAGTTTA AATATATATC AAAATTTTAA CTGGGATGCC AAGCTTGATA TTCAAGCCAT CAGTCAGCAA ATAGAAAAGC TCAATGATGT TGAATTTAAT TCTGATACTG CGAGTAAAGC AATCAGATGT TTCCATGAAT CACCTTTAGC TGAAGCTTTC AACAGGGTTT TATTAGCTAG ATTAGCAGCA GCAAATATTT CCCCAGGACT AGCGGACATT TTAACCCAAC GTGTGGCTCG GAACACTCAC CGCCATATCA TTAAAGCTTG GATAGAAGCG GGTGAGGCGA TAAAAACTTT AATTCAGCCT TCCTTGGGTG ATTGGCAGAG AGAACAGGAA AGGTTTCAGA GTATTGATAA TTATTTAAAA ACTCATATTG AGCAGAAACC TTTTGAGTTA GTTTTTGATG AAAAGTTCGC CTTTAAAGAT ATTTATGTAC CTATCAAAGC CAAGCCTGTA GATGCAAATG GCAAGATAGA TGAAGAAAAG GATTCATTCA ATTTAGACAC TTGGGCGAAA ACGATTCTGT TGAATCCTGA TAACTTGGAA CAGGTGATGT TTATCCAAGG AGGCCCAGGC AGGGGGAAAA GTGTTTTCTG TCGAATGTTT GCTTACACAG TGTGGCGACA GTTACACCCA ATTTGGACAC CAATCTTAAT TCGCTTGAGA GATATCGACA CTTTTGAAAC ACGGTTAGAG AATACTATTA AAGCCGAATT AAAACTGGGT TTTATTCAAG GTGATGCTAA CTGGTTGACT AATGCCAATA CTCGGTTTTT GTTTATTCTT GATGGTTTTG ATGAACTGCA TATTGAAACG AGAAATAACC TCAATTTGGG CGATTTTATT AAACAAGTAG CTGGTTTTCA GAAAGAATGC AAAGATTATA GGGAAATGGG GCATCGAGTT ATCATTACTG GTAGGTCGAT GGCTTTACAA GGTATTGCTG ATTTACCCCG CAATTTGGAA CGGGTGGAGA TTGTGGAAAT GGATGGGCAA CTCCAACAGC AATGGTTAAA TAAGTGGGAA GCTGTACAAG TAAATAAAGG TAAAACCATT GCGTTTGAGC AGTTTTTACA AAGTGATAAA TGCCCCGATG AAGTTAAGAA ATTAGCTCAG GAACCGCTAT TGCTCTATTT ATTAGCGGCA ATGTATCGAG ATTCTAAATT AGATATTCAT AAGTTAGAGC AGGCAAGTGA TAATCGCACC GCTAAAATTA TCATTTACCA AGAAGCTGTA AATTGGGTAC TGACTAAACA GCGTTCTGAA CCAGATGGAA CTGATTTAAA TATTGAGTTA ACTAAACAAA AGCCTGAGGA TTTAAAACGC ATCCTCATGG AAGCTGCGGT TTGTGTTGTG CAGTCTGGTG GTGAGTTTGC TTCTATGTCC ATGTTAGAAG CACGTTTACA AGAAGATGAA GGAGCGAAGG CTTTAATTGA AAAAGCGAAA GAAAAACTGG GGAATGAAGC ACTGAAAACG GCTTTGGCTG CATTTTACAT TCGTCCGGCG GAAAAACAAG AAGGTGGGGT TGAGTTCTTC CATAAAAGTT TTGGGGAGTT TCTCTTTGCT GAACGCCTGA AAGCACGGCT TAAGGCTTGG ACGCAATATT ACGATGGGGA TGAGGGGAGA CAGCCAATTA TTTCTGAAGC TGTAATGAAT TGGGAAATCT ATGATTTACT CGGTTACGGT GGGTTAACAC AGGAAATTGT AGACTACCTG ATGGGGTTGT TAACTGAGAG TCAAGATTTC CGGTGGGTGG AATTATTTAA GCGGTTAGAC AAGTTTTATA GTAAGTGGTG TCAAGGGAAA TTTATTGACA CATCTGAGGA GACTTTACCC CAGAAGAAGT TGCGACAGTT GCAAAGGTAT GGCATTCAAG GGTTAGGTCA GCGTCAGGTG GATGTTTATG CTGGGTTGAA TGTGATGATT CTGCTGTTGG AGTTACACCG CTATGCTCAA GGGCGAGATG AGCTGAAAGC AGAAATTGTC TTTTATCCAT CTGGGAAACC GCAGGGACAT AGGCTCACAG CCCGATTGCT TCGCATCATG AATTATAGTG ATGGGTTGGA TTTAGGGAAC TTTATTAGGA TAGTTGGCAA ATTCCTCAGA GGCGCAGACC TCAGTGGCGC AGACCTCAGT GGCGCATTCC TCAAAGGAGT ATTCCTCAGA AGCGCAGACC TTAGTGGTGC ATATCTCAGA GGTGCAGACC TCAGAGATGC ATACCTCAAT GGCGCAGACC TCAGTGGCGC AGACCTCAGT GGCGCATACC TCAATGGCGC ATACCTCAAT GGTGCATACC TCAATGGCGC ATACCTCAGC CACGCAGACC TCAGTCGTGC AGACCTCAGA AGTGCAGACC TCAGAAGTGC AAACCTCATT AGTGCAGACC TCATTAGCGC AGACCTCATT AGCGCAGACC TCAATGGTGC AGACCTCAGT CACGCAAACC TCGGTGATGA ATTTTGGGGA GATGTCAAAT GGGATGAAAA GACAAACTGG GAGAATGTAC GAGGGCTGGA TACAGCGATT AATGTGCCAG AAGCGTTAAA GCGACAGTTA GGACTGAGTT AA
|
Protein sequence | MGKRFWQTWQ EFRQSFSVSE SLSTSVETGK AVLEAANTLK EEGDSIEILQ SVLQNSSSLL DVLCSPMAQV IGAGLPFVPI GIALLKFARD INQKEPSLED CFFIVSQAAY LESTKEILSL NIYQNFNWDA KLDIQAISQQ IEKLNDVEFN SDTASKAIRC FHESPLAEAF NRVLLARLAA ANISPGLADI LTQRVARNTH RHIIKAWIEA GEAIKTLIQP SLGDWQREQE RFQSIDNYLK THIEQKPFEL VFDEKFAFKD IYVPIKAKPV DANGKIDEEK DSFNLDTWAK TILLNPDNLE QVMFIQGGPG RGKSVFCRMF AYTVWRQLHP IWTPILIRLR DIDTFETRLE NTIKAELKLG FIQGDANWLT NANTRFLFIL DGFDELHIET RNNLNLGDFI KQVAGFQKEC KDYREMGHRV IITGRSMALQ GIADLPRNLE RVEIVEMDGQ LQQQWLNKWE AVQVNKGKTI AFEQFLQSDK CPDEVKKLAQ EPLLLYLLAA MYRDSKLDIH KLEQASDNRT AKIIIYQEAV NWVLTKQRSE PDGTDLNIEL TKQKPEDLKR ILMEAAVCVV QSGGEFASMS MLEARLQEDE GAKALIEKAK EKLGNEALKT ALAAFYIRPA EKQEGGVEFF HKSFGEFLFA ERLKARLKAW TQYYDGDEGR QPIISEAVMN WEIYDLLGYG GLTQEIVDYL MGLLTESQDF RWVELFKRLD KFYSKWCQGK FIDTSEETLP QKKLRQLQRY GIQGLGQRQV DVYAGLNVMI LLLELHRYAQ GRDELKAEIV FYPSGKPQGH RLTARLLRIM NYSDGLDLGN FIRIVGKFLR GADLSGADLS GAFLKGVFLR SADLSGAYLR GADLRDAYLN GADLSGADLS GAYLNGAYLN GAYLNGAYLS HADLSRADLR SADLRSANLI SADLISADLI SADLNGADLS HANLGDEFWG DVKWDEKTNW ENVRGLDTAI NVPEALKRQL GLS
|
| |