Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4850 |
Symbol | |
ID | 3679348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 6108360 |
End bp | 6110090 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637720207 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_325342 |
Protein GI | 75911046 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00626433 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATAAAA AAATGTCACA ACTACCTCAT GTTTGGAAAC TATTACAGAA TTACGTTCGG GGTACAAGGT CAACAGATAG GCCTACAACA AAGTATGTAG GTAGTCGCTC ATTTTTACCT CTGGGAAAAT ACCAACAAAA ACCAGCCAAA GATGTAAACT TTTCTCAAGT TAATTTACAG AAAGAAATTT TATGTGAAAG ATTACAAATC AGTTTAGAGG CATGGATAAG AGAGGATAGT AGGGGGCAAT TTTCAACTAG TAGTAGCCAA CTCCAACAAG AGATTTATGA TTTACTCGGT AATGGTGCAT TAACTCCAGA AATAGTCGAA AAAGTTATAG AATTGCCTAT CCTTGAAAGA GAATTTCGCC CAGAGTTGCT ATTTCACCGC CTAGAAGACT TTTATCAACG TTGGTGTCGA GGAGAATTTA TCGATGCGCC ACCAAGCGAG AACTTACCTC AGCGAAAATT GCTGGAGATG GCAGCACAAA ATCAGGGTAT TGGACTGAGG CAGATTGATA TTTACGCCGG GCTTAATGTC CTGATTTTAC TTCTAGAATT ACACCGCCAC GCCCAAGAGC AAGAAGAACT GCGGCAACAA ATCAACTTTT ATCCTTCTTC CCCACCAGAT ACAGACAGCT TTTTCACCTC CCAACTACTG CGGGTAATTA ATTATAGCGA TGCCATTGAA ATTGGCAACT TTAGTAATAT CGTGGGGGAA TTTCTCCGGG ATGGTAATTT TCAAGGTGCA TACCTGGGAA ATGCCAACTT GACGGGAGTC AACTTCAGTG GTGCTAACCT CAGTGGTGCA TACCTTGGTG ATGCCAACCT TACAGGCGCA AACTTCCAAG GTGCTAACCT CACAGGTGCA GACTTTGGTG ATGCCAACCT CAGTAGTGTT AATCTCAGTG GTGCTAACCT GAGTAGTGCC GACCTCAGCA GCGCCAACCT TACAGGTGCA AACCTAAGCG GTGCTAACTT GGAACGTGCC GACCTCAGCC GCGCTGACTT GAGTAGCTGC ATCCTCAATG ATGGCGAATT AAGCCACGCC AACCTCAGTG GTGTCAACTT CAGAGATGCC GAACTCTGTC GCGCTAACCT CAGCAACGCC ATCTTATTTG GTGCTAACCT CAGTGATGCC AACCTCAATC ATGTTGACCT CAGTCGCGCC GACCTTTGTC GTGCAGACCT GAGTGGTGCA GACCTCACCC ACGCCACCCT CAACGGTACT AACCTCAGCG ACACCATTCT TTTTAGTACT AACTTAAGTG ATGCCATTCT GGAGGCAGCC GACCTCAGTT ATGCCAAACT CAACGGCGCG AAACTCAACT ACGCCAGACT CAACGGTGCT ATGTTCTTAG GTGCAGACCT CAGTGGCGTA GATTTAACTG GCGTGGTTCT CAACGATGCC GATTTGAGTG GCGGAATTCT CAGCGAAGCC GACCTCACAG GTGCAGACCT CAGCGATGCG GTACTTTTGG GTACTGACTT CAGCTTTGCC AATCTCAACA GCGCCAACCT CAGTGGTAGT AACTTGAGTG GCGCAATTTT GAATGGTGCA GACCTCAGTA GCGCTAACTT CAGTTATGCC ATTCTCGACG ATACAGACTT AAGTGAAGCC AACCTAGAAG ATATGACCTG GGGAGAAATT CAGCAATGGG AAGGTGTGCG CGGTTTGGAG ACAGCGCTGA ATCTGCCGGA GGCGTTGCGG GAAAAGTTGG GAAAGAGGTA A
|
Protein sequence | MDKKMSQLPH VWKLLQNYVR GTRSTDRPTT KYVGSRSFLP LGKYQQKPAK DVNFSQVNLQ KEILCERLQI SLEAWIREDS RGQFSTSSSQ LQQEIYDLLG NGALTPEIVE KVIELPILER EFRPELLFHR LEDFYQRWCR GEFIDAPPSE NLPQRKLLEM AAQNQGIGLR QIDIYAGLNV LILLLELHRH AQEQEELRQQ INFYPSSPPD TDSFFTSQLL RVINYSDAIE IGNFSNIVGE FLRDGNFQGA YLGNANLTGV NFSGANLSGA YLGDANLTGA NFQGANLTGA DFGDANLSSV NLSGANLSSA DLSSANLTGA NLSGANLERA DLSRADLSSC ILNDGELSHA NLSGVNFRDA ELCRANLSNA ILFGANLSDA NLNHVDLSRA DLCRADLSGA DLTHATLNGT NLSDTILFST NLSDAILEAA DLSYAKLNGA KLNYARLNGA MFLGADLSGV DLTGVVLNDA DLSGGILSEA DLTGADLSDA VLLGTDFSFA NLNSANLSGS NLSGAILNGA DLSSANFSYA ILDDTDLSEA NLEDMTWGEI QQWEGVRGLE TALNLPEALR EKLGKR
|
| |