Gene Information |
Locus tag | Ava_4398 |
Symbol | |
ID | 3680525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5510609 |
End bp | 5512174 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637719751 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_324891 |
Protein GI | 75910595 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| |
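The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS coordinates include the stop codon (the gene sequence in this record does end in TAA). The function names `gene_length` and `protein_length` are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 5510609
END_BP = 5512174


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1566, matching "Gene Length"
print(protein_length(length_bp))  # 521, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1566 bp) and Protein Length (521 aa) fields.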
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0324397 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000237208 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATGTAG AGGAATTGCT GGCACAATAT GCGACTGGAG TCATCAATTT TAGTGGTGTT GACCTCTCGG AAGCTAATTT GAGTGGTGTC AAACTCTGTG GTGTGAATTT TAGCCAAGCA AATTTGAGTA TAGCTAACCT GAGTGGATCA AATTTGAGCG AAGCTGACTT CAGCCATGCC AAACTGAACG TAGCTAGACT GAGTGGTGCC AATCTCACTA ATGCTATTTT CAATCATTCC AGCCTCAATG TTGCTAATTT AATTCGTTCT GACCTCAGTC GCGCTCAATT ACGTGGGGCT TCCTTGGTAC GTGCTGAGTT AATTCGTGCG GAACTCAGTC GGGTTGATTT GTCGGAAGCA AACCTCAACA GTGCTGATTT GCGAGAAGCT ACACTCCGCC ACGCCAATCT TCGCCACGCC AATTTGAACG GAGCCAGCTT GAAAGGTGCA TCCTTGGTAG GAGCTAATTT GGAAATGGCT AACCTTAATG GCTCTGACTT GAGTCGTTGT GATTTAACCA GCGCCAACCT GCGAGATGCT GAACTAAAAC AGGTAAACTT CCGCCATGCT AACCTTAGTG GTGCAGATTT GAGCGGAGCG AACCTCCGGT GGGCTGATTT GAGTGGTGTA AACCTGAGTT GGGCTGATTT AAGTAATGCA AAATTGAGTG GTGCTAATTT AGTCGGCGCA GACTTGAGCA ATGCCAATTT AACAAATGCT AGTTTAGTCC ATGCCAATTT AATTCAAGCA AAATTAATTA GAGCAGAATG GGTGGGTGCT GATTTAACCA GCGCCATTTT GACTGGTGCA AAACTTTACT CTACTTCCAG ATTTGGCTTA AAAACAGAGG GTTTGATTTG TCAATGGATT GACTTAAGTC CTACAGGCGA TCGCTCCATT ATCCAACGGT TTGACACTGA AGATCCACGA GAATTTTTCA ACGAAACCCC ACCAACCATT CAAATTGTTA TCGATGCAGC TTTAGAAGCA GAAGCGAACT TTGCCTTAGC TGGCGCTTAC TACCACATCG CCCAAGAATA CTCCATCCTC AAACAACCCC CCAGCATGGA AATCAGTCGT CGTCGGACTG TGTTTACATT CCGGGTAGAT AATGACGCGG ACTTATTACC TACAGCCTGT ATGGCAATTT TACCTTTTAA AGATGCTGTT AATACCCAAA AGAATATTTA TGCCCTGTTA GCCATGATGG AACAGGAAAA TATATCTTCC TTAGGAATAA AAACACCCCA TCGGGTGAAG GAATTAACAA GTGCGATCGC GGAAGCTATA AGTCAGGCAC AGACCATGAA AAAAACCAAA AAGAACCTGC ACTTAGCTGC AAAAATCGAA TTTTTTCAAG CTCCAACCCA AACCATATTA ACTAATTCCA GCGCTCAAAC TTTGATTGTT CATGACAGCC CCAACTTTGG TAAGCGATTT ATCAACTCCT CAATTACCGA AATGACTTTT TCATCCGATG TATCCGGTGA ATCTCTGCCA CATAACTTAC CTGGGTTAAA CACACTTACA GATTTTGTTA ACAGTTTTCA CTATGTGAAT GAATAA
|
Protein sequence | MNVEELLAQY ATGVINFSGV DLSEANLSGV KLCGVNFSQA NLSIANLSGS NLSEADFSHA KLNVARLSGA NLTNAIFNHS SLNVANLIRS DLSRAQLRGA SLVRAELIRA ELSRVDLSEA NLNSADLREA TLRHANLRHA NLNGASLKGA SLVGANLEMA NLNGSDLSRC DLTSANLRDA ELKQVNFRHA NLSGADLSGA NLRWADLSGV NLSWADLSNA KLSGANLVGA DLSNANLTNA SLVHANLIQA KLIRAEWVGA DLTSAILTGA KLYSTSRFGL KTEGLICQWI DLSPTGDRSI IQRFDTEDPR EFFNETPPTI QIVIDAALEA EANFALAGAY YHIAQEYSIL KQPPSMEISR RRTVFTFRVD NDADLLPTAC MAILPFKDAV NTQKNIYALL AMMEQENISS LGIKTPHRVK ELTSAIAEAI SQAQTMKKTK KNLHLAAKIE FFQAPTQTIL TNSSAQTLIV HDSPNFGKRF INSSITEMTF SSDVSGESLP HNLPGLNTLT DFVNSFHYVN E
|
| |
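The sequences above are displayed in space-separated blocks of 10 residues, so the spaces need to be stripped before any downstream analysis. The sketch below is illustrative only: the helper names `clean_sequence` and `gc_content` are hypothetical, and the example uses just the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove the display whitespace between 10-residue blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence as shown in the record.
fragment = clean_sequence("ATGAATGTAG AGGAATTGCT")
print(len(fragment))         # 20
print(gc_content(fragment))  # 0.35
```

Applying the same two helpers to the full gene sequence would give a value that can be compared against the GC content field; the figure for this short fragment is not expected to match the whole-gene value.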