Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_3427 |
Symbol | |
ID | 3679895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 4259330 |
End bp | 4260427 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637718779 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_323929 |
Protein GI | 75909633 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAA CAAAAGGTAG TAATAAATTT CTATTTTTTC TGAAGCTAGA ACTCATATTA CCTCTATTAG GACTTTCTGG TCTTTTAACA GCGATATTTC TTCCTCGTTG GACAGTTGAC AATGCAAAGC TTACCGGTAC AAAGCGTCTT GAAGTCGCTA ATGCTTATCG TGTCACCATA ATCCAGGGTT TCGGAGGATT ATTTTTCCTG AGTACAGCTT ATTTTACGTG GCGGAACTTA AAAGTTGCCG AGGGTAACTT AACAACCGCA CAAAATAAAC AAATAGCAGA TGCAAAAGTT GCTGAATTAA ACCTAAAAGC TATACAAGAT AAACAACTTA CAGAACGTTT TGCCAAAGCC GTTGATATGT TGGGGCATCA AGATATTCAT GTTCGCTTAG GTGCAATTTA TTTACTAGAA AGAATTTCAA AAGATTCTGA GCAAGATTAC TTGCAGATAA TGGAAATTCT CACGGCTTAT GTGCGAGAAA AGTCTCCTTA TCAATACCCT CACAATACAG TAAACAACTT AGAAGACGAA TCTTTTGTTC CACTACCAAT AGATATTCAA GCAGTTCTGA CTGTTATTGG TAGACGCAAA GATCCTGAAA CAGAAGAATT AAATTCTTTA GATTTGAGTG GCAGTAATCT TAGGGGAGCT AAACTTGGTT CAGCTAACCT AAAAAAAGTG GATTTTAGCA GAGCTAACCT CAGCTATGCT TACCTTGTAG AAGCAGAACT TGAGAACTCT ATCTTTGATA AAGCTGTTAT TAAAAAAGCT GACTTAACAG AAGCCAAATT AAGCAATGCT AATCTTAAAG ATGCCAACTT ATTTGACAGT ATCCTTAATG GAGCAAATCT CACGAGCGCT AAGTTAGGTC ATACTAACCT CAGTTACACC AGTCTTGATG GAGTTAATTT CAAAGATGCT TACTTAGTAG TAGCAAACCT TTTAGAGGCT CACTTATACA AAGCAGAAAA CCTTGATCCA AATCAAATTG TCAAAGCAAT TAATTGCGAG AAAGCCCACT ACGACCCTGA ATTTCGCAAA CAAATAGGTT TAAGAAACAC AACTATAACT AGTACAAATA ATTTATAG
|
Protein sequence | MNKTKGSNKF LFFLKLELIL PLLGLSGLLT AIFLPRWTVD NAKLTGTKRL EVANAYRVTI IQGFGGLFFL STAYFTWRNL KVAEGNLTTA QNKQIADAKV AELNLKAIQD KQLTERFAKA VDMLGHQDIH VRLGAIYLLE RISKDSEQDY LQIMEILTAY VREKSPYQYP HNTVNNLEDE SFVPLPIDIQ AVLTVIGRRK DPETEELNSL DLSGSNLRGA KLGSANLKKV DFSRANLSYA YLVEAELENS IFDKAVIKKA DLTEAKLSNA NLKDANLFDS ILNGANLTSA KLGHTNLSYT SLDGVNFKDA YLVVANLLEA HLYKAENLDP NQIVKAINCE KAHYDPEFRK QIGLRNTTIT STNNL
|
| |