Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_3413 |
Symbol | |
ID | 3679938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 4240074 |
End bp | 4242425 |
Gene Length | 2352 bp |
Protein Length | 783 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637718764 |
Product | TPR repeat-containing protein |
Protein accession | YP_323915 |
Protein GI | 75909619 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.335652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGCAG AAGAAGCTCT AGAGCTTTTG GATAGCTTGG TTTATAGGAA AACAGGGGAA CGTTTAGGCA CTACTCAAAG AATAATTCTA CGTAATTTAT GGGAGGATAG AAAACAAACT TATCAAAATA TTGCGGATAT TTGTGGTTAT ACGGAAGCCC ATTTAAAAGC AGTTGGCGCG GAATTGTGGC AAATACTGAC GAACGTCTTA GGGGAAAAAG TCTCAAAGTC AAACTTTTCT TCAGTGGTGC AGAGACTTTG GCAATCTCAA CGGCTACAAA AGATGTCGCC TATTGTAAAT TCAGTCCTCA ATAATGGCAA GTCACAAGAA CTGGATTATA ACTTTGTGGG GCGCGATCGC GAAATAGGCG AACTTGATAG CTATGTTGTC CGAGGTGCAA AAATTATCCT CATCCAAGGT GAAGGCGGTG TTGGCAAAAC TACCTTAGCG CGGCGATATT TTAAGGCGCA AGGTTTTGAT TTCCTAGAGT TGTGGATGGC TAAGGAAAGC CAACATATCG TTTCTGTGGA GAGTGTGGTT GAGGAATGGC TAAGGGTTGA TTTTAATGAG GAACCAGGTA AAGAGTTTGG TATTAATCTA GATAGGCTAC GGCGCAAGCT ACGGGATGAA ACGCGCAAGA TAGGGGTTTT AATTGATAAT CTTGAGTCTG CGTTGGATAG AAATGGTCAA ATTATTGCTT CTCGTCGTTC CTATGTGGAA CTGTTACGGG TATTAGCAGA CCCTAGCATT CAATCTATCA CTTTGATTAC TAGTCGTGAG CGTCTGTATG AATCAGATGT AGATTTGACT TTCTATCCCC TGGGGGGTTT GGATGAGTGT ACTTGGCAAA AGTTTTTTAC TAGTTGTCAA ATCAAATCTA ATTCATCCGC ACTGAGTGAA ATGTATAAAG CTTTCGGTGG AAATGCCAAA GCCATGCAAA TTATTAGTGG CGCAATTACT ACAGATTTTG AAGGAAATGC AGATATTTAT TGGCGAGAAA ATAAGCATGA TTTATTAATT GAACCAGAGT TAAAAAATTT AGTTGCTAGT CAATTTGACC GTTTAGAACA AATGGATAGC GAAGCATATC GGCTGCTTTG TCGTTTGGGA TGCTATCGCT ATCAAGATGT TACTCATGTG AGTGTTCAGG GATTACAATG TTTACTTTGG GATGTGCCGG AACAACAAGC TAGGCGAGTC GTTAGATATC TAACAGATCG TTTATTGATA GAGTTTCGTA AAGGAAAATA TTGGTTACAT CCGGTGATTT GTACAGAGGC GATCGCCAGA TTAAAACAAA GCGGTGAATG GCAAATTGCT AATCAACAAG CGGCGGAATT TTGGAATCAA AGCGTTACTC AAGTAGAAAA CCCTCAAGAT GCTTTGATGG CGTTAGAAGC TTATCATCAC TATATGGAAA TTGGGGATTA TGAACAAGCT GCTGATGTGA TTATTCGTAG TAGACCCAAA AAGTGTGATC ACAGTATATC TTTAGGAGTT TTATTTAATC GACTGGGTTT ATTGGAAACC TTAATTTCTG TGATTAATCC CCTCATTTAT AATTTACATT CCGACTATCA TTTGAATATA CTGTATAACC TATTGGGACG AGCTTATCAC CAAATAGGTA ATATCAACTT AGCTCTGGAA TGCCACTACA AGTCTAATAA AATTGCTGAA AAAAATAACT TTCTGCAAGA GAGAATTTCC AGTAGTTTCA ATTTGGGTCT TTGTTACACA GATTTATGGG AAATTGACAA AGCAAGTGAA ATTTTTTACT ACGTCAAGAA TCTAGCAGCA ACAGACAGAA ATTATTATCA ATATGTTGTT TATTCTCTGT GTTGTTTAGC TTATTTAGAT TCTTCTGTCG GCAATAATGA AAACGTAACG TTGATGTTAC GAGAAGCAGA AGCCGGATTA TCTCACGACA GATTAACTTC CTGGGGTATA GGCACTAGTT TACTATTTCT CAGCTTGACT TATAAAAATT TAGGTTTAAT AGATAAGGCT TTGTTAATGT GCCATCAAGC TATTAACCAT TGTCGTCAAA ATCAGTTTAG TTTTTTGGAA GCAAGGGCTA CATCTTGTTT AGCATCATTA TATCGAGAAC AGGGACAGTT TACAGTGGCG ATAGATAAAC ATTTAGAAGC GATCGCCAAC ATGAACAAGG TGTCTGATAA ATGCAATCTC GCCAAAGCCT ACTACCAATT AGGGTTGACC TATCAGAGAA TGGGCGAAGT TAACCAAAGT AGAGAGACAT TTCATCAGGC GATCGTTATT TTCAATGATC TACCTGCACC AAAGCAAGTA GAAAAAGTGC AAATAACAAT GATGCGTTTA GAAAATAGTT AG
|
Protein sequence | MGAEEALELL DSLVYRKTGE RLGTTQRIIL RNLWEDRKQT YQNIADICGY TEAHLKAVGA ELWQILTNVL GEKVSKSNFS SVVQRLWQSQ RLQKMSPIVN SVLNNGKSQE LDYNFVGRDR EIGELDSYVV RGAKIILIQG EGGVGKTTLA RRYFKAQGFD FLELWMAKES QHIVSVESVV EEWLRVDFNE EPGKEFGINL DRLRRKLRDE TRKIGVLIDN LESALDRNGQ IIASRRSYVE LLRVLADPSI QSITLITSRE RLYESDVDLT FYPLGGLDEC TWQKFFTSCQ IKSNSSALSE MYKAFGGNAK AMQIISGAIT TDFEGNADIY WRENKHDLLI EPELKNLVAS QFDRLEQMDS EAYRLLCRLG CYRYQDVTHV SVQGLQCLLW DVPEQQARRV VRYLTDRLLI EFRKGKYWLH PVICTEAIAR LKQSGEWQIA NQQAAEFWNQ SVTQVENPQD ALMALEAYHH YMEIGDYEQA ADVIIRSRPK KCDHSISLGV LFNRLGLLET LISVINPLIY NLHSDYHLNI LYNLLGRAYH QIGNINLALE CHYKSNKIAE KNNFLQERIS SSFNLGLCYT DLWEIDKASE IFYYVKNLAA TDRNYYQYVV YSLCCLAYLD SSVGNNENVT LMLREAEAGL SHDRLTSWGI GTSLLFLSLT YKNLGLIDKA LLMCHQAINH CRQNQFSFLE ARATSCLASL YREQGQFTVA IDKHLEAIAN MNKVSDKCNL AKAYYQLGLT YQRMGEVNQS RETFHQAIVI FNDLPAPKQV EKVQITMMRL ENS
|
| |