Gene Information |
Locus tag | Ava_1429 |
Symbol | |
ID | 3682568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 1766415 |
End bp | 1768343 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637716766 |
Product | glycoside hydrolase, starch-binding |
Protein accession | YP_321947 |
Protein GI | 75907651 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
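The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Ava_1429.
length = gene_length(1766415, 1768343)
print(length)                  # 1929
print(protein_length(length))  # 642
```

Both results agree with the Gene Length (1929 bp) and Protein Length (642 aa) fields.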
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.56449 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTACAAA CACCACCATC ACAATTTTCC GAAGAGCAAT ACAAAGTCGA AAATGCTAAT GCAGAGGTAC AAGCATTAAT CGAATCGCCT ACGCCAGATA CAGAAATAGA TTTAGAATTT CTCTACACTA GAGATATTGA ATTTCGCCAA GAAACTATTT ACTTCCTCGT TGTAGATCGC TTTTATGATG GCGATGCTGA AAATAGCGAA GGTTATAATC CAGAACTTTA TGACCCCAAT GGGAAAGATT GGGGCAAGTA TTGGGGGGGT GACTTACAAG GAGTAATCGA CAAATTAGAC TACTTGAAAG ATATGGGAGT AACCGCACTT TGGCTAACTC CTTTATTTGA ACAAGTCGAG GAATTATTTG TTGGTAACGC AGCCATGCAC GGCTATTGGA CAAAAGATTT TAAACGGCTA AATCCTCGCT ACATTGGTAA TGGAGAAGAT CCTTCCTTAA ACAATACTCA AGAAACCAGA AATACGACTT TTGACCGCTT AATTGCAGAA TTACACAAGC GGAAAATGAA GCTGATACTA GATATTGTCT GTAACCATAG CAGTCCTGAT ACTAGTGGTA GCAAAGGTGA GTTATATGAT GACGGCGTAA AAATTGCCGA CTTTAATGAT GATGTGAATC ACTGGTATCA CCACTATGGT GAAGTGCAGA ACTGGGAAGA TGATTGGCAA GTCCAAAACT GTGAACTGGC TGGTTTAGCT ACTTTTAATG AAAATAATAC TGAGTATCGT AACTATATCA AGTCTGCAAT TAAGCAATGG CTAGACCGGG GTGTGGATGC GCTGCGGGTA GATACAGTCA AACATATGCC AATTTGGTTT TGGCAAGAAT TTACTGGTGA TATGTATAAT CACAAACCAG ATGTATTTAT TTTTGGTGAG TGGATTTACA ATTATCCCAG TGACGATCGC TCGGTGGAAT TTGCCAATAA TTCCGGTATG ACTTTACTTG ATTTTGGTCT GTGCGTAGCA ATTCGCGGCG CATTAGCCCA AGGTGCGGAA GGGGGATTCC ATCTCATCCA AGAAATATTC GACCAAGATG ATCGCTACAA CGGGGCTACG GAGTTAATCA CCTTTGTTGA TAACCATGAT ATGCCCCGCT TTCAATCCCT CAACCCCGAT CCAGCGATGT TGAAAGTAGC GATCGCTCTC ATTATGACAT CACGGGGTAT TCCATGTATC TATTACGGTA CAGAACAATA TCTGCACGAT GATACCAACG GCGGTAATGA CCCCTATAAC CGCCCCATGA TGGAAAATTG GGATACTGAT ACTGAGGTTT ATAGATACCT CCGATTGTTG TCTGGTATCC GACGGTTAAA TCCAGCCGTC TCTATGGGTA GCCAGTGGCA AAAATACCTC ACACCCGATG TTTATTGTTA TGTCCGCCGT TATCGTTCTT CTGTTTGCTT CGTCGCCCTT AATCGTGGTG GAGAGGTGAC TTTACCAGAA GTCCAAACAG ACTTACCAGA TGGTGAACAT ACTTGTGCGG TGACTCGGAA TAAATTTGAG GTAAAAGACG GTAAAATTTA CAATCTACAA CTGGAAGAAC GGGGAGTCAT CGTTTTAAGT CATGTAGGCG AACGAGTAAA AGCGCAAACC ATTATCCGCG TTCAACTTAA TGGTGTACAT ACTCAACCAG GGGAAACTAT CGTAGTTGTC GGCGACTGTC CAGAATTGGG TAACTGGGAT ATTAGTAAAG CCTATCCCCT GGAATATATC AACTCTAATA CTTGGTTTGC CGAAATTCCC TTTGATGAAA GTGCAGGCAA ACTCATTAGT 
TACAAATATG CCATGTGGCG TGAAGGGCGA TCGCCTCTGC GAGAAAATAC CTTAAATCGT CGCTGGGTAG TGGCGAAGGA AGGCACTGTT AAATGGCGTG ATACTTGGGC TTCTGGGAGA GAGTCTTAA
|
Protein sequence | MVQTPPSQFS EEQYKVENAN AEVQALIESP TPDTEIDLEF LYTRDIEFRQ ETIYFLVVDR FYDGDAENSE GYNPELYDPN GKDWGKYWGG DLQGVIDKLD YLKDMGVTAL WLTPLFEQVE ELFVGNAAMH GYWTKDFKRL NPRYIGNGED PSLNNTQETR NTTFDRLIAE LHKRKMKLIL DIVCNHSSPD TSGSKGELYD DGVKIADFND DVNHWYHHYG EVQNWEDDWQ VQNCELAGLA TFNENNTEYR NYIKSAIKQW LDRGVDALRV DTVKHMPIWF WQEFTGDMYN HKPDVFIFGE WIYNYPSDDR SVEFANNSGM TLLDFGLCVA IRGALAQGAE GGFHLIQEIF DQDDRYNGAT ELITFVDNHD MPRFQSLNPD PAMLKVAIAL IMTSRGIPCI YYGTEQYLHD DTNGGNDPYN RPMMENWDTD TEVYRYLRLL SGIRRLNPAV SMGSQWQKYL TPDVYCYVRR YRSSVCFVAL NRGGEVTLPE VQTDLPDGEH TCAVTRNKFE VKDGKIYNLQ LEERGVIVLS HVGERVKAQT IIRVQLNGVH TQPGETIVVV GDCPELGNWD ISKAYPLEYI NSNTWFAEIP FDESAGKLIS YKYAMWREGR SPLRENTLNR RWVVAKEGTV KWRDTWASGR ES
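The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the Python below strips that whitespace, computes a GC fraction, and translates codons until the first stop. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 nucleotides of the record are used as input.

```python
# Codon table built in the conventional TCAG order; amino acid assignments
# are identical for the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the gene sequence in this record.
fragment = "ATGGTACAAA CACCACCATC ACAATTTTCC"
print(translate(fragment))               # MVQTPPSQFS
print(round(gc_fraction(fragment), 2))   # 0.4 (this fragment only)
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would be the way to compare against the 642 aa protein and the reported 43% GC content.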