Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_5047 |
Symbol | |
ID | 3683528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 6337386 |
End bp | 6338525 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637720407 |
Product | thiamine-phosphate pyrophosphorylase |
Protein accession | YP_325539 |
Protein GI | 75911243 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase |
TIGRFAM ID | [TIGR00693] thiamine-phosphate pyrophosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.554063 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.412974 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAATCC TTGATGCGAG AAACATTGCA AATATGAAAG AGGCTGGCTA TTACGATGAA AGCATCACTA ATGGGGTAGT TGTAATGGTA GAACCATACA GCCAACAAAA GCAAATCCAG CAGGTTGTTT ACCGTATATT AGATGCTAAT TTAGACCGCG CTCGTGAAGG GTTGCGAATT ATTGAAGAAT GGTGTCGCTT TGGGTTGAAT AGCGGCCAGT TGGCAGGGGA ATGTAAGTAC TTGCGCCAAG AGGTGGCTGT TTGGCACACA GAAGAATTGC GAGCAGCAAG AGATACAGCA GGTGATCCCG GCACTGATTT GAGCCATCCA CATGAGGAAC AACGCTCTAG TATTAAGGCG TTGTTACAAG CTAACTTTTG TCGTGTGGAA GAAGCTTTGC GGGTGTTGGA GGAATACGGT AAGCTTTATC ACCCGAAAAT GGGACAGGCT TGTAAGCAGA TGCGATATCG AGTTTACAGC CTGGAAACTA ATTTGATGGG TCATCAGCGC CATCAACTGC TGCGGCGATC GCGTTTATAT CTTGTCACTT CCCCATCGGA AAGTTTATTA CCGACGGTGG AAGCTGCTCT CAAGGGCGGC TTGACATTGT TACAATATCG TGACAAGGAC GCTGATGATT CTGTCCGTTT GGAACTGGCG ACGAAACTCC GCCAACTCTG TCACAGCTAC GGCGCTTTAT TTATTATCAA TGACAGGGTG GATTTGGCTC TGGCTGTGGA TGCTGATGGT GTGCATCTAG GTCAGCAAGA TATGCCCATC GCTACTGCTA GGCAATTACT AGGCCCCCAA CGTCTTATCG GTCGTTCTAC TACCAATGCT GATGAAATGC AAAGGGCGAT CGCTGAAGGT GCAGACTATA TAGGTGTAGG GCCCGTATAC GAAACTCCCA CCAAAGTAGG TAAGGCGGCG GCTGGTTTAG GATATGTGAG TTATGCGGCT CAACATAGTT CAGTTCCCTG GTTTGCTATT GGGGGCATTG ATGCCAATAA TATCAATGAT GTGATTGATG CAGGAGCTGA AAGAGTGGCG GTAGTGCGAT CGCTCATGCA GGCGGAACAA CCTACCCTAG TCACACAATA TTTGATTTCT CAACTCCATC GCATTCAGCC AGAAAGTTAA
|
Protein sequence | MQILDARNIA NMKEAGYYDE SITNGVVVMV EPYSQQKQIQ QVVYRILDAN LDRAREGLRI IEEWCRFGLN SGQLAGECKY LRQEVAVWHT EELRAARDTA GDPGTDLSHP HEEQRSSIKA LLQANFCRVE EALRVLEEYG KLYHPKMGQA CKQMRYRVYS LETNLMGHQR HQLLRRSRLY LVTSPSESLL PTVEAALKGG LTLLQYRDKD ADDSVRLELA TKLRQLCHSY GALFIINDRV DLALAVDADG VHLGQQDMPI ATARQLLGPQ RLIGRSTTNA DEMQRAIAEG ADYIGVGPVY ETPTKVGKAA AGLGYVSYAA QHSSVPWFAI GGIDANNIND VIDAGAERVA VVRSLMQAEQ PTLVTQYLIS QLHRIQPES
|
| |