Gene Ava_5047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_5047 
Symbol 
ID3683528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6337386 
End bp6338525 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content47% 
IMG OID637720407 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_325539 
Protein GI75911243 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.554063 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.412974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAATCC TTGATGCGAG AAACATTGCA AATATGAAAG AGGCTGGCTA TTACGATGAA 
AGCATCACTA ATGGGGTAGT TGTAATGGTA GAACCATACA GCCAACAAAA GCAAATCCAG
CAGGTTGTTT ACCGTATATT AGATGCTAAT TTAGACCGCG CTCGTGAAGG GTTGCGAATT
ATTGAAGAAT GGTGTCGCTT TGGGTTGAAT AGCGGCCAGT TGGCAGGGGA ATGTAAGTAC
TTGCGCCAAG AGGTGGCTGT TTGGCACACA GAAGAATTGC GAGCAGCAAG AGATACAGCA
GGTGATCCCG GCACTGATTT GAGCCATCCA CATGAGGAAC AACGCTCTAG TATTAAGGCG
TTGTTACAAG CTAACTTTTG TCGTGTGGAA GAAGCTTTGC GGGTGTTGGA GGAATACGGT
AAGCTTTATC ACCCGAAAAT GGGACAGGCT TGTAAGCAGA TGCGATATCG AGTTTACAGC
CTGGAAACTA ATTTGATGGG TCATCAGCGC CATCAACTGC TGCGGCGATC GCGTTTATAT
CTTGTCACTT CCCCATCGGA AAGTTTATTA CCGACGGTGG AAGCTGCTCT CAAGGGCGGC
TTGACATTGT TACAATATCG TGACAAGGAC GCTGATGATT CTGTCCGTTT GGAACTGGCG
ACGAAACTCC GCCAACTCTG TCACAGCTAC GGCGCTTTAT TTATTATCAA TGACAGGGTG
GATTTGGCTC TGGCTGTGGA TGCTGATGGT GTGCATCTAG GTCAGCAAGA TATGCCCATC
GCTACTGCTA GGCAATTACT AGGCCCCCAA CGTCTTATCG GTCGTTCTAC TACCAATGCT
GATGAAATGC AAAGGGCGAT CGCTGAAGGT GCAGACTATA TAGGTGTAGG GCCCGTATAC
GAAACTCCCA CCAAAGTAGG TAAGGCGGCG GCTGGTTTAG GATATGTGAG TTATGCGGCT
CAACATAGTT CAGTTCCCTG GTTTGCTATT GGGGGCATTG ATGCCAATAA TATCAATGAT
GTGATTGATG CAGGAGCTGA AAGAGTGGCG GTAGTGCGAT CGCTCATGCA GGCGGAACAA
CCTACCCTAG TCACACAATA TTTGATTTCT CAACTCCATC GCATTCAGCC AGAAAGTTAA
 
Protein sequence
MQILDARNIA NMKEAGYYDE SITNGVVVMV EPYSQQKQIQ QVVYRILDAN LDRAREGLRI 
IEEWCRFGLN SGQLAGECKY LRQEVAVWHT EELRAARDTA GDPGTDLSHP HEEQRSSIKA
LLQANFCRVE EALRVLEEYG KLYHPKMGQA CKQMRYRVYS LETNLMGHQR HQLLRRSRLY
LVTSPSESLL PTVEAALKGG LTLLQYRDKD ADDSVRLELA TKLRQLCHSY GALFIINDRV
DLALAVDADG VHLGQQDMPI ATARQLLGPQ RLIGRSTTNA DEMQRAIAEG ADYIGVGPVY
ETPTKVGKAA AGLGYVSYAA QHSSVPWFAI GGIDANNIND VIDAGAERVA VVRSLMQAEQ
PTLVTQYLIS QLHRIQPES