Gene Ava_3474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3474 
Symbol 
ID3679786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4307191 
End bp4308744 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content43% 
IMG OID637718826 
Producturoporphyrinogen-III synthase / uroporphyrinogen-III C-methyltransferase 
Protein accessionYP_323976 
Protein GI75909680 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG1587] Uroporphyrinogen-III synthase 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.027722 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.984084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAC AAAATGGCAA AGTCTACCTT GTGGGTGCGG GGCCGGGAGA TGTGGCATAC 
CTCACAGTGA AAGCTTATAA TCTACTGGCT ACGGCTCAAG TGTTGGTTTA TGATGCCCTA
GTTGATGAGC AATTGTTGCA GTGTGTATCA CCCGATTGTC TTAAACTAGA TGTAGGTAAG
CGTGGCGGTA AACCCAGCAC ACCACAAGCC GATATCAATC ATTTACTAGT CAAATACTGC
CAAGAAGGTG CATTAGTCGT TAGGTTGAAA TCAGGCGACC CGTTTATTTT TGGACGCTGT
ACCTCTGAAA TTGAGGCTTT GAAAGCAGCA GGCTGTAAAT TTGAGGTAGT ACCAGGAATT
TCCTCATCCA TAGCCGCGCC TTTATTAGCA GGAATCCCTC TCACTGACCC GGTGATGAGT
CGTTGCTTTG CAGTGTTGAC AGCCCACGAA CCAGAAGTTT TAGACTGGGA GGCGCTGTCA
AGGTTAGACA CCCTGGTCAT ACTAATGGGT GGGAAAAACT TAGCAGATAT AATTAATGAA
CTTTTAAGAA GAGGCCAATC GCATCTCACA CCCATAGCTA TTATTCGCTG GGCAGGAACC
CCCAGTCAAC AAATCTGGAC TGGTCAACTG GGCGATATCC TTGAACAAAC AAGGGGTTTA
TCTCTTTCCC CAGCAGTCAT CGTCATTGGT GAAGTTGTCG GACTACGCAA GTACCTACAA
CCTGAGAAAA TATTTCCAGA GAACTCAACT ACACCTATGT CCAGCAACCA ACCTCTCACT
GGAAAAACCA TCCTCGTCAC CCGTTCATCT GGTCAATCGA GTCAATTTAG CGATCGCCTA
ACTACACTTG GCGCTACAGT AATAGAAATG CCAGCATTAG AAATAGGCTC CCCTTCTAGT
TGGGAGGAAT TAGATCAGGC GATCGCTAAT TTATCTCAAT TCGAGTGGTT AATTCTCACT
TCTACCAACG GTGTAGATTA TTTCTTTGAA AGACTCAACC TCCAAGGTAA AGATACTCGT
GCTTTAGCTG GGGTAAAAAT TGCCGTTGTT GGGGAAAAAA CCTCCCAAAG CCTCAAACAA
AGAGGAATCC AAGCAGATTT TATCCCCCCT AACTTTGTAG CTGATTCTTT AGTAGAGCAT
TTCCCTGAAT CACTACTAAA TAAAAAGATT TTATTTCCCA GAGTTGAAAG CGGTGGTAGA
GAAATTTTAG TCAAAGAACT ATCAGCAAAA GGCGCAGAAG TAATTGAAGT AGCTGCTTAT
CAATCTTGTT GTCCTAGTAG TATTCCCCCA GCAGCAGAAC TAGCCTTAAA AAATCACACA
ATAGATATCA TCACCTTTGC CAGTTCTAAA ACTGTGCAAT TTTTCCATCA ACTTGTAGAC
AACATATTTC CTCACAATAT CCCCAACACT TTAGCAGGAA TATGTATTGC TTCCATCGGC
CCCCAAACCT CGAAAACTTG TCATACCCTA TTAGGTCGAG TAGATGTAGA AGCTGAAGAA
TATACTTTAG ATGGATTAAC CCAATCCATC ATAAATTGGG CATTAAAAGT TTAA
 
Protein sequence
MTQQNGKVYL VGAGPGDVAY LTVKAYNLLA TAQVLVYDAL VDEQLLQCVS PDCLKLDVGK 
RGGKPSTPQA DINHLLVKYC QEGALVVRLK SGDPFIFGRC TSEIEALKAA GCKFEVVPGI
SSSIAAPLLA GIPLTDPVMS RCFAVLTAHE PEVLDWEALS RLDTLVILMG GKNLADIINE
LLRRGQSHLT PIAIIRWAGT PSQQIWTGQL GDILEQTRGL SLSPAVIVIG EVVGLRKYLQ
PEKIFPENST TPMSSNQPLT GKTILVTRSS GQSSQFSDRL TTLGATVIEM PALEIGSPSS
WEELDQAIAN LSQFEWLILT STNGVDYFFE RLNLQGKDTR ALAGVKIAVV GEKTSQSLKQ
RGIQADFIPP NFVADSLVEH FPESLLNKKI LFPRVESGGR EILVKELSAK GAEVIEVAAY
QSCCPSSIPP AAELALKNHT IDIITFASSK TVQFFHQLVD NIFPHNIPNT LAGICIASIG
PQTSKTCHTL LGRVDVEAEE YTLDGLTQSI INWALKV