Gene Ava_1574 details

Gene Information

Locus tag: Ava_1574
Symbol: hemH
ID: 3681117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1944974
End bp: 1946140
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 46%
IMG OID: 637716914
Product: ferrochelatase
Protein accession: YP_322092
Protein GI: 75907796
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0276] Protoheme ferro-lyase (ferrochelatase)
TIGRFAM ID: [TIGR00109] ferrochelatase
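
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a complete CDS that ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1944974, 1946140)
print(length_bp, protein_length(length_bp))  # → 1167 388
```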


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.634764
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCGTG TAGGCGTATT ATTACTCAAT CTCGGTGGTC CCGATAAGCT GGAGGATGTA 
GGGCCTTTTT TGTTTAACCT ATTCTCCGAT CCGGAAATTA TACGCTTACC ATTCCGGTGG
TTGCAGAAGC CCTTGGCTTG GTTTATTGCT TCTCGACGCA CCAAAACCTC CCAAGAGAAC
TATAAGCAAA TTGGCGGTGG CTCCCCACTA CGGCGGATTA CGGAAGCCCA AGGAGAAGCC
TTAAAGGAAC AGTTGCATGA TTTGGGTCAA GAAGCGAATA TCTATGTGGG AATGCGTTAT
TGGCATCCAT ATACGGAAGA AGCGATCGCT CTTTTGACCC AAGATAACCT GGATAACTTG
GTGATTTTGC CACTATACCC CCAATTCTCC ATCAGCACTA GTGGCTCTAG CTTCCGTCTA
CTAGAAAGAC TTTGGCAAGA AGACCCCAAA CTACAACGTC TGGACTACAC CGTCATCCCC
TCTTGGTATA AAGAACCATG TTATTTACAG GCGATGGCGG AACTCATTAG CCAAGAAGTA
GACCAATTTC CTGATCCTGA TCAAGTTCAT GTGTTCTTCA GCGCTCATGG TGTACCCAAA
AGCTATGTTG AAGAAGCAGG CGACCCCTAT CAGCAGGAGA TTGAGGAATG TACTGCATTA
ATTATGCAAA CCCTCAATCG ACCAAATCCT CACACTTTAG CCTATCAAAG TCGCGTCGGC
CCAGTTGAAT GGCTGCAACC CTATACCGAA GATGCGCTCA AAGAACTAGG CGCGCAAGGT
GTCAAAGATT TAGTTGTCGT ACCTATCAGT TTCGTCTCCG AACACATCGA GACACTACAA
GAAATTGATA TCGAGTATCG GGAAATAGCA GAAGAAGCCG GAATCCACAA TTTCCGTCGT
GTCGCTGCAC CTAATACCCA TCCGGTATTT ATTAGAGCTT TGGCGAATTT AGTAATTGAC
GCGCTCAACA AACCCAGCTT CAAGCTGTCG CAAGCAGCCC AAATCAAGAA AATGGTGAAA
ATGTATCCTC CTGAGAGTTG GGAATGGGGT ATGACTTCTA GTGCGGAAGT TTGGAATGGA
CGGATTGCGA TGTTAGGTTT TATTGCTCTC ATCATCGAGT TAGTGACAGG TCAAGGCCTA
CTGCATATGA TTGGGCTTTT GCAGTAG
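
The gene sequence above is printed in groups of 10 bases, so the spacing has to be stripped before computing length or GC content. The sketch below shows one way to do that in Python. The helper names are hypothetical, and the sample input is only the first line of the sequence, so its GC fraction is not expected to equal the whole-gene value reported in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGGGTCGTG TAGGCGTATT ATTACTCAAT CTCGGTGGTC CCGATAAGCT GGAGGATGTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Running the same two helpers on the full 1167 bp sequence would give the figure to compare against the reported GC content.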
 
Protein sequence
MGRVGVLLLN LGGPDKLEDV GPFLFNLFSD PEIIRLPFRW LQKPLAWFIA SRRTKTSQEN 
YKQIGGGSPL RRITEAQGEA LKEQLHDLGQ EANIYVGMRY WHPYTEEAIA LLTQDNLDNL
VILPLYPQFS ISTSGSSFRL LERLWQEDPK LQRLDYTVIP SWYKEPCYLQ AMAELISQEV
DQFPDPDQVH VFFSAHGVPK SYVEEAGDPY QQEIEECTAL IMQTLNRPNP HTLAYQSRVG
PVEWLQPYTE DALKELGAQG VKDLVVVPIS FVSEHIETLQ EIDIEYREIA EEAGIHNFRR
VAAPNTHPVF IRALANLVID ALNKPSFKLS QAAQIKKMVK MYPPESWEWG MTSSAEVWNG
RIAMLGFIAL IIELVTGQGL LHMIGLLQ
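
The protein sequence above corresponds to reading the gene sequence codon by codon under translation table 11. The following is a minimal, standard-library Python sketch of that translation. It assumes the sequence is given 5' to 3' in the coding frame, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that case does not arise here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TCAG); tables 1 and 11 share these
# codon assignments and differ only in which codons may act as starts.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
print(translate("ATGGGTCGTG TAGGCGTATT ATTACTCAAT CTCGGTGGTC CCGATAAGCT GGAGGATGTA"))
# → MGRVGVLLLNLGGPDKLEDV
```

The output matches the first 20 residues of the protein sequence listed above.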