Gene Avin_41370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_41370 
SymbolhemH 
ID7763020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4171560 
End bp4172576 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content70% 
IMG OID643806994 
Productferrochelatase 
Protein accessionYP_002801245 
Protein GI226946172 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.507238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGATC ACGCGCTGTT GCTGGTCAAC CTGGGTTCGC CCGACTCCCC CGAGGTGGCC 
GATGTGCGCC GCTACCTCGA CCAGTTCCTC ATGGACCCCT ACGTGATCGA TCTGCCCTGG
CCGCTGCGCC GCCTGCTGGT CTCGCTGATC CTGCGCAAGC GGCCGGAGCA GTCGGCGCAT
GCCTATGCCT CGATCTGGTG GCCGGAAGGC TCGCCGCTGA TCGCCCTCAG TCGCCGCCTG
CAGGAGGCGG TGCAGGCCCA CTGGCACGAG GGGCCGGTGG AGCTGGCGAT GCGCTATGGC
AACCTGTCCA TCGAGGCGGC GCTGAACCGG CTGGCGGAGG AGGGCGTGCG CCGGGTGACC
CTGGCGCCGC TCTATCCGCA GTTCGCCGAC AGCACCGTGA CCACCGTGGT GGAGGAGACC
CGTCGGGTGC TGCGCGCCAG CGGCCTGACG CTGGAGCTGA GGGTGCTGGA GCCCTTCTTC
GCGCGGCCGG AGTATCTCGA CGCCCTGGCG CGGAGCGCCG GACCCCATCT GCAGCAGGGG
TTCGACCATC TGCTGCTGAG TTTCCACGGC CTGCCGGAAC GCCATCTGCG CAAGGCCGAT
CCGAGCGGCC GGCACTGTCT GGGCAGCGCC GACTGCTGCC GCGAGGCGCC GGTCGAGGTG
CTGGCGCGCT GCTATCGGGC GCAGTGCCTG CGGAGCGCCG AAGGTTTCGC CCGGCGGATG
GGGCTGGATG AGGGGCGTTG GTCGGTGTCC TTCCAGTCGC GCCTGGGGCG TGCCCGCTGG
ATTTCGCCCT ATACCGAGGA GCAGCTCGAC GCGCTGGCCG CGCGCGGGGT GAAGCGGCTC
CTGGTGATGT GCCCGGCCTT CGTCACCGAC TGCATCGAGA CCCTGGAAGA GATCGGCCAG
CGCGGCCGCG AGCAGTTCCA GGCGGCCGGC GGCGAGGAAC TGATCCTGGT GCCTTGCCTC
AACGACCATC CGGCGTGGGC GTCCGCACTG GCGCAACTGT GTCGTACGCC GGGTTAG
 
Protein sequence
MTDHALLLVN LGSPDSPEVA DVRRYLDQFL MDPYVIDLPW PLRRLLVSLI LRKRPEQSAH 
AYASIWWPEG SPLIALSRRL QEAVQAHWHE GPVELAMRYG NLSIEAALNR LAEEGVRRVT
LAPLYPQFAD STVTTVVEET RRVLRASGLT LELRVLEPFF ARPEYLDALA RSAGPHLQQG
FDHLLLSFHG LPERHLRKAD PSGRHCLGSA DCCREAPVEV LARCYRAQCL RSAEGFARRM
GLDEGRWSVS FQSRLGRARW ISPYTEEQLD ALAARGVKRL LVMCPAFVTD CIETLEEIGQ
RGREQFQAAG GEELILVPCL NDHPAWASAL AQLCRTPG