Gene Haur_1001 details

Gene Information

Locus tag: Haur_1001
Symbol:
ID: 5732904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1145734
End bp: 1147008
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 51%
IMG OID: 641278135
Product: putative oxygen-independent coproporphyrinogen III oxidase
Protein accession: YP_001543777
Protein GI: 159897530
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase
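
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1145734, 1147008)
    print(length)                  # 1275
    print(protein_length(length))  # 424
```

Under these assumptions the coordinates reproduce the listed 1275 bp and 424 aa.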


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACAC ATGCAATCAA ACATCTTTAC GTTCATATAC CATTTTGCCA AACCCGTTGT 
GCCTATTGCG ATTTCAATAC CTTTGCCAAT CGCGAAGATT TTATGCAGCG CTATATTGAT
GCGCTGTGCC TGCATCTCAA GCGCATGGCG AGCGGCGAGA CAATCGCTGA CCCAACATGG
CCGCAGGTCG CGGATGGGCC GATTCCGTGG GCAAGCTATC AATTACACGA TCTTGTTGGG
CCGTTACAAC AGGCCGATTT GCCCGCGACG GTGTTTCTGG GCGGCGGCAC GCCAACTGCG
CTACCATTGC ACTTACTTGA ACAGTTGATG CAAACAATCG GCCAAATTAT TCCGCTGGCG
CAGGCCGAAG TAACCAGCGA AGCCAACCCA GGCACGGTGC TCGACCACAA TTATCTACGG
GCAATGCGGT CGATGGGGAT TAATCGCCTA AGTATGGGCG TGCAAACCTT GCATGACCCA
ACCTTGCGGG TGTTGGGGCG GATTCATACC GCCAGCGAGG CCTATGCCTC GTATCAAGCA
GCTCGTAAAG TTGGCTTCGA AAATATCAAT CTTGATTTTA TGTTTGGCTT ACCAGGCCAA
GATACTGCTC AATGGCGAGC GATGCTCAAC GAAATTGTAG GTTGGGATGC TGAGCATTTT
GCGCTGTATT CATTAATTGT CGAGCCAAGC ACACCGCTAG CCGCCCAAGT AACTGCTGGT
CGGGTCAGCA TTCCCAACGA CGATGCGACT GGCGAAATGT ATGAAGCTGC GATGGAAATT
TTAGGGGCGG CTGGTTATGG CCATTATGAG ATTTCCAATT GGGCCAAAAC GCAGAACTCA
GCGTTTAACC CAAGCGAGCG TTTGCCGGCC TATGCATCGC GCCACAATGT GGCCTATTGG
CTCAACGCCG ATTATTTGGG AGTTGGGGCT GGTGCACATA GCCATTGGCG GGGCTGGCGC
TGGGTCGATC AACGGATCTT AGAGCCTTTT GCTCAGCAGG TTGAACATGG TCAAGCACCG
TTAATCGATA TCCAAGTATG CGAGCCACAA GACCGCGATT TTGAAACAAT TATGATGGGA
TTGCGGCTCA ATTGTGGTTT AGGCTTTGCC CACTTTCAAC AACGGACAGG CCACGATTTG
CTTGCGCAGT ATCAGCCGAT AATCGAACAA TTGGTTGGGC AAGGCTTGCT CGAACAAACC
ACCAACGCCA TTCGGTTTAC CCCCCGTGGC CGAATGGTTG GCAATCAAGT GATTGAACGT
TTTTTGCTTG ATTAA
 
Protein sequence
MTTHAIKHLY VHIPFCQTRC AYCDFNTFAN REDFMQRYID ALCLHLKRMA SGETIADPTW 
PQVADGPIPW ASYQLHDLVG PLQQADLPAT VFLGGGTPTA LPLHLLEQLM QTIGQIIPLA
QAEVTSEANP GTVLDHNYLR AMRSMGINRL SMGVQTLHDP TLRVLGRIHT ASEAYASYQA
ARKVGFENIN LDFMFGLPGQ DTAQWRAMLN EIVGWDAEHF ALYSLIVEPS TPLAAQVTAG
RVSIPNDDAT GEMYEAAMEI LGAAGYGHYE ISNWAKTQNS AFNPSERLPA YASRHNVAYW
LNADYLGVGA GAHSHWRGWR WVDQRILEPF AQQVEHGQAP LIDIQVCEPQ DRDFETIMMG
LRLNCGLGFA HFQQRTGHDL LAQYQPIIEQ LVGQGLLEQT TNAIRFTPRG RMVGNQVIER
FLLD
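
The gene and protein sequences above are printed in blocks of ten characters, so the spacing has to be stripped before any analysis. The sketch below shows one way to do that, compute a GC percentage, and translate the coding sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares (the tables differ in which start codons they permit); the helper names `clean_sequence`, `gc_percent` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order. The amino-acid assignments are those of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence as printed in the record.
    first_blocks = "ATGACAACAC ATGCAATCAA ACATCTTTAC"
    print(translate(first_blocks))  # MTTHAIKHLY
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare against the listed protein sequence and GC content.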