Gene Haur_4659 details

Gene Information

Locus tag: Haur_4659
Symbol:
ID: 5736506
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5955196
End bp: 5956626
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 53%
IMG OID: 641281823
Product: protoporphyrinogen oxidase
Protein accession: YP_001547418
Protein GI: 159901171
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1232] Protoporphyrinogen oxidase
TIGRFAM ID: [TIGR00562] protoporphyrinogen oxidase
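
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are inclusive, 1-based coordinates and that the unspliced CDS ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 5955196
END_BP = 5956626


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1431 476"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.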


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000642175
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGTTG CTGATCAGCC ACGCATTGCC ATTATTGGCG GTGGTATCGC TGGCCTTAGT 
ACAGCATGGT ATTTACAACA ACAAGGTCTC ACGAATATTC AGCTTTTTGA ACGTGATCAA
CGGCTGGGCG GCAAATTGCG CACCAGCCAT GTTGCCTTGC CCGATGGCGC TGGCGAATTG
CTGGTCGAAG CTGGCCCCGA TGCCTTTATC AGCCAAAAGC CTTGGGGCTT GCAATTGGCG
CGTGAGTTGG GCTTGGAAGA TCAGTTAATT TCAACTGAGC CAGCTCGCCA TAAGGTGTTT
GTGTTGCATC GTGGCAAGCC CGAACCCTTG CCTGATGGCA TTAACTTGGT TGTCCCAACT
GAGTTTTGGC CGTTGCTGCG CACGCCGATT CTCTCGCTGC CAGGCAAATT GCGTATGTTG
CTCGATTTGG TCTTGCCTGC CCGCCAAAGT AATACTGATG AATCGCTGGC CGATTTTGTG
CGCCGCCGAT TTGGGGCCGA AGCGCTGGAT AAATTGGCCG AGCCGTTGAT GGCAGGCATT
CACAATGCAG AATCGGATCG CCAAAGCCTC GAAGCCACCT TTCCACGCTT TATCGAGGCC
GAACGCAGCC ATGGCAGTGT GATTCGTGGG ATTCTGGCGG CTAAACTTAA AGCAGGCAAG
CCCAAAGGTC AGCAACTTAG CCCATTTATT AGCTTACGCG GCGGGATCGA GCAATTAATT
ACCGCGCTGG TTGAGCAGCT CAACGTTGAA ATTCGGACAA ATTGTGGGGT TCAAGCGCTG
CGCTACGACC CAACCAACGC CTCAGCCTAT CAACTGACCC TCGATGATGG CACGAAGATT
GATGCTGATG CAGTGGTGTT GGCAGTGCCT AGTTTTGTGG CCGCTGAGTT GGTCGCACCT
TGGGCTGAAG CCTTGGCCGA GCGCTTGAAG GCGATTCGTT ATGTCAGCAC TGGCACAGTT
TCGTTGGCAT TTCGGCGTAG CGAAACCAAC ATGGCCTTCG ATAGTTATGG CTTGGTGATT
CCGCGCAGCG AATATCGGCT GATTAATGCT GTAACGATCA ACTCACGCAA ATTTGCTGGG
CGTGCTCCCG CCGATTATAT GCTGTTGCGG GCCTTTGTGG GCGGCTCGAA ACATCCCGAA
GTGCTGCGCT TGGATGATCA GGCATTAACT CAATTGGTGC GTGATCAGCT TAAATCGATT
TTTGGCCTGA CCGCCGAGCC AATTTGGAGC GGGGTTGCCC GTTGGAACGA GGCTAATCCC
CAATACGATG TTGGTCATTT CCAACGTATG GATCAGCTTG AGGCCTTGTG TCCAGAAGGC
TTGTTGTTGT GTGGCAGCGG CTTTCGGGGC GTGGGCATTC CCGATTGTGT GCGCCAAGGC
CAAGCAACCG CTCAGGCCAT TAGCCAATTG TTCGCTTTGG CTAACGCTTA A
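
The record does not state how its GC content figure is derived. As a hedged illustration, a figure of this kind could be computed from a grouped sequence like the one above as follows. The sketch assumes the spaces and line breaks are formatting only, and the function name `gc_fraction` is hypothetical. The example input is just the first two 10-base groups of the gene sequence, not the full gene.

```python
def gc_fraction(grouped_sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for grouping."""
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base groups of the gene sequence, for illustration only.
fragment = "ATGGATGTTG CTGATCAGCC"
print(gc_fraction(fragment))  # prints 0.5
```

Passing the full gene sequence text to the same function would give the fraction for the whole gene.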
 
Protein sequence
MDVADQPRIA IIGGGIAGLS TAWYLQQQGL TNIQLFERDQ RLGGKLRTSH VALPDGAGEL 
LVEAGPDAFI SQKPWGLQLA RELGLEDQLI STEPARHKVF VLHRGKPEPL PDGINLVVPT
EFWPLLRTPI LSLPGKLRML LDLVLPARQS NTDESLADFV RRRFGAEALD KLAEPLMAGI
HNAESDRQSL EATFPRFIEA ERSHGSVIRG ILAAKLKAGK PKGQQLSPFI SLRGGIEQLI
TALVEQLNVE IRTNCGVQAL RYDPTNASAY QLTLDDGTKI DADAVVLAVP SFVAAELVAP
WAEALAERLK AIRYVSTGTV SLAFRRSETN MAFDSYGLVI PRSEYRLINA VTINSRKFAG
RAPADYMLLR AFVGGSKHPE VLRLDDQALT QLVRDQLKSI FGLTAEPIWS GVARWNEANP
QYDVGHFQRM DQLEALCPEG LLLCGSGFRG VGIPDCVRQG QATAQAISQL FALANA