Gene Apar_0799 details


Gene Information

Locus tag: Apar_0799
Symbol:
ID: 8413664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 880218
End bp: 881465
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 43%
IMG OID: 645022381
Product: oxygen-independent coproporphyrinogen III oxidase
Protein accession: YP_003179819
Protein GI: 257784602
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase
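
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `coded_residues` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 880218
end_bp = 881465
gene_length_bp = 1248
protein_length_aa = 415


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coded_residues(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = coded_residues(computed_gene_length)

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under those assumptions, both comparisons agree with the Gene Length and Protein Length values listed in the record.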


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTCCT GGCAGTTTAA ATATAATCAA GCGGCTGTTT CTGCACTTTA CCTGCATATC 
CCATTTTGCT CGCAAAAATG TTTCTATTGC GATTTTTCTT CTTGGTCTAC AAGACAAGAT
GACAGTCGTA TGAAAAAGTA TGTAAATGCT TTAAAACATC AGTTAGACGA AGCTGCTCAA
CTGGGTATAC TCGCAACTAC AAAAACAGTT TATATGGGCG GTGGAACTCC CAGCTTACTT
GATCAGGGTG CGGTTGATCT GGCGCATCAT ACCTCATCTA TTACACATCC TATTGAATTT
AGTATGGAAG CAAATCCAGA CTCACTATCT GACGAACTTC TCGCTAGTCT TTCTGCAGGA
GGAGTAACGA GAATTTCTTT GGGAGTTCAG AGCTTTAATG ATAATGAGCT TAAGGAGCTT
GGTAGAATTC ATTCAGCTGA TCTGGCATAC GATAGAGTTT TAGCTGCAAA AGAGAGCGGC
TACGAAGTGT CGGTTGATCT CATGTGCGCT ATTCCTGAGC AAACAGAGAG TTCTTGGGAA
TATACGCTTT CAAGGTTTAT CTCGCTTGGG GTAAATCATG TGAGTGTTTA TCCACTTACC
ATTGAAGATG GCACGGCACT AGCTAAGCAA ACCCAAGATA AAGACATTCC ATGGAATGTT
TATGACGTGC AGGCAGATCG AATGCAAACG GCTTCAAAGA TGCTTCAAGC AGCAGGATTT
GAGCGTTACG AGGTGGCAAG TTATGCTCGT AATCAGAAAA GTTGCAAGCA TAATAAGATG
TACTGGACAG GTGAGTCGTA TCTTGGTCTA GGTACTAGTG CTGCAAGTAT GTTGACAGCT
TTTGAGTATG ATGCTCTGGC AAAGGAAAAC GCTTTTTTGC CTTCGAGACC ACAAGATGCT
ATCCGTGTGC GACTTGTGGT GCTTGATTCT CCAAAAAAAA TTGCTGAAGG CATATCGCTT
TTCTCGACAG AGTTTGACGT TGAATTTTTG ACCTACAGAG AAGCTGTGGC AGAAGATTTG
ATGCTCCATG CACGCCTCAC AGAGCTAATT GCGCCTGCGC TTTTGGATGA GTCTGAGCAG
GTATTTGGTG CATTAACTTT ACAAGAAGTG TTTGATGCCT GTGTACAAGA TGAGTTACTA
GAATGCGTTG ATGCAGCAGA TTCTGAGATT AAGGCTTCAT ATAGGCCTAC CAAGAAGGGC
TGGCTGCTTG GAAACGAGCT TTATGGCCGT TTTTGGGAGT TAAGATAA
 
Protein sequence
MNSWQFKYNQ AAVSALYLHI PFCSQKCFYC DFSSWSTRQD DSRMKKYVNA LKHQLDEAAQ 
LGILATTKTV YMGGGTPSLL DQGAVDLAHH TSSITHPIEF SMEANPDSLS DELLASLSAG
GVTRISLGVQ SFNDNELKEL GRIHSADLAY DRVLAAKESG YEVSVDLMCA IPEQTESSWE
YTLSRFISLG VNHVSVYPLT IEDGTALAKQ TQDKDIPWNV YDVQADRMQT ASKMLQAAGF
ERYEVASYAR NQKSCKHNKM YWTGESYLGL GTSAASMLTA FEYDALAKEN AFLPSRPQDA
IRVRLVVLDS PKKIAEGISL FSTEFDVEFL TYREAVAEDL MLHARLTELI APALLDESEQ
VFGALTLQEV FDACVQDELL ECVDAADSEI KASYRPTKKG WLLGNELYGR FWELR
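
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first row of the gene sequence (60 bases) and comparing it with the first 20 residues of the protein. This is a minimal sketch, not the database's own pipeline. It assumes the sequence is given in the coding orientation, and it uses the standard codon assignments, which translation table 11 shares for elongation codons. The `translate` helper is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments for elongation codons ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence, copied from this record.
first_row = (
    "ATGAATTCCT GGCAGTTTAA ATATAATCAA "
    "GCGGCTGTTT CTGCACTTTA CCTGCATATC"
)
protein = translate(first_row)
print(protein)  # MNSWQFKYNQAAVSALYLHI
```

The printed residues match the first two 10-residue groups of the protein sequence above. The same function could be applied to the full gene sequence after pasting all rows into one string.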