Gene YpAngola_A0143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0143 
Symbol 
ID5798607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp141515 
End bp142645 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content49% 
IMG OID641338166 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_001604773 
Protein GI162420837 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.000483536 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTAAAT TACCCCCGCT CAGTCTCTAC ATCCATATCC CTTGGTGCGT CCAGAAATGC 
CCTTATTGTG ATTTCAACTC ACATGCGTTG AAAGGCGATG TCCCTCATCA GGAATATGTA
GAGCACGTAC TGGCGGATTT GGATGCTGAC GTGCCTCTGG TGAGTGGCCG TGAAATCAGC
ACTATTTTTA TTGGTGGCGG CACACCCAGC TTGCTGAGTG CAGAGGCCAT GCAGCAACTA
CTTGATGGTG TCCGCGCGCG TTTGCCGGTT GCCAGTGACG CAGAGATCAC CATGGAAGCC
AACCCCGGAG CGGTCGAAGC CGATCGCTTT AGCGGCTATC AGCGAGCGGG TATTAACCGC
ATATCCATTG GCGTCCAGAG CTTTAGCGCA CAGAAATTAA CGCGGCTGGG GCGGATACAT
GGGCCAGATG AAGCAAGGCG GGCAGCAGAG CTGGCAACCT CACTGCAATT ACGTAGCTTT
AATCTAGATT TGATGCACGG TCTGCCTGAT CAAACACTGG AAGAAGCATT GGATGATTTG
CGTCAGGCTA TCGCCTTAAA TCCACCACAC TTATCTTGGT ATCAGCTGAC TATTGAGCCG
AATACTGGCT TTAGTTCCCG GCCCCCAATC CTGCCCGATG ACGATGCGTT ATGGGATATT
TTCCAGCAAG GTCATCAGCT ACTCAGTGCC GCTGGTTATC TGCAATATGA AACCTCGGCC
TATGCCAAGC CAGGTTACCA ATGCCAGCAT AATCTTAACT ACTGGCGCTT TGGTGATTAT
CTGGGGATTG GCTGTGGGGC GCATGGCAAG ATCACATTTA ACGATGGGCG TATTTTACGG
ACCATAAAAA CCAAACATCC ACGTGGTTTT ATGCAGGGAA AATACTTGGA TAAACAATAT
GAAGTGGCAG CAGTCGACCG GCCTTTTGAG TTCTTTATGA ACCGCTTCCG TTTGTTGGAA
GCTGCCCCTC GGGCTGATTT CTATCACTTC ACCGGGTTAA CTGAGAGTAC TATTCGCCCA
CAGTTAGATG AGGCAATAGC AAAAGAGTAT TTAATCGAAA CAACAGAATA CTGGCAGATT
ACCGAAAAAG GAAAATTGTT CCTTAACTCG CTACTGGAGC TATTTTTATA G
 
Protein sequence
MLKLPPLSLY IHIPWCVQKC PYCDFNSHAL KGDVPHQEYV EHVLADLDAD VPLVSGREIS 
TIFIGGGTPS LLSAEAMQQL LDGVRARLPV ASDAEITMEA NPGAVEADRF SGYQRAGINR
ISIGVQSFSA QKLTRLGRIH GPDEARRAAE LATSLQLRSF NLDLMHGLPD QTLEEALDDL
RQAIALNPPH LSWYQLTIEP NTGFSSRPPI LPDDDALWDI FQQGHQLLSA AGYLQYETSA
YAKPGYQCQH NLNYWRFGDY LGIGCGAHGK ITFNDGRILR TIKTKHPRGF MQGKYLDKQY
EVAAVDRPFE FFMNRFRLLE AAPRADFYHF TGLTESTIRP QLDEAIAKEY LIETTEYWQI
TEKGKLFLNS LLELFL