Gene TM1040_2548 details


Gene Information

Locus tag: TM1040_2548
Symbol:
ID: 4076679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2689525
End bp: 2690880
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 60%
IMG OID: 638007872
Product: coproporphyrinogen III oxidase
Protein accession: YP_614542
Protein GI: 99082388
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00538] oxygen-independent coproporphyrinogen III oxidase
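
The length fields above can be cross-checked against the coordinates with simple arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon (the page states neither); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2689525
end_bp = 2690880

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1356 0 451
```

Under these assumptions the computed values agree with the listed gene length of 1356 bp and protein length of 451 aa.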


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.963821
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.259779
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACAGG AATCTCAATT GGCGCGGCTC GGACTTTTCG ACGCAAAGGT GCCCAGATAT 
ACAAGTTACC CGACTGCGCC ACACTTCAAC AATGACGTGA GCGCAGCGCG TTTTGCCTCC
TGGATCGGGT CTATCAAGCC CGGCGCAGAA ATCTCGCTAT ACGTGCATGT GCCCTTCTGT
CGCAGGCTGT GCTGGTTTTG CGCATGCCGC ACGCAGGGCA CGCAGTCCGA GTCACCTGTG
CGCGCCTACA TAGAGGTGTT GAAGCAAGAA CTTGCGCTTC TTGCGCGTGC CCTACCCGAA
GGCGTCCGCC TTGCGCGGCT GCATTGGGGC GGCGGGACGC CCACGCTCCT CAGTGCCGAA
ATGATCTCTG ATCTGGCGGA GGCGATCTTT GCCGTGACGC CGATGGCGAA GGGCGGCGAG
TTCTCCGTTG AGATCGACCC AAATGAAATC GACGATGCGC GCCTCGACGC CCTCGCGGCG
GCGGGGATGA ACCGTGCCTC GATCGGCGTT CAGGATTTTG ACCCCCAGAT CCAGGAAACC
ATCGGTCGCA TTCAGCCTTT TGATCTGACG CGCGACGCCG TCGACATGAT CCGCGCGCGG
GGCATCACAA GCCTCAATGC AGATATTCTC TTCGGGCTGC CGCATCAGAA CCGGATGCGC
ATGACCGAAA GCGTGCAAAA ACTGCTGTCG CTCTCGCCGG ATCGCGTGGC ACTCTATGGC
TATGCCCATG TGCCATGGAT GGCGCGGCGC CAGAATATGA TTCCAACCGA CAGCCTACCG
TCACCTCAAA CCCGACTACA GTTGTTTGAG ACCGCGCAGC GATTGTTTCA GTGGGATGGC
TATCGCGAAA TTGGTATCGA CCATTTTGCC ACGCCCCACG ATGGGCTGGC GGTTGCGGCC
CGGACGGGGC GGCTGCGCCG GAACTTTCAG GGTTACACCG ATGATCGGGC AGATGTGTTG
ATCGGCCTTG GGGCATCCTC TATCTCGCGT TTTCCGCAGG GCTATGCTCA GAATGCTCCA
TCCACATCGG CCTACACCAA GGCTATTCGT GACGGACAGT TTTCCACCGC GCGCGGCCAT
GTGTTTTCGG GCGAGGATTT GCTGCGTGGG CGCATGATCG AAGCCCTGAT GTGTGATTTC
GAGATTGCAA CCGACGATAT TCGGGCACAG TTCGACATCA CGCAAGACGC ATTGGAGCGC
ATGTATCGCG AGGCCTCCGT CGCCTTTCCG GAAATGCTCG ACGTCACCCC ATCGGGGCTG
CGGGTAAGAC CCGAAGGCAA GCCCCTGACG CGAATGGTGG CGCGCCACTT TGATGCCTAT
GACCTGAGCA AGGCCGGACA TAGCTCGGCG ATCTAG
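
The gene sequence is printed in blocks of ten bases separated by spaces, so whitespace has to be removed before any computation. The sketch below shows one way the GC content field could be recomputed from such text; `gc_percent` is a hypothetical helper, not part of the source database's tooling, and the page does not say how its own 60% figure was rounded.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring all whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First block of the gene sequence, used only as a short demonstration.
print(gc_percent("ATGACACAGG"))  # 50.0
```

Passing the full gene sequence, line breaks included, gives a value that can be compared with the listed GC content.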
 
Protein sequence
MTQESQLARL GLFDAKVPRY TSYPTAPHFN NDVSAARFAS WIGSIKPGAE ISLYVHVPFC 
RRLCWFCACR TQGTQSESPV RAYIEVLKQE LALLARALPE GVRLARLHWG GGTPTLLSAE
MISDLAEAIF AVTPMAKGGE FSVEIDPNEI DDARLDALAA AGMNRASIGV QDFDPQIQET
IGRIQPFDLT RDAVDMIRAR GITSLNADIL FGLPHQNRMR MTESVQKLLS LSPDRVALYG
YAHVPWMARR QNMIPTDSLP SPQTRLQLFE TAQRLFQWDG YREIGIDHFA TPHDGLAVAA
RTGRLRRNFQ GYTDDRADVL IGLGASSISR FPQGYAQNAP STSAYTKAIR DGQFSTARGH
VFSGEDLLRG RMIEALMCDF EIATDDIRAQ FDITQDALER MYREASVAFP EMLDVTPSGL
RVRPEGKPLT RMVARHFDAY DLSKAGHSSA I
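
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch, assuming the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). `translate` and `CODON_TABLE` are illustrative names; the sketch does not handle alternative start codons, which is harmless here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the coding sequence
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGACACAGG AATCTCAATT G"))  # MTQESQL
```

The output matches the first seven residues of the protein sequence; the same function can be applied to the full gene sequence for a complete comparison.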