Gene Caul_3872 details


Gene Information

Locus tag: Caul_3872
Symbol:
ID: 5901334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4191553
End bp: 4192965
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 69%
IMG OID: 641564394
Product: oxygen-independent coproporphyrinogen III oxidase
Protein accession: YP_001685496
Protein GI: 167647833
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00538] oxygen-independent coproporphyrinogen III oxidase
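
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon; both are assumptions about the record's conventions, not something the page states. Variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 4191553
end_bp = 4192965

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the final codon is the stop codon,
# which is assumed not to be counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1413
print(protein_length)  # 470
```

Under these assumptions the results agree with the reported Gene Length (1413 bp) and Protein Length (470 aa).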


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGAAC ATGCCGAGCC GATCACCACG ATCCCGCGCC AGGACGCGGT CCCGACCGCT 
CGCCCGTCCG CCGGGCTAAT CGCCAAGTAT GACGGCCGCG CGCCGCGCTA CACGAGCTAT
CCGCCGGCCA CTCAGTTCAC CGCCGAGGTC ACCGCCGAAA CCTATGGCGC ATGGCTGGGG
GCGCTGCCGA CCGACCAGGC CGTATCGCTC TATCTGCACA TCCCGTTCTG CGCCCGGCTC
TGCTGGTACT GCGGCTGCAA CACGCGGGCG GTGAACCGTC ACGAACCGGT CGGCGACTAT
GTGCGCCTGC TGCTGGACGA GATCGACCTG CTGGGCGCCG CCCTGCCCGC TCGCCTGACC
GCCGGCACCG TGCATTTCGG CGGCGGCACG CCCAACATGC TGTCGACGGA CGAGGTCGAT
GCGATCTTCG ACCGTCTGGG CAAGGTCTTC GACCTCGCTT CGGATATCGA CATCGCCATC
GAGATGGATC CGGCTGTGCT GACCCGCGAC TGGGTGCGCG CCGCGGGCGC CCGGGGCCTG
AGCCGCGCCA GCCTGGGAGT CCAGAACCTG ACGCCGGAGG TTCAGGCGGC GGTCAATCGC
CGCGACACCT TCGACCAAAT CGCCGACTGC GTCGCCTGGC TGCGCGAAGC CAAGGTCGCG
TCGATCAACC TCGACCTGAT GTACGGCCTG CCGCACCAGA CCACGGCCAA CACGCTCTCG
ACCCTGGATG AGCTCCTGAC CCTGCGCCCT GAGCGCCTGG CCCTGTTCGG CTACGCCCAT
GTGCCGTGGA TGAAGAGCCA CCAGCAGCTG ATCGACGAGG CGGCTCTGCC CGGACCCCAG
GCGCGCCTCG ACCAGAGCGA AAGCGCCGCC GAGCGATTGG CGGCCGAGGG CTATGTCCGC
ATCGGCCTCG ATCACTTCGC CTTGCCCGAG GATCCGATGG CCAAGGCCCT GGCGGCTGGA
ACCCTGCGCC GCAACTTCCA GGGCTACACC ACCGATCCGG CCGTCACCCT CCTGGGCATG
GGCGCCTCGG CGATCGGCGG CCTGCCGCAG GGCTTCGTCC AGAACGCCGC CCATGAGCTG
ACCTGGCGCG CGGCGGTCAA GGAGGGGCGC CTGCCGGTGG CGCGCGGCGT CGCGATCAGC
GACGAGGACC GTTTCCGGGC CGAGATCATC GAGCGGCTGA TGTGTGATTT CGCGGTCGAC
TTGGCGGCGG TCTGCGCGCG GCATCGACGG TCGCCCGCCG ACCTGGCCGG CGAGTTGGCG
CGGCTGGCGC CATTCGTGGC CGACGGCCTG GTTCGGGTCG ACGGCTTCAA CATCCAGGTC
CCGCCCGTGG GCCGGCTGCT GGTGCGGTCG ATCGCGGCGG TGTTCGACGC CTATTTCAAC
GCCGACAGCC AACGCCACGC CAAGGCGATC TGA
 
Protein sequence
MFEHAEPITT IPRQDAVPTA RPSAGLIAKY DGRAPRYTSY PPATQFTAEV TAETYGAWLG 
ALPTDQAVSL YLHIPFCARL CWYCGCNTRA VNRHEPVGDY VRLLLDEIDL LGAALPARLT
AGTVHFGGGT PNMLSTDEVD AIFDRLGKVF DLASDIDIAI EMDPAVLTRD WVRAAGARGL
SRASLGVQNL TPEVQAAVNR RDTFDQIADC VAWLREAKVA SINLDLMYGL PHQTTANTLS
TLDELLTLRP ERLALFGYAH VPWMKSHQQL IDEAALPGPQ ARLDQSESAA ERLAAEGYVR
IGLDHFALPE DPMAKALAAG TLRRNFQGYT TDPAVTLLGM GASAIGGLPQ GFVQNAAHEL
TWRAAVKEGR LPVARGVAIS DEDRFRAEII ERLMCDFAVD LAAVCARHRR SPADLAGELA
RLAPFVADGL VRVDGFNIQV PPVGRLLVRS IAAVFDAYFN ADSQRHAKAI
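
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon mapping is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in which codons may serve as starts). The sequence blocks are whitespace-separated in groups of 10, so whitespace is stripped first.

```python
# Standard codon mapping in NCBI "TCAG" order (same assignments as table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTTCGAAC ATGCCGAGCC GATCACCACG ATCCCGCGCC AGGACGCGGT CCCGACCGCT"
last_line = "GCCGACAGCC AACGCCACGC CAAGGCGATC TGA"

print(translate(first_line))  # MFEHAEPITTIPRQDAVPTA
print(translate(last_line))   # ADSQRHAKAI
```

The two outputs match the first 20 and last 10 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.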