Gene Arth_2239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2239 
Symbol 
ID4445161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2518032 
End bp2519261 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content68% 
IMG OID639690048 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_831719 
Protein GI116670786 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.167009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAGCG TCCTTCCCCT GGGCGACCCC GCGCCGTCGG ACGGCCTGCT GCCCGCGCAG 
GCCGCTTACG GCGCTGCGCA GCGGCGATTC GGGCTCTACG TCCACATTCC CTTCTGTGCC
GTCCGTTGCG GTTATTGCGA CTTCAACACC TACACCGCGA CAGAGCTAGG CGGCGGCGCG
TCCCAGGACG CGTACGCCTC CACTGCCATC GCGGAAGTGG AGTTTGCCGC CAAGGCCCTG
CAAGGCAGCG GTCTGCCGGA ACGCCGGCTG GGCACGGTCT TTTTTGGCGG CGGCACACCA
ACTCTGCTGC CCGCGGAAGA CCTGGCCCGC ATCCTCACCG CGGCGGTATC GCAATGGGGC
CTGGAACCGG GTGCCGAGGT CACTACCGAA GCCAACCCGG ATTCGGTCAC ACCGGAGTCG
CTGCAGCTCC TTGCGGATGC CGGCTTCACC CGTGTTTCCT TCGGAATGCA GTCGGCTGTT
CCGCACGTCC TGAAGGTCCT CGACCGCACC CATACGCCCA GCCGGGTGCC GCAGGTGGTG
CAGTGGGCCA GGGATGCCGG ACTCGCCGTC AGCCTCGACC TAATCTACGG AACACCGGGG
GAGTCGCTGG AGGACTGGCG GTACTCGCTC GAGACAGCCC TCTCGTACGG CCCTGACCAC
ATCAGCGCCT ACGCCCTGAT CGTGGAGGAC GGCACCAAGC TGGCCGCCCA GATCCGTCGC
GGCGAAGTGC CGGGGATCGA CGACGACGAC CACGCGGCCA AGTATGAACT CGCCGATGAA
CTGATCACCG CCGCAGGCCT TGGCTGGTAC GAGGTCAGCA ACTGGTCACG CACACCTGAG
CAGGCGTGCC GGCACAACCT GGCCTACTGG CGCGGCGATG ACTGGTGGGG GATCGGCCCG
GGCGCGCATT CGCACGTCGG CGGAGTCCGC TGGTGGAACG TGAAGCACCC CACGGCCTAC
GCCGGGAGGC TTGCCGGCGG TGTGTCACCG GCGGCGGGCC GGGAAACGCT CGACGCCGAA
ACCCGCAACG TTGAGCGGGT CATGCTTGAG GCCAGGCTGA GCACCGGACT CGAGGTATCC
GGACTGGGAG TGTCCGGGCG GCAGGCCGTC GCCGGACTGA TCGCAGACGG CCTGGTGGAT
CCCGCCGCGG CATTTCGGGG CAGGCTCGTG CTCACCCTGA AAGGCAGGCT GCTCGCGGAC
GCCGTAGTCA GAAGGATCCT GCCCGACTAA
 
Protein sequence
MPSVLPLGDP APSDGLLPAQ AAYGAAQRRF GLYVHIPFCA VRCGYCDFNT YTATELGGGA 
SQDAYASTAI AEVEFAAKAL QGSGLPERRL GTVFFGGGTP TLLPAEDLAR ILTAAVSQWG
LEPGAEVTTE ANPDSVTPES LQLLADAGFT RVSFGMQSAV PHVLKVLDRT HTPSRVPQVV
QWARDAGLAV SLDLIYGTPG ESLEDWRYSL ETALSYGPDH ISAYALIVED GTKLAAQIRR
GEVPGIDDDD HAAKYELADE LITAAGLGWY EVSNWSRTPE QACRHNLAYW RGDDWWGIGP
GAHSHVGGVR WWNVKHPTAY AGRLAGGVSP AAGRETLDAE TRNVERVMLE ARLSTGLEVS
GLGVSGRQAV AGLIADGLVD PAAAFRGRLV LTLKGRLLAD AVVRRILPD