Gene Rpic_4534 details


Gene Information

Locus tag: Rpic_4534
Symbol:
ID: 6285987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ralstonia pickettii 12J
Kingdom: Bacteria
Replicon accession: NC_010678
Strand:
Start bp: 858510
End bp: 859856
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 67%
IMG OID: 642619015
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_001893051
Protein GI: 187926706
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
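
The coordinate and length fields above can be cross-checked with a little arithmetic. What follows is a minimal sketch in Python, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the record states neither convention. The function names are illustrative.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with one uncounted stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(858510, 859856)
print(length)                     # 1347
print(protein_length_aa(length))  # 448
```

Under those assumptions the computed values match the reported gene length (1347 bp) and protein length (448 aa).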


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.233345
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.576048
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATGC TCGCCCCGAC GGCCAAGAAC GCATTCACCC CCGCGTCGCT GGACCGGCCC 
GCTTACCAGA GCGGCTTCGG CAACGAGTTC TCCACCGAAG CCCTGCCCGG TGCGCTGCCG
CACGGCCAGA ACTCCCCGCA GAAGGCGCCG TATGGCCTGT ATGCGGAACA GATTTCCGGC
ACGGCGTTCA CCGCCCCGCG CGCGCACAAC CGCCGCTCGT GGCTGTATCG CATTCGCCCC
GGCGCCGTGC ATCTGCCGTT CGAGGCCATG GCGCAGGGCC GCTTCCACAG CCACTTCAAC
GAGGTGCCGC CCTCGCCCAA CCAGTTGCGC TGGGACCCGC TGCCCGCCCC GGCGGCCGGC
ACCGATTTCA TCGACGGCAT CGTCACCTTC GCCGGCAACG GCGGGCCCGA CGCGCAAACC
GGCTGCGGCA TCCACCTGTA CGCCGCCAAC GCGGACATGA CCGACCGCTT CTTCTACAAC
GCCGACGGCG AACTGCTGAT CGTCCCGCAG CAAGGCCGCC TGCGCCTGCT GACGGAGATG
GGCGTGGTTG ATGTCGAGCC GCTGGAGATT GCCGTTATTC CGCGCGGCGT ACGCTTCCGC
GTCGAGTTGC CGGACGGCGA CGCACGCGGC TACATCTGCG AGAACTTCGG CGCGCTCTTC
CGCTTGCCGG ACCTGGGCGT CATCGGCTCG AACGGCCTGG CCAACCCGCG CGACTTCCTC
ACACCGCACG CGTGGTACGA AGACCGCGAG GGCGCCTTCG AACTCGTCGC AAAATTCCAG
GGCAGCCTGT GGACCGCGAA GATCGGCCAC TCGCCGCTAG ACGTTGTGGC GTGGCACGGC
AACCTCGCGC CGTACAAGTA CGACCTGCGC CTGTTCAACA CCATCGGCTC GATCAGCTAC
GACCATCCGG ACCCCTCGAT CTTCCTGGTG CTGCAAAGCC CGTCCGCCAC GCCGGGTGTG
GACACGATCG ACTTCGTGAT CTTCCCGCCG CGCTGGCTGG CCGCCGAAAA CACGTTCCGC
CCGCCCTGGT TCCACCGCAA CGTCGCCAGC GAATTCATGG GCCTGATCCA GGGCGTGTAC
GACGCCAAGG CCGAAGGCTT CGTGCCCGGC GGCGCCAGCC TGCACAACTG CATGAGCGGC
CACGGCCCCG ACGCCGACAC CTTCGAGAAA GCCAGCAACA GCGACACCAC CAAGCCGCAC
AAGGTCGACG CCACGATGGC CTTCATGTTC GAAACGCCAG CGGTGATCCG CCCCACGCGC
TTCGCTGCGG AATCGGCGCA ACTTCAAGCC AAGTACTTCG AATGCTGGCA AGGCCTGAAG
AAACATTTCG ACCCGAGCAA GCGCTAA
 
Protein sequence
MNMLAPTAKN AFTPASLDRP AYQSGFGNEF STEALPGALP HGQNSPQKAP YGLYAEQISG 
TAFTAPRAHN RRSWLYRIRP GAVHLPFEAM AQGRFHSHFN EVPPSPNQLR WDPLPAPAAG
TDFIDGIVTF AGNGGPDAQT GCGIHLYAAN ADMTDRFFYN ADGELLIVPQ QGRLRLLTEM
GVVDVEPLEI AVIPRGVRFR VELPDGDARG YICENFGALF RLPDLGVIGS NGLANPRDFL
TPHAWYEDRE GAFELVAKFQ GSLWTAKIGH SPLDVVAWHG NLAPYKYDLR LFNTIGSISY
DHPDPSIFLV LQSPSATPGV DTIDFVIFPP RWLAAENTFR PPWFHRNVAS EFMGLIQGVY
DAKAEGFVPG GASLHNCMSG HGPDADTFEK ASNSDTTKPH KVDATMAFMF ETPAVIRPTR
FAAESAQLQA KYFECWQGLK KHFDPSKR
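
The sequences above are printed in blocks of 10 characters, 6 blocks per line, so they need to be joined before any downstream use. The following is a minimal sketch in Python of one way to do that and to compute a GC fraction. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the GC formula, (G + C) / length, is an assumption about how the reported figure was derived.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so blocked sequence lines become one string."""
    return "".join(text.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGAACATGC TCGCCCCGAC GGCCAAGAAC GCATTCACCC CCGCGTCGCT GGACCGGCCC "
seq = join_blocks(first_line)
print(len(seq))  # 60
print(seq[:3])   # ATG
```

Applied to the full listings, the joined gene sequence should come to 1347 characters and the joined protein sequence to 448, in line with the lengths reported under Gene Information.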