Gene Information |
Locus tag | Rpal_5153 |
Symbol | |
ID | 6412853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5551329 |
End bp | 5552675 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642715043 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_001994116 |
Protein GI | 192293511 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
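The length fields above are consistent with the coordinates if Start bp and End bp are read as inclusive, 1-based positions and the CDS ends in a single stop codon. Both are assumptions about this record's conventions, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5551329, 5552675)
print(length)                     # 1347
print(protein_length_aa(length))  # 448
```

The strand (-) does not change the length calculation; it only determines which strand of the replicon is read when extracting the sequence.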
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.424959 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATCA ATGCCGCTCC GCAGATCCTC GGCCGCAGCT CGCAGGACAT CACGCCCGGC TACATGTCCG GCTTCGGCAA TTCGTTCGAG ACCGAAGCGC TGCCCGGCGC ACTGCCGGTG GGGCGCAACT CGCCGCAGCG CTGCGCCTAC GGCCTCTATG CCGAGCAATT GTCCGGCTCG CCGTTCACGG CGCCGCGCGG TGCCAACGAG CGGAGCTGGC TGTATCGCAT TCGTCCCTCG GTGAAGCACT CCGGCCGCTT CGCCAAGACC GACATGGGGC TGTGGCGCTC GGCGCCGTGC TTCGAGCACG ATCTGCCGAT CGCGCAACTG CGCTGGGATC CGCCGCCGAT GCCGCAGGAG AAGCTGACCT TCCTGCAAGG CGTGCGGACG ATGACGACGG CCGGCGACGT CAACACCCAG GCCGGGATGG CGACGCATCT GTATCTGATC ACCCAGTCGA TGGTGGATCA GCACTTCTAC AATGCCGACG GCGAGATGAT GTTCGTGCCG CAGCAGGGCA GCCTGCGCCT CGTCACCGAG TTCGGCATCA TCACCATCGA GCCGGCCGAG ATCGCGGTGA TCCCGCGCGG CATCAAGTTT CGGGTCGAAC TGGTCGACGG CCCGGCGCGC GGCTATCTGT GCGAAAACTA CGGCGGCGCC TTCACGCTGC CGGAGCGCGG CCCGATCGGC GCCAACTGCC TCGCCAATTC GCGCGACTTC CTCACCCCGG TCGCCGCCTA CGAGGACAAG GACACGCCGA CCGAGCTTTA TGTGAAGTGG GGCGGCTCGC TGTACGTGAC CAAGCTGCCG CACTCTCCGA TCGACGTCGT TGCCTGGCAC GGCAACTACG CGCCGTACAA ATACGACCTG CGCACCTATT CGCCGGTCGG CGCGATCGGC TTCGATCATC CCGATCCGTC GATCTTCACC GTGCTGACCT CGCCGTCGGA GACGCCCGGC ACCGCCAATA TCGACTTCGT GATCTTCCCC GAGCGCTGGA TGGTGGCGGA CAACACGTTC CGGCCGCCGT GGTATCACAT GAACATCATG TCGGAGTTCA TGGGCCTGAT CTACGGCGTG TACGACGCCA AGCCGCAGGG CTTTGTGCCG GGCGGCGCCT CGCTGCACAA CATGATGCTG CCGCACGGCC CCGATCGCGA AGCATTCGAT CATGCCAGCA ACGGCGAGCT GAAGCCGGTG AAGCTGACCG GCACGATGGC GTTCATGTTC GAGACCCGCT ACCCGCAGCG TGTCACCGAA TATGCCGCGA GCTCCGGCCT GTTGCAGGAC GATTACGCGG ACTGCTGGAA CGGCCTCGAA AAGCGCTTCG ATCCAAACCG GCCATGA
|
Protein sequence | MNINAAPQIL GRSSQDITPG YMSGFGNSFE TEALPGALPV GRNSPQRCAY GLYAEQLSGS PFTAPRGANE RSWLYRIRPS VKHSGRFAKT DMGLWRSAPC FEHDLPIAQL RWDPPPMPQE KLTFLQGVRT MTTAGDVNTQ AGMATHLYLI TQSMVDQHFY NADGEMMFVP QQGSLRLVTE FGIITIEPAE IAVIPRGIKF RVELVDGPAR GYLCENYGGA FTLPERGPIG ANCLANSRDF LTPVAAYEDK DTPTELYVKW GGSLYVTKLP HSPIDVVAWH GNYAPYKYDL RTYSPVGAIG FDHPDPSIFT VLTSPSETPG TANIDFVIFP ERWMVADNTF RPPWYHMNIM SEFMGLIYGV YDAKPQGFVP GGASLHNMML PHGPDREAFD HASNGELKPV KLTGTMAFMF ETRYPQRVTE YAASSGLLQD DYADCWNGLE KRFDPNRP
|
| |
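The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be stripped before computing anything from them. A minimal Python sketch, assuming the sequence text has been copied into a string (the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence):

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence, as displayed in the record.
example = clean_sequence("ATGAACATCA ATGCCGCTCC")
print(len(example))          # 20
print(gc_fraction(example))  # 0.5
```

Applied to the full gene sequence, the same two functions could be used to cross-check the Gene Length and GC content fields.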