Gene Information |
Locus tag | Caul_1801 |
Symbol | |
ID | 5899256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1903246 |
End bp | 1904451 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562291 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001683428 |
Protein GI | 167645765 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
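The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not translated into an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the record above (Start bp, End bp).
gene_len, protein_len = cds_lengths(1903246, 1904451)
print(gene_len, protein_len)  # 1206 401
```

Both results match the Gene Length (1206 bp) and Protein Length (401 aa) fields of the record.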
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.195025 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0204354 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGCCCCA GTCCGCCCGC CGCCGCCTAT CTCGACGACG CGTTCTTCGA GGGCCTTTCG CGCTCGATGC TCGACGTCGG CGCGGCCGAG ACCCTGCCGC CGGCCTGCTA CACCGACGCG GCCTTCTACG CCTTCGAGAA GGAGGCGCTG TTCAATCACG AATGGCTGTG CGTGGGCCGC GTGGATTGGG TGAAGGCGCA GGGCGACTAT TTCACCACCA CGATCATTGG CGAGCCGATC ATCGTCACCC GCAACCGCTC CGACGAGATC AAGGCGATGT CGGCCGTCTG CCAGCATCGC GCCATGCTGG TGGCCGAAGG CCGGGGCAAC ACGCGCGGCT TCGTCTGCCC CTATCACCAC TGGGTCTATT CGCTGAACGG CGACCTGGTG AACGCCCCGG CCATGGAGCG CACCTGCGAC TTCGACAAGA AGTCGATCAA GCTTCCAACC TTCAAGGTCG AGGTCTGGCT GGGCTTCATC TTCATCAATT TCGACGATGC GGCTCCCCCC TTGGCGCCCC GCCTGGAAGC CGTCGAGAGC GCCATCGCCA ATTTCGATCT GTCGAACGCC GAGGGCCTGA CCCCGCCGAT GACCGGCCAG TTCGCCTGGA ACTGGAAGGT GATGTTCGAG AACAACAACG ACGGCTACCA CGCCAACAAG CTGCACCGCG GTCCGTTGCA CGATTTCATT CCCAGCGAGC TGTGCAGCTT CCCGGACGCC GCCGACGGCG ACGCGGGCTT CCTGCGCTTC AACGGCACGC TGCATCCCGA CGCCAGCTTC AATCCGACCC AGAAGGCGGT GCTGCCGATC TTCCCGAAGC TGACGGACGA GGACCGCAAC CGCGCCACCT TCGCCAACAT CCCGCCGACG CTGTCGCTGG TGATGACCAG CGACATGGTC ATCTATCTGA TCCTGCGCCC CACCGGTCCG GAGACCATGG AGCAGGACAC CGGCGTCCTG GTCGCGCCCG GCGCCACCGA GATTCCCGGC TTCGACGAGC GGCTGGAGAT GATCATGACC TCCGCCGGCA AGATCATCGC CCAGGACATG CATGTCGACG AACTGGTCCA GGTGGGCCTC CGCTCGCGGT TCGCGGTGCG GGGTCGCTAC TCCTGGCAGG AGGGGGCGCA GGTGCAGTTC AACCGCTGGC TCACCCCGCG CTACCAGAAA GCCTGGGCGG CGATGAGCAA GGGAGCCGCC GCATGA
Protein sequence | MGPSPPAAAY LDDAFFEGLS RSMLDVGAAE TLPPACYTDA AFYAFEKEAL FNHEWLCVGR VDWVKAQGDY FTTTIIGEPI IVTRNRSDEI KAMSAVCQHR AMLVAEGRGN TRGFVCPYHH WVYSLNGDLV NAPAMERTCD FDKKSIKLPT FKVEVWLGFI FINFDDAAPP LAPRLEAVES AIANFDLSNA EGLTPPMTGQ FAWNWKVMFE NNNDGYHANK LHRGPLHDFI PSELCSFPDA ADGDAGFLRF NGTLHPDASF NPTQKAVLPI FPKLTDEDRN RATFANIPPT LSLVMTSDMV IYLILRPTGP ETMEQDTGVL VAPGATEIPG FDERLEMIMT SAGKIIAQDM HVDELVQVGL RSRFAVRGRY SWQEGAQVQF NRWLTPRYQK AWAAMSKGAA A
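The GC content field can be recomputed from the gene sequence as displayed. The following is a minimal sketch, assuming the sequence is pasted in the space-separated 10-base blocks shown above. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the result for the full gene is not asserted here.

```python
def clean_sequence(raw):
    """Remove whitespace between display blocks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence above, as a small example.
print(gc_percent("ATGGGCCCCA GTCCGCCCGC"))  # 80.0
```

Passing the full Gene sequence string to `gc_percent` gives a value that can be compared against the GC content field of the record.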