Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3935 |
Symbol | |
ID | 5901397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4259376 |
End bp | 4260533 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564456 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001685558 |
Protein GI | 167647895 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0946647 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.167315 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCTGA ACCTCCCAAA CCAAGACCCC GACGCCGCCG ATCCTGATGC GAATTGGGGC CTGCCCGGCT GGCTCTATGA CAACGCCCGC TTCTTCCGCG AGGAACAGGA CAAGGTCCTG CGTCCGTCGT GGCAGATCGT CTGCCATCTG AACGATATCC CCAAACCCGG CGACTTCCAC ACCTTCGACT TTCTGGGCGA GAGCACGATC GTCGTGCGCG GCAAAGACGG CGGGGCGCGG GCCTTCGCCA ATGTCTGCCG GCACCGGGCG GCGCGGCTGC TGGACGGCCC CAGCGGCCAT TGCGCGCGCG TGGTCTGCCC CTACCACGCC TGGACCTACG ACCTGGACGG CCGGCTGATC GGCGTGCCGC ATCGCGAGAC TTACCCTGCC TTGAAGATGG AAGAACAGGG GCTCCACACG GTGCAGGTCG AGGTCTATCG CGGCTTCGTC TTCGTGCGGC TCGAGGGCGA CGGCCCCAGC GTGGCCGAGA TGATGGCGCC CTACGACCAC GAGCTGGAGC CCTATCGCTT CGAGGACATG GTCCCGTTCG GCCGCGTCAC CCTGCGACCG CGCGCGGTGA ACTGGAAGAA CATCAGCGAC AACTATTCAG ACGGCCTGCA CATCCCGGTC GCCCATCCCG GCCTGACCCG GCTGTTCGGG CGGGGCTATG GCGTCGAGGC CGCCCCCTGG GTCGACAAGA TGTGGGGCCA GCTGATCGAG GAGCCGTCGC GCAATCCGTC CGAGCGGATG TACCAGCGGG TGCTGCCCGA CGCCCTCCAC CTGCCGCCCG AGCGCAAGCG GCTGTGGACC TATTTCAAGC TCTGGCCCAA CCAGGCGTTC GACATCTATC CTGACCAGGT GGACTTCATG CAGTTCATCC CGGTCTCGCC CACCCAGACG ATGATCCGCG AGATCGCCTA CGCCCTGCCC GACGACCGGC GGGAGATGAA GGCGGCGCGG TACCTCAACT GGCGCATCAA CCGCCAGGTC AACGCCGAGG ACACGCAGCT GGTGGCCCGC GTGCAGCAGG GCATGGCCTC GCGGAGCTTC ACGGCCGGGC CGCTGGCGGA CTCGGAGGTC AGCTTGCGGA GCTTCGGCCG CAAGATGCGG GCGCTGATCC CGGAAGCGCG GCTGCACCGG CCGCCGGAGG GGTGGTGA
|
Protein sequence | MDLNLPNQDP DAADPDANWG LPGWLYDNAR FFREEQDKVL RPSWQIVCHL NDIPKPGDFH TFDFLGESTI VVRGKDGGAR AFANVCRHRA ARLLDGPSGH CARVVCPYHA WTYDLDGRLI GVPHRETYPA LKMEEQGLHT VQVEVYRGFV FVRLEGDGPS VAEMMAPYDH ELEPYRFEDM VPFGRVTLRP RAVNWKNISD NYSDGLHIPV AHPGLTRLFG RGYGVEAAPW VDKMWGQLIE EPSRNPSERM YQRVLPDALH LPPERKRLWT YFKLWPNQAF DIYPDQVDFM QFIPVSPTQT MIREIAYALP DDRREMKAAR YLNWRINRQV NAEDTQLVAR VQQGMASRSF TAGPLADSEV SLRSFGRKMR ALIPEARLHR PPEGW
|
| |