Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1974 |
Symbol | |
ID | 5899429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2119372 |
End bp | 2120775 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641562463 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001683600 |
Protein GI | 167645937 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGGCCC CGCTGAACCG AGAAACCCTG TCGGATGGAA CCGCGATATC CGACCTGATC AACCTGGACA CGCGCGAGGT CAAGATGCGC GCCCTGTCCG ATCCGGAACT CTTTGCACTG GAGATGGAGC GCATCTTCGC CAAGACCTGG CTGTTCCTCG GCCACGAAAC CGAAATCCCC AATCCCGGTG ACTTCGTCAC CCGCGACATG GGCTCGGACG GGGTGATCGT CGCCCGGGAT CGCGAGGGCC AGATCCACGT CTCGTTGAAC GTCTGCCCCC ACCGCGGGAT GAAGATTTCG ACCTTGGAGG CCGGCAACAC CCTGGCGCAC GTCTGCATCT ACCATGGCTG GGCCTTCAAG CCGAACGGCG ACTTCGTCGG CGCTCCGGTT CGCAGCGAAT GCATGCAGGG CAAGATGCTG ACGGATGAGC AGTTGTCGCT GAAAAAGGCC CGGATCGCGA TCTACGGCGG GCTGATCTTC GCCACCTTCA ATATTGACGG CCCGAGCTTC GACGAGTTCC TCGGCGACGC GAAATGGTAT TTCGATACTC TGTGGAACCG CACGGCCGGA GGCATGGAAG TGCTTGGGCC GCCCCAACGC TTCATCATCC GAGCCAATTG GAAGACGGCC TGCGAGCAGT CGGCCTCAGA CGGCTTCCAT ACCCTGACCC TCCACCGGTG GCTGGGCGAG GTCGGCCCCT ACGCCAAGAA GCCCGAGGCG GAAGGCCAGG GCGCCGACCT GGCCGCCGAG ATGGGCGGAT GCGAGGTCTG GACCGATGGC GGCCACACCA TGCGCTGCAT CGACCTGGAC CGTAAGATTC GCCGCATCAC TGGGCGCGAT CCGTCCGAAC TGTCCGCCGC CGAGAAGCTC GCACTGCTGC CTCCCCCAGG CATGACCCCG GAGATGGTGC CGGAACTGCT GGAACGTTTC GACGACGACC ATCTTCGCTT GATGGCCTGG CGGCCGCCTC AGGTCGGCAA TTTCTTTCCC AATGGCCTGT TCGAGTTCAT CTACCTGCCG CAGCCGGACG GGACTGTGGC CGGCGCCATG GCCCTGCACG CCTATGTGCC CAAGGGCCCC GACAAGCTCG AATTCATGAA CTGGATTTTC GCGGAGAAGG ACACCCCGCC TGCGTTGAAG GCCCGCATGC TGCGCCAGTC GATCCAGCTT CTGGGCACCT CGGGGATGGT CGAACAGGAC GACTCGGACA CGTGGCCGCA CCAGACCATC GTCGCCAAGG GTGCGGTTTC CAAGGATATC ACCATGAAAT ACCAGGCCCT CTACGAGACG GGCCGGCCCG CCAACTGGCC CGGCCCGGGT CATGTCGGTG AAGGTTTCAC CAAGGACGAT ACCCAGTGGC AGTGGTGGAA GTCCTGGTAC GACCTGATGG TCGTCGACGC CTGA
|
Protein sequence | MLAPLNRETL SDGTAISDLI NLDTREVKMR ALSDPELFAL EMERIFAKTW LFLGHETEIP NPGDFVTRDM GSDGVIVARD REGQIHVSLN VCPHRGMKIS TLEAGNTLAH VCIYHGWAFK PNGDFVGAPV RSECMQGKML TDEQLSLKKA RIAIYGGLIF ATFNIDGPSF DEFLGDAKWY FDTLWNRTAG GMEVLGPPQR FIIRANWKTA CEQSASDGFH TLTLHRWLGE VGPYAKKPEA EGQGADLAAE MGGCEVWTDG GHTMRCIDLD RKIRRITGRD PSELSAAEKL ALLPPPGMTP EMVPELLERF DDDHLRLMAW RPPQVGNFFP NGLFEFIYLP QPDGTVAGAM ALHAYVPKGP DKLEFMNWIF AEKDTPPALK ARMLRQSIQL LGTSGMVEQD DSDTWPHQTI VAKGAVSKDI TMKYQALYET GRPANWPGPG HVGEGFTKDD TQWQWWKSWY DLMVVDA
|
| |