Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5309 |
Symbol | |
ID | 5897088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010333 |
Strand | - |
Start bp | 19001 |
End bp | 20143 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641550602 |
Product | 2-nitropropane dioxygenase NPD |
Protein accession | YP_001672088 |
Protein GI | 167621580 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.1832 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTGC ACAGCCGCAT CTGCGAGATT TTCGGTATCC GCTATCCCAT CGTCCTCGCC GGAATGGGCG GGGCCAGTGT TCCCCGCCTG GCAGCGGCGG TCTCCAATGC CGGAGGGCTA GGAATCCTTG GCGCCGCAGC CTGCTCGCCC GAGGAACTTC GCGCGTGGAT CCGCGAAGTG CGATCGCTCA CCGACAAGCC CTTCGGCGTC GACACATTGC TGCCGGCCTC GGTCCGCCGC GAGGTGGTCG ACGCGGCGGC AGGCTCCGGC GAGGGCGGCA AGCCGTCGCC CATGGATCTG CTTGGCGACT ATCAGGCGTT TGCAGCCGAC TTCATGCGCC AGGAAGGCCT GCAAAAGGTC GTCCGTCCGC GCGAGGACAC CAACGCCGAA GCTCGGGGCG GCCCCGCGTT CTTCTCCAAG GAGTTCTTCG AGGCGCAAAT GGAGGTGGTG ATCGAGGAAA AGGTGCCGGT CTACGCCGCG GGCCTGGGAA ACCCCGGCCC CTGGATGGAG CGACTGCGGG AGAACGGTAC AAAGATCATG GCCGTCATCG GGTCTGTGAA GCACGCGCTG CAGGTCGCCG CGTCTGGCAT CGACGTGGTG GTCGCTCAGG GACATGACGG GGGCGGACAC AATTCGCCGA TCGGCACCAT GGCGCTTATC CCCCAGGTCG TCGACGCCAT GGCGGGGCGC ATCCCGGTGC TCGGGGCCGG CGGTATTGCC GACGGCCGCG GCGTAGCCGC CGCGATGATG CTGGGGGCTG AAGGCGCCTG GGTGGGCACC GCTTTCCTGG CGACGGAGGA AGCCGGCATT CAGCAATTCC AGAAGGAGGT TCTGGTCGAG TACGGCGATG GCGACACGGT AGTGTCAAAA TCCGTCACCG GAAAGCCGGC CAGGATCATT CGCAATAAGT GGGCGCAGGC GTGGGTGGAC GCGGAGAAGT CACCGCTGCC CATGCCCTTC CAGTCGATCA TCGCCGGGCC CGTGCTCGCG GCGGCGACCC TGGACCAGCG CAAGGATATC GCGCCTGGGT TTGCCGGCCA GGGCATGGGG CTCATCAAGG CGATTCGCCC CGCCCGGGAC GTCCTGGAAG ACCTCGTCAG CGGCGCCGAG ACCGCGCTCG CTCGCGCCGA CCGTTTTCGC TAA
|
Protein sequence | MALHSRICEI FGIRYPIVLA GMGGASVPRL AAAVSNAGGL GILGAAACSP EELRAWIREV RSLTDKPFGV DTLLPASVRR EVVDAAAGSG EGGKPSPMDL LGDYQAFAAD FMRQEGLQKV VRPREDTNAE ARGGPAFFSK EFFEAQMEVV IEEKVPVYAA GLGNPGPWME RLRENGTKIM AVIGSVKHAL QVAASGIDVV VAQGHDGGGH NSPIGTMALI PQVVDAMAGR IPVLGAGGIA DGRGVAAAMM LGAEGAWVGT AFLATEEAGI QQFQKEVLVE YGDGDTVVSK SVTGKPARII RNKWAQAWVD AEKSPLPMPF QSIIAGPVLA AATLDQRKDI APGFAGQGMG LIKAIRPARD VLEDLVSGAE TALARADRFR
|
| |