Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0425 |
Symbol | |
ID | 5897699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 465196 |
End bp | 466197 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641560911 |
Product | 2-nitropropane dioxygenase NPD |
Protein accession | YP_001682060 |
Protein GI | 167644397 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCCG CCGCCTTCGC GCCTCGCCTT CGCCTGCCGG TCATCGGCTC GCCGCTATTT ATCATCTCGG GCCCCGACCT GGTGATCGCC CAGTGTAAGG CTGGGATCAT CGGCTCGTTT CCGTCGCTGA ACGCTCGCCC CCTCTCCCTG CTCGACGAGT GGCTGCACCG CATCACCGAG GAGCTGGCCG CCTGGGACCG GGCGCATCCG GAAAGCCCCT CGGCGCCGTT CGCGGTCAAC CAGATCGTTC ACAAGACCAA CAACCGCTTG GACGAGGACC TGGCGCTCTG CGTCAAGTGG AAGGCTCCTC TGGTCATCAC CTCGCTGGGC GCGCGCGCGG ACGTCAATCA AGCCGTCCAC GACTATGGCG GTCTGACCTT CCACGATGTC ATCAACGATC GCTTCGCCCA CAAGGCCATC GAGAAAGGCG CCGACGGCCT CATCGCCGTC GCGGCGGGAG CCGGCGGCCA CGCCGGCACC CTGTCGCCCT TCGCCCTGAT CCAGGAGATC CGAGCCTGGT TCGAGGGCCC CTTGGCGCTG TCGGGCTCGA TCGCCAACGG CGCCGCGATC CTCGCCGCCC AGGCCCTGGG CGCGGATTTC GCCTATATGG GCTCGGCCTT CATCGCCACC CAGGAAGCCA ACGCCGATCC CGCCTACAAG CAGATGATCG TCGAGGCCGC CTCGTCCGAC ATCCTCTATT CCAACCTCTT TACCGGCGTG CACGGCAACT ATCTTCGCCC GTCGATCATC AAGGCGGGGT TGGACCCCGA CAACCTGCCG ATCAGCGATC CGTCGGCGAT GAACTTCGGC TCCGGCGGCA ATCAAAAGGC CAAGGCCTGG CGCGACATCT GGGGCTGCGG CCAGGGGATC GGCGCGATCG ACGCGGTGCG CACGACCGCA CAGTTCGTCG ATCAGTTGGA AGCCGAATAC GAGGCGGCCA TTCGGGCTTT GGACCAAAGA ACCGAGGCGG CGGGCTCGGT GCGCGTCTGG GGCGCGGCCT AG
|
Protein sequence | MIPAAFAPRL RLPVIGSPLF IISGPDLVIA QCKAGIIGSF PSLNARPLSL LDEWLHRITE ELAAWDRAHP ESPSAPFAVN QIVHKTNNRL DEDLALCVKW KAPLVITSLG ARADVNQAVH DYGGLTFHDV INDRFAHKAI EKGADGLIAV AAGAGGHAGT LSPFALIQEI RAWFEGPLAL SGSIANGAAI LAAQALGADF AYMGSAFIAT QEANADPAYK QMIVEAASSD ILYSNLFTGV HGNYLRPSII KAGLDPDNLP ISDPSAMNFG SGGNQKAKAW RDIWGCGQGI GAIDAVRTTA QFVDQLEAEY EAAIRALDQR TEAAGSVRVW GAA
|
| |