Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3628 |
Symbol | |
ID | 5901083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3921094 |
End bp | 3922101 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564139 |
Product | 2-nitropropane dioxygenase NPD |
Protein accession | YP_001685253 |
Protein GI | 167647590 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTGC CGCCCATTCT GCGCGACCGT CTGCGCCTGC CCGTCATCGC GTCGCCGCTG TTCATCATCA GCAATCCCGA CCTGGTGATT GCCCAGTGCA AGGCCGGCGT CGTCGGCTCG TTCCCGGCGC TCAACGCCCG GCCGATCTCG CAACTGGACG AGTGGCTGGC GCGGATCACC GAGGAGCTGG CGGCCCATGA CCGCGCCAAT CCCGACGCGC CCTCGGCCCC GTTCGCGGTC AACCAGATCG TCCACAAGAG CAACAACCGG CTCGAGGAAG ACATCGCCAT GTGCGTGAAG CACAAGGTCC CGGTGGTAAT CACCTCGCTG GGCGCTCGCG AGGATCTGAA CAGCGCGATC CACAGCTACG GCGGCATCAC CCTGCACGAC GTCATCAACG ACAGGCACGC CCACAAGGCC ATCGAGAAGG GCGCCGACGG CCTGATCCCG GTGGCCGCCG GGGCCGGCGG CCACGCGGGC ACCCTGTCGC CGTTCGCTCT GATCCAGGAG ATTCGCGCCT GGTTCGACGG GCCGGTGGCC CTGTCCGGGT CGATCGCCTG CGGTCGCTCG ATCCTGGCGG CCCAGGCCAT GGGCGCGGAC CTGGCCTATA TCGGCTCGGC CTTCATCGCC ACGAAGGAAG CCAACGCGCC GCAGGGCTAC AAGGACACCA TCGTCGAGGC GTCGGCCAAC GACATCGTCT ATTCCAACCT GTTCACCGGC GTGCACGGCA ACTATCTGCG CCAGTCGATC GTCCGGGCGG GCCTGGACCC CGAGAACCTG CCGGTCAGCG ATCCGTCGGC CATGAACTTC GGCTCGGGCG GCAATCAGGA GGCCAAGGCC TGGCGCGACA TCTGGGGCTC CGGCCAGGGT GTCGGCGCCA TCGACACGGT GCTCTCGGTC GGCGAACTGG TCGCCAAGTT CGCGGAGCAG TACGAGGAGG CCAAGGCGGA GCTGGCGGCC AAGACCGCGC TGACTTCGGG AAGCCACCTG GCTTTCGCGG CCCAGTAG
|
Protein sequence | MALPPILRDR LRLPVIASPL FIISNPDLVI AQCKAGVVGS FPALNARPIS QLDEWLARIT EELAAHDRAN PDAPSAPFAV NQIVHKSNNR LEEDIAMCVK HKVPVVITSL GAREDLNSAI HSYGGITLHD VINDRHAHKA IEKGADGLIP VAAGAGGHAG TLSPFALIQE IRAWFDGPVA LSGSIACGRS ILAAQAMGAD LAYIGSAFIA TKEANAPQGY KDTIVEASAN DIVYSNLFTG VHGNYLRQSI VRAGLDPENL PVSDPSAMNF GSGGNQEAKA WRDIWGSGQG VGAIDTVLSV GELVAKFAEQ YEEAKAELAA KTALTSGSHL AFAAQ
|
| |