Gene Information |
Locus tag | Dret_1851 |
Symbol | |
ID | 8419692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 2122012 |
End bp | 2123133 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645038435 |
Product | 2-nitropropane dioxygenase NPD |
Protein accession | YP_003198713 |
Protein GI | 258405971 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
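The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only a consistency check, not part of the record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a stop codon that is not counted in Protein Length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 2122012
end_bp = 2123133
gene_length_bp = 1122
protein_length_aa = 373


def span_length(start, end):
    """Length of a closed interval [start, end].

    Assumption: coordinates are 1-based and inclusive.
    """
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS.

    Assumption: the final codon is a stop codon and is not
    counted as a residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the values listed in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold for this record: the span from 2122012 to 2123133 covers 1122 bp, and 1122 bp corresponds to 374 codons, that is, 373 residues plus one stop codon.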
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000135103 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.816566 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTACCAC ACACCGCTTT TCCGAGCCTC ACCTTTGGGA ATCTGACCGC CCCTACCCCG ATCATCCAGG GCGGGATGGG CATCGGCATT TCCGGCGCCG GCCTCGCGGC GGCCGTGGCC AACGAAGGCG GCATCGGCGT CATATCGGCT ATTTGCCTGG GCATGCGCGC TCCGGGGTCT CGTCAGGATT ACGCGCAAGC CAATAAAGAA GGGTTGATCC GGGAAATCCG CACAGCCCGG CAAAAGACAT CGGGAGTCCT GGGCGTCAAT ATCATGGTCG CCTGCTCAGA CTACGATTCC CTGGTCTTGG GGGCTATTGA AGAAGAGGCG GACCTCCTCT TTCTCGGTGC CGGTCTGCCC TTGCAATTCC CCAAGGAACT CACGCCGGAA CGCATGCGCA CCATGCACAG CAAACTGGTT CCCATTATCT CCTCGGCCAA GGCCGCGAAT ACGCTGCTCA AATACTGGAG CAAACGTTTC GGACGCCTCC CCGACGGGTT TGTCGTCGAA GGCCCCAAGG CCGGGGGACA TCTTGGCTTC AAACGCGAAC AGATCGAGGA CCCGGCCTAC GCTTTGCAGC GTCTCATCCC GGAAGTCGTC GAGGCTGTGC GCCCGTATGC GGAAAAATAC GGACAACAGA TCCCGGTGAT CGCTGCCGGA GGCGTCTATT CGGGGGCGGA TATCGCGGAA TTTCTCGAAC TCGGTGCCAG CGGCGTCCAG ATGGGCACCC GCTTCGTGGC TACCCACGAG TGCGATGCGG ACAGCAAATT CAAGGAATCT TTCGTCCAGG CCAAAAAAGA AGACCTGACC ATCATCCAGA GCCCGGTTGG CCTCCCTGGA CGGGCGATCA ACAACGCCTT CCTCGAAGAT GTCGCCGCAG GAGCGAAAAA ACCCTTCACC TGCCCCTGGC ATTGCCTGAA GACCTGCGAC TACAAACAAT CCCCCTATTG TATCGCCTGC GCCCTGAACC AGGCCCGGGC CGGACGCCTC AAACACGGCT TCGCCTTTGC CGGCGCCAAC GCCTGGCGGG TGGACGCCAT TATCTCCGTT CAGGAACTCA TGCAGTCCCT GGCCGATGAA TTCGCCGAGG CCACAGCCCT GCCCGAAGCC GCCCTGGCCT AA
|
Protein sequence | MLPHTAFPSL TFGNLTAPTP IIQGGMGIGI SGAGLAAAVA NEGGIGVISA ICLGMRAPGS RQDYAQANKE GLIREIRTAR QKTSGVLGVN IMVACSDYDS LVLGAIEEEA DLLFLGAGLP LQFPKELTPE RMRTMHSKLV PIISSAKAAN TLLKYWSKRF GRLPDGFVVE GPKAGGHLGF KREQIEDPAY ALQRLIPEVV EAVRPYAEKY GQQIPVIAAG GVYSGADIAE FLELGASGVQ MGTRFVATHE CDADSKFKES FVQAKKEDLT IIQSPVGLPG RAINNAFLED VAAGAKKPFT CPWHCLKTCD YKQSPYCIAC ALNQARAGRL KHGFAFAGAN AWRVDAIISV QELMQSLADE FAEATALPEA ALA