Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2806 |
Symbol | |
ID | 5706162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3186252 |
End bp | 3187313 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641272262 |
Product | 2-nitropropane dioxygenase NPD |
Protein accession | YP_001537632 |
Protein GI | 159038379 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.180087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0002175 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGCCCCCCG AACTGCCCAC CCGGCTGACC GAACTGGTCG GTGTCCGGCA TCCCATCGTG CAGACCGGCA TGGGATACGT GTCCGGCGCG CGACTGACCG CGGCGACGGC CGACGCGGGC GGCCTCGGTG TCATCGCCTC GGCCACCATG AGCCTCGACG AGCTGCGCTC CGCGATCCGG GAGGTACGTC GTCGCACGTC CGCACCGTTC GGCGTCAACC TGCGTGCCGA CGCGACGGAC GTACGAGAAC GGGTCGGGTT GGTCATCGCG GAATCGGTAC GGGTCGTCTC GTTCGCACTC GCGCCGCGGC GGGACCTCAT CAGCAGGCTC CGGGACGCTG GGGTGGTCAC CATCCCGTCC GTCGGTGCAC TCCGACACGC GGAGAAGGTT GCTGCCTGGG GCGCGGACGC CGTCATCGTC CAAGGCGGTG AGGGAGGTGG GCACACCGGA GCGATCCCCA CCAGTCTGCT GCTTCCCCAG GTGGTTGACG CGGTCGACAT CCCGGTGGTC GCGGCCGGCG GCTTCTTTGA CGGTCGCGGG CTGGTCGCGG CGCTGGCCTA CGGCGCCGCG GGTGTCGCCA TGGGCACCCG GTTCCTGCTG ACCAGCGACA GTCCGGTCGG CTCGGTGGTG AAGCGGGCCT ATCTCGACAG CGGCGTGACC GACACGGTTG TCACCACCCA GGTCGATGGG CTGCCGCACC GGGTCCTCCG GACGCGGTTC GTCGATCGAC TCGAACGATC GGGGCGGATC GTCACGCTGG CCAACGCTTT CGGGCGGGCC GTCGCGCTGC GCCGACTCAC CGGGATGTCC TGGCCGGCCT TGCTCCGTGA CGGTCTCGCC GCCCGACGGA GCCGGGACCT GTCCTGGGCC CAGGCCCTGA TGGCCGCCAA CACACCGGTC CTGCTCCGAG CGGCGATGGT GGACGGCCGA GCCGACCTCG GTGTGATGTC CGCCGGGCAG GTGGTAGGGC TTATCGACGA CGTACCCTCG TGCGCGGAGC TCATCGATCG GATCATGACC GAGGCACAGG AGTGCCTCAC GCGACTCACA ACCGCTCGAT GA
|
Protein sequence | MPPELPTRLT ELVGVRHPIV QTGMGYVSGA RLTAATADAG GLGVIASATM SLDELRSAIR EVRRRTSAPF GVNLRADATD VRERVGLVIA ESVRVVSFAL APRRDLISRL RDAGVVTIPS VGALRHAEKV AAWGADAVIV QGGEGGGHTG AIPTSLLLPQ VVDAVDIPVV AAGGFFDGRG LVAALAYGAA GVAMGTRFLL TSDSPVGSVV KRAYLDSGVT DTVVTTQVDG LPHRVLRTRF VDRLERSGRI VTLANAFGRA VALRRLTGMS WPALLRDGLA ARRSRDLSWA QALMAANTPV LLRAAMVDGR ADLGVMSAGQ VVGLIDDVPS CAELIDRIMT EAQECLTRLT TAR
|
| |