Gene Information |
Locus tag | Cpha266_0626 |
Symbol | |
ID | 4569779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 703023 |
End bp | 704282 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639765223 |
Product | 2-nitropropane dioxygenase, NPD |
Protein accession | YP_911105 |
Protein GI | 119356461 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
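The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive coordinates and that the gene length of an unspliced CDS includes the stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Cpha266_0626
length = gene_length_bp(703023, 704282)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values from this record, the sketch reproduces the listed Gene Length (1260 bp) and Protein Length (419 aa).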
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGTAG ACAACTTCAG GTTACAATTA GGAAAACAAG AGTACGTTCC TCTCGTCATC GGAGGCATGG GAGTCAACAT ATCAACAACC GAACTTGCGC TCGCTGCTGA AAGACTCGGC GGCATAGGCC ATATTTCGGA TGCCGAGACG GGATATGTCT GTGATCAATT ATTTGGAACA TCCTTTGTCA GCACAAAAAG AAAACGGTAC ATCGACAACA TCAACAACCC CGACAAGGCG AAAGTCCTTT TTGACCTTGG AGAAGTAGCC GAAGCCCAAA AAAAATACAT CGAGCATACC GTTTCGCAAA AAACCGGAAA GGGCGCGATT TTTTTGAACT GCATGGAAAA ACTGACGATG AACAATGCGC AGGAAACCCT GAAAGTTCGC CTCGCTGCCG CAATGGATGC CGGAATTGAC GGTCTGACCC TCGCTGCCGG CCTGAATCTG AGAACTCTTG ATCTGATTCA GGACCATCCC CGCTTCCGCG ATGTTAAAAT CGGGATTATC ATCTCGTCGG TCAGGGCCCT GTCGATCTTT CTGAAACGGG CAGTTCGTCT TCAGCGGTTG CCCGAATATA TTATCGTTGA AGGACCTCTG GCTGGCGGGC ATCTGGGATT CAGCCCTGAT GACTGGCATA CTTTCGATTT AAAAACAATT TTTAATGAAG TGATCCAGTT TCTCAAGCAA GAGAATCTGG CAATTCCCGT TATTCCTGCC GGTGGCATTT TCACCGGAAC TGATGCCGCC GAGTATCTTG CCGCAGGAGC TTCCGCTGTT CAGGTTGCAA CCCGTTTTAC CATTTCCAAA GAGGCCGGAC TGCCGGCAAA AGTCAAGCAG CACTACATCA ATGCCACCGA GGAGGACATT GTCGTCAATA TGGCATCAAC GACCGGCTAC CCGATGCGCA TGCTCATACA GTCTCCAACT CTCGACTATA CCATGAGACC TAACTGTGAG GGGCTTGGCT ATCTGCTGGA AAACGGAGGA AAATGCAGTT ATATCGACGC CTATCAGAAA GCTCTTGAGT CAAGAAAATC CGGAGAAAAA CTGGCAATCG GTGAAAAAAC ATGCCTCTGT ACCGGAATGG CGAATTACGA CTGCTGGACA TGCGGTCATA TGGCTTACCG CCTCAAGGAG ACCACGAACC GCCTTCATGA CGGATCATGG CAGCTCCCTG CGGCAGAAGA CATCTTTCTC GATTACCAGT TCAGCAGAGA TCACCAGATT CGTCTTCCAG AGCCCGAAGA AAACGCATAG
|
Protein sequence | MIVDNFRLQL GKQEYVPLVI GGMGVNISTT ELALAAERLG GIGHISDAET GYVCDQLFGT SFVSTKRKRY IDNINNPDKA KVLFDLGEVA EAQKKYIEHT VSQKTGKGAI FLNCMEKLTM NNAQETLKVR LAAAMDAGID GLTLAAGLNL RTLDLIQDHP RFRDVKIGII ISSVRALSIF LKRAVRLQRL PEYIIVEGPL AGGHLGFSPD DWHTFDLKTI FNEVIQFLKQ ENLAIPVIPA GGIFTGTDAA EYLAAGASAV QVATRFTISK EAGLPAKVKQ HYINATEEDI VVNMASTTGY PMRMLIQSPT LDYTMRPNCE GLGYLLENGG KCSYIDAYQK ALESRKSGEK LAIGEKTCLC TGMANYDCWT CGHMAYRLKE TTNRLHDGSW QLPAAEDIFL DYQFSRDHQI RLPEPEENA
|
| |
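The gene sequence as listed begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. The following is a minimal sketch of how the protein sequence relates to the gene sequence under translation table 11. It assumes the codon-to-amino-acid assignments of table 11 in the usual NCBI T, C, A, G ordering, and the helper name `translate` is hypothetical. Spaces between the 10-character blocks are ignored.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order
# (codon assignments of NCBI translation table 11; '*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record
print(translate("ATGATCGTAG ACAACTTCAG GTTACAATTA"))
```

Applied to the first three blocks of the gene sequence, this yields the first block of the listed protein sequence (MIVDNFRLQL). Passing the full gene sequence should give the full 419-residue protein, provided the sequence contains only A, C, G and T.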