Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2227 |
Symbol | |
ID | 9156383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2318837 |
End bp | 2320177 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | cytochrome P450 |
Protein accession | YP_003647175 |
Protein GI | 296139932 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.536845 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGCCG ACAGCATCCC GCACCCGCCC TGGCGCGTCC CGTTCCTCGG CGACGTGCTC GGCATCGACC GAGCGCACCC GATGCAACAG GCCACCGCTC AATTCCGCGA ATGGGGACCG ATCCTCAAGC GCACCTTTGC CGGCCACGAC TTCGTCGCGG TCGGCTCGGC TGAGCTCGCC GACGCGGTAT TCGACGACGA GAATTGGCGC AAGTACGTCG GCCCGCCCCT GCGCGCATTG CGCCCGCTCG CGGGACAAGG CATGCTCATC CAGCCCGACG GTGCCGACTG GGCGCGCGGG CACGCTGCCG CCGCACCGGC GTTCGCTCGC GGACCGATGG AGGGCTACCA CCACGTCATC GTCGAGTCCC TCGATCGTGC CGCCGAATAC TTGCGGTGCG CCGACGGAGC CGTCGACACG TTCGAATTCA CCAGCGCCCT CACCCTCCAT ATCGCCTGCA TGACCACCTT CGGAGAGTCC GACGTCGTGA TCGGTGGCGA GCCATCGCCC GTAAGCACTG CTCTCACCAG GACTCTGCGG GCGATCACTA GCACGTCGAT GATCGCGCCG CGATGGGATC GCCGCCGCCG GCCGCGGACC TGGCGTGCCT TCGATCGCGA TGTCGCGATG CTGCACGAGA TCGTTGACCG CGCGGCACGA AAGAGAACCG CCGACGGCGC CACACACCGG GATATCCTGC ACCACCTACT CAACCCCCCG GCGGGCGTTG AGCTGCGACC CGAAGAGGTG CGCGACCACG CGGTGGTGTT CCTACTCGCA GGACACGAGA CAACGGCATC GTCGATGGCG ACCGCCCTGC ACTTTCTGGC CACGCATCCC GACGTTGCGG ACCGCGTCCG AGTAGAAGCC GCGAGCGTTG ACCCGCGCGA GTACGCCGAC GTCGCGCGGC TGCGGTATAC GCGTGCCGTT GTCCATGAGA CGCTGCGCCT GTGGCCGCCG ACATCTGGCG TGTTCCGACA GGCCAAGTAC GACACCCAGC TCGGTGGTCA CGCTATTGCG GCCGGGGAAT GGGTGTTCGT GGTTCTGCTC GCCGCGCAAC GCGACACCTC ATGGGGGCCG CGCGCCGACG ACTTCGACCC CGACCGGTTC CTCCATCCCG AGACCGGGAA AGCCCGTGTC GCTTCGCTGT TCAAGCCGTT CGGCCACGGA CCACGGCAAT GTATCGGACG AGCCTTCGCG CTGCATGAGC TCACGCTCGC CCTCGCCGTG CTACTGCGTG ACTTCGACGT TCACGGCGAC CCGGACTACA CCCTGCACAT GTCCGAGGCT GTCACCACGC GACCGAAAGG ATTGCAACTC CAATTCGCAC ACAGCGCCTG A
|
Protein sequence | MAADSIPHPP WRVPFLGDVL GIDRAHPMQQ ATAQFREWGP ILKRTFAGHD FVAVGSAELA DAVFDDENWR KYVGPPLRAL RPLAGQGMLI QPDGADWARG HAAAAPAFAR GPMEGYHHVI VESLDRAAEY LRCADGAVDT FEFTSALTLH IACMTTFGES DVVIGGEPSP VSTALTRTLR AITSTSMIAP RWDRRRRPRT WRAFDRDVAM LHEIVDRAAR KRTADGATHR DILHHLLNPP AGVELRPEEV RDHAVVFLLA GHETTASSMA TALHFLATHP DVADRVRVEA ASVDPREYAD VARLRYTRAV VHETLRLWPP TSGVFRQAKY DTQLGGHAIA AGEWVFVVLL AAQRDTSWGP RADDFDPDRF LHPETGKARV ASLFKPFGHG PRQCIGRAFA LHELTLALAV LLRDFDVHGD PDYTLHMSEA VTTRPKGLQL QFAHSA
|
| |