Gene Information |
Locus tag | Mvan_5269 |
Symbol | |
ID | 4644735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 5643651 |
End bp | 5644721 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639808744 |
Product | 2-nitropropane dioxygenase, NPD |
Protein accession | YP_956046 |
Protein GI | 120406217 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
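
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the listed gene length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Mvan_5269.
length = gene_length_bp(5643651, 5644721)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Both results agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields listed in the record.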
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.353912 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCGGC TCAGCACCCC GCTGACCGAA CTGGTCGGCA TCGAGCATCC CGTCGTACAG ACCGGGATGG GGTGGGTCGC CGGCGCCCGA TTGGTCTCGG CGACGTCGAA TGCCGGCGGG CTGGGCATCC TGGCCTCGGC GACGATGACG CTGGAGGAGC TCACCACCGC GGTGGCCAAG GTCAAGGCCG CCACCGACAA GCCGTTCGGC ATCAACATCC GCGCCGATGC CGGCGATGCG AACGCACGCG TCGACCTGCT GATCCGCGAA GGGGTCAAGG TCGCCTCGTT CGCGCTGGCC CCGAAGCCCG ATCTCATCGC GAGGCTGAAA GACGCCGGCG TGGTGGTGAT TCCGTCGGTC GGTCTGGCCA AACACGCCAA GAAGGTGGCG GGCTGGGGCG CCGACGCAGT GATCGTGCAG GGTGGTGAGG GCGGCGGGCA CACCGGCCCG ATCGCCACCA CGCTGCTGCT GCCGTCGGTG CTCGACGCCG TCGCCGACAC CGGGATGCCG GTGATCGCCG CGGGTGGCTT CTTCGACGGC AGGGGCCTGG CCGCCGCGCT GTCGTACGGC GCGGCCGGGG TCGCCATGGG CACCCGGTTC CTGCTGACGT CGGACTCGAC CGTGCCCGAC GCGGTCAAGC AGCGTTACCT GGACGCCGCG CTGGACGGCA CCGTGGTGTC GACCAAGGTC GACGGTATGC CGCACCGGGT GCTGCGCACC GGGCTGGTCG AGAAGCTGGA GAGCGGGTCC CCAATCCGGG GGCTCTCGGC AGCGGTGCTC AACGCGCAGA AGTTCAAGAA GATGTCCGGC ATGACCTGGA AGTCGATGAT CACCGACGGG CTGGCCATGC GGCACGGCAA GGACCTGACG TGGTCGCAGG TCGTGATGGC CGCCAACACT CCGATGCTGC TCAAGGCGGG CCTCGTCGAG GGCAACACCG ATGCCGGTGT GCTCGCCTCC GGCCAGGTGG CCGGGATCAT CGATGATCTG CCCTCGTGCG CGGAACTGGT GCCGGCGATC GTCGCCGAAG CCGTCGAACA CTTGCAGAAG GCCTCGCAAC ACATCCGCTG A
|
Protein sequence | MSRLSTPLTE LVGIEHPVVQ TGMGWVAGAR LVSATSNAGG LGILASATMT LEELTTAVAK VKAATDKPFG INIRADAGDA NARVDLLIRE GVKVASFALA PKPDLIARLK DAGVVVIPSV GLAKHAKKVA GWGADAVIVQ GGEGGGHTGP IATTLLLPSV LDAVADTGMP VIAAGGFFDG RGLAAALSYG AAGVAMGTRF LLTSDSTVPD AVKQRYLDAA LDGTVVSTKV DGMPHRVLRT GLVEKLESGS PIRGLSAAVL NAQKFKKMSG MTWKSMITDG LAMRHGKDLT WSQVVMAANT PMLLKAGLVE GNTDAGVLAS GQVAGIIDDL PSCAELVPAI VAEAVEHLQK ASQHIR
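
The gene and protein sequences above are related through translation table 11. The sketch below is one way to check that relationship and to recompute GC content from the nucleotide sequence. It assumes the sequence is pasted as shown, in space-separated blocks, and it does not handle alternative start codons (this gene starts with ATG, so that case does not arise here). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First six codons of the gene sequence from the record.
print(translate("ATGAGTCGGC TCAGCACC"))  # MSRLST
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and the value from `gc_percent` can be compared with the GC content field.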