Gene Information |
Locus tag | Plav_1497 |
Symbol | |
ID | 5454754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 1628522 |
End bp | 1629742 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640877070 |
Product | tryptophan halogenase |
Protein accession | YP_001412773 |
Protein GI | 154251949 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
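The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Plav_1497.
length = gene_length(1628522, 1629742)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Under these assumptions the computed values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.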
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.756039 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATCCTG ATGTCGTGAT TATCGGCGCC GGGCCGGCGG GTTCGGTTGC GGCTGCCATG CTGGCCGGTG CGGGCTTTTC GGTCGAGGTT CTGGAGCGGG CGCATTTTCC GCGCTTCTCC ATCGGCGAAA GCCTGTTGCC GCAGGCCATG GAGTGGCTCG CAGAAGCGGA TTTGCTGCGC GATGTCGTCG AGGCTGGCTT CCAGCACAAG AACGGGGCCA TGTTCCGTCA CGGCGACAGG GAGGAGAGCT TCGACTTCCG TATGAAAAGT TCGGATGGCT GGGGCACGAC CTATCAGGTT CGCCGCGACA AGTTCGACGA TCTCCTCGCA AAGGGTGCGG TGCGCAAGGG CGCGAAGGTG AGCTTCGGAC AAACCGTCAT CGCGATGCGG CCCGATCCCG TTGCCCCCAG CCTCACGGTG CGCGACGAAG AAGGCAATGA GCGCGAGATC ACCGCACGTT TCGTTCTCGA TGCGAGCGGC TTCGGCCGCG TGCTGGCGCG TCTTCTCGAT CTTGAATCGC CAGCGGGCTT CCCCGATCGC ATGTCGATCT TCACCCATGT CGAAGACAAC ATCCCGCCAC AGGCATACGA CCGGAACAAG ATCCTCATAA CGGTCAATCC GCGAAACAGC GAAATCTGGT ACTGGATGAT CCCGTTGGCG GATGGTCTCT GTTCGATGGG CGTCGTCGGT AAACCCGAAC ATCTTGCACC CTATGGTTCG ACGCGAGAGG AACAGCTCGC CTCACTTGTC GCCGAATCCG GTCTGATGGG TGAGCTTCTC GTGAATGCGC GCCGCGTGCG CGATGTTGGC GAAATATCGG GCTATGCCGC CCGTGTCAGC AGCCTCACCG GCCCCGGCTA CGCGCTGCTC GGCAATGCGG GCGAGTTTCT CGATCCCGTG TTTTCCTCAG GCGTCACCAT CGCGCTCAAA TCGGCCTCGC TCGCAACGCA TGCGCTTGTC AGGCAGCTTA AGGGTGAAAC GCCCGACTGG GACAAGGAGT TCGCACAGCC GCTTGCTCGA GGCGTCGAGA CTTTCCGCGC CTATGTCAGC GGCTGGTATG ACGGCTCGCT GCAGAAAATT ATCTTCAGCC AGCCGGCGGA TGCGAACCAG ATCAAGAAGA TGATCACCTC GGTGCTTGCC GGCTACGCCT GGGACGAGGC CAACCCCTTT GTCCGAGAGC CGGGCAAATA TCTCAAGATG GTGGAAGAGT TGTGCGGCTG A
|
Protein sequence | MNPDVVIIGA GPAGSVAAAM LAGAGFSVEV LERAHFPRFS IGESLLPQAM EWLAEADLLR DVVEAGFQHK NGAMFRHGDR EESFDFRMKS SDGWGTTYQV RRDKFDDLLA KGAVRKGAKV SFGQTVIAMR PDPVAPSLTV RDEEGNEREI TARFVLDASG FGRVLARLLD LESPAGFPDR MSIFTHVEDN IPPQAYDRNK ILITVNPRNS EIWYWMIPLA DGLCSMGVVG KPEHLAPYGS TREEQLASLV AESGLMGELL VNARRVRDVG EISGYAARVS SLTGPGYALL GNAGEFLDPV FSSGVTIALK SASLATHALV RQLKGETPDW DKEFAQPLAR GVETFRAYVS GWYDGSLQKI IFSQPADANQ IKKMITSVLA GYAWDEANPF VREPGKYLKM VEELCG
|
| |
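The gene and protein sequences above can be related by translating the gene under translation table 11, in which GTG is an accepted start codon and is read as methionine when it initiates translation. The sketch below uses only the Python standard library; the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the example runs on the first three 10-base blocks of the gene sequence rather than on the full record. It is an illustration under these assumptions, not the pipeline that produced the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 shares these
# assignments with the standard code and differs in its start codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Start codons recognised by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating start codon is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


prefix = "GTGAATCCTG ATGTCGTGAT TATCGGCGCC"
print(translate(prefix))         # MNPDVVIIGA
print(gc_percent("GTGAATCCTG"))  # 50.0
```

The translated prefix matches the first block of the protein sequence (MNPDVVIIGA). Passing the full gene sequence to the same functions would allow the 61% GC content and the complete protein sequence to be checked in the same way.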