Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bpro_3311 |
Symbol | |
ID | 4014003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas sp. JS666 |
Kingdom | Bacteria |
Replicon accession | NC_007948 |
Strand | - |
Start bp | 3506937 |
End bp | 3508286 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637942976 |
Product | tryptophan halogenase |
Protein accession | YP_550120 |
Protein GI | 91789168 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGC AGAGTTCTCC AACCCCCTCA TTCCCGGAAA CCCTGAGCGA ATGCGACGTA CTGGTGATCG GTGGTGGTCC GGCGGGGGCC ACAGCGGGAG CGTTGCTGGC GCAGCGGGGG CACAAGGTCG TGGTCCTCGA AAAAGAACAT CACCCGCGCT TTCACATTGG CGAGTCGCTG CTGCCGGCCA ACCTGCCGCT GTTTGAAAAA CTCGGTGTGG CCGACGCCGT CAAGGCCATC GGCATGGAGA AATGGGGCGC CGAATTTGTC TCGCCCTGGC ATGAGGCCAA GAGCCAGACC TTCAAGTTCG GCGACGCCTG GGACAAGTCC ATGCCGTTTT CCTACCAGGT GCGGCGCTCC GAGTTCGACG AAATCCTGAT CCGCAATGCC GCCCGCCTGG GTGCCGAAGT CATTGAAGGT TGCCGCGTCA AGGACGTCGC GTTTGCGGCC GACCATGGCA GCGCCACGGT GCATGCGCAG CACGAGGATG GCCGCACGCA GCACTGGCGT GCCCGCTTTG TCGTGGACGC CTCGGGGCGG GACACCTTGC TGGGCAAGCA GTTCGACGTC AAACGCCGCA ATCCCAAACA CAACAGCTCC GCGCTCTATG GGCACTTCAC CGGTGCAATC CGGCATCCGG GCCAGGACGA AGGCAACATC ACCATCTTCT GGTTCGAGCA TGGCTGGTTC TGGCTGATCC CGTTGGCGGA CGGCTTCACC AGCATTGGTG CGGTGGTCTG GCCGCATTAT TTGAAGAGCC GGACCAAGCC CGTCAGGGAT TTTTTCCTCG ACACGATTGC GATGTGCCCG GCCCTGAGCG AGCGGCTGGC CCATGCGACG CTGGCCTCCG AAGTGGAGGC GACCGGCAAT TTCTCCTACG CCTGCGATCG CACGCACGGC CCCAATTACG TGATGATTGG CGATGCCTTC ACCTTCATTG ACCCGGTGTT TTCGTCCGGC GTGATGCTGG CGATGCAGGG CGGCTTTGTC GGGGCCGAGA CGGTCGATAC CTGCCTGCGC GAACCCGCCA AAGCCGCGTC GGCACTGGCG CATTTTGACC AGCAGGTGCG GCTGGGCCCC AAAGAGTTTT CGTGGTTCAT CTACCGCGTG ACCAACCCGA CCATGCGCGA CATGTTTATG GCGCCCAGCA ATGTGTGGCG CGTCAAGGAA GCCCTGCTGT CGATGCTGGC CGGCGATATT TTTGGCAAAA CGCCGATCTG GGGTTCGCTG GCGGTGCTCA AGGGCATTTT TTATATTGCC TCGGCCTTGA ACTTCAAGCG CTCGTGGCAG GCTATGCGCC TGCGCAAGGC GAACATCCGT GTTGTGGACG GCCCGGGGCT CGCCCGTTGA
|
Protein sequence | MTEQSSPTPS FPETLSECDV LVIGGGPAGA TAGALLAQRG HKVVVLEKEH HPRFHIGESL LPANLPLFEK LGVADAVKAI GMEKWGAEFV SPWHEAKSQT FKFGDAWDKS MPFSYQVRRS EFDEILIRNA ARLGAEVIEG CRVKDVAFAA DHGSATVHAQ HEDGRTQHWR ARFVVDASGR DTLLGKQFDV KRRNPKHNSS ALYGHFTGAI RHPGQDEGNI TIFWFEHGWF WLIPLADGFT SIGAVVWPHY LKSRTKPVRD FFLDTIAMCP ALSERLAHAT LASEVEATGN FSYACDRTHG PNYVMIGDAF TFIDPVFSSG VMLAMQGGFV GAETVDTCLR EPAKAASALA HFDQQVRLGP KEFSWFIYRV TNPTMRDMFM APSNVWRVKE ALLSMLAGDI FGKTPIWGSL AVLKGIFYIA SALNFKRSWQ AMRLRKANIR VVDGPGLAR
|
| |