Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1290 |
Symbol | |
ID | 5898745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1358678 |
End bp | 1360189 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641561775 |
Product | tryptophan halogenase |
Protein accession | YP_001682918 |
Protein GI | 167645255 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0275906 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGC CGCCTCCCTT CAAGTTCGTC ATCGTCGGCG GAGGCACGGC CGGATGGATG GCGGCGGCCG CGCTGGCCCA GGCGTTCAAG GGCAAGGTGG CCGAGATCGA GCTGGTCGAG TCCGACGAGA TCGGCACGGT CGGCGTCGGC GAGGCCACCA TCCCGCCGAT CCAGTTTCTC AACAGCCTGC TGGGCATCGA CGAGATCGAG TTCGTCAAGA AGACCCAGGC CACCTTCAAG CTGGGCATCG AGTTCCGCGA TTGGAGGCGG CCGGGCCACA GCTACCTGCA CCCGTTCGGC CCCATCGGCG CGCCGATCGA GGGCGTGGGC TTCCACCACT ACTGGCGGCG GCTTCGCGAG CATGGCGACA CCACCGACAT CTGCGACTAT TCGATGTCGG CGGTGGCGGC CCGGAAGGGC AAGTTCGTCA TCGCGCCGCG TGAACTGCCC CCCGGCGTGC CGCCGCTGGC CTACGCCTAC CACTTCGACG CCGGTCTCTA CGCCCGCTTC CTGCGCGACT ACGCCGAGGC GCGCGGCGTC AAGCGCACCG AGGGCAAGAT CGTCGAGGTC AAGCAGCGCG TGTCCGACGG CTTCATCGAG GCTCTCAAGC TGGAGGGGGG GCGGCGCGTC GAGGGCGACT TCTTCGTGGA CTGCTCGGGC TTCCGCGGGC TGCTGATCGA GCAGACCCTC AAGACCGGCT ATGAGGACTG GAGCCACTGG CTGCCCTGCG ATCGGGCGCT GGCCGTCCCG TGCGACAGCG TCGCGCCGCT CACGCCCTAT ACGCGCTCGA CCGCGCGCAC GGCCGGCTGG CAATGGCGCA TCCCGCTGCA GCACCGCACC GGCAACGGCT ATGTCTATTC CAGCCCGTTC ATCAGCGACG ACGAGGCGGC CGCCACCCTG ATGGCCAATT TGGACGGCGC GCCGCGCGCC GAGCCTCGGC TGCTGCGGTT CACGACCGGC CGCCGCAAGC AAGCCTGGAA CAAGAACGTC GTGGCCCTGG GCCTGGCCAG CGGCTTCATG GAGCCGCTGG AGTCGACCAG CATCCACCTG ATCCAGACGG GGGTGATGCG CTTGCTCTCC CTGCTGCCGA CCCGCCATCA CGACCCAGCG GCGGTCGAGG AATATAATCG CCTGTCGAAG ATCGAATACG AGCAGATCCG CGACTTCATC ATCCTGCACT ACCGCGCCAC CGAGCGGGAC GATGCCGAGC TGTGGCGCTA CTGCCGGAGC ATGGCGCTGC CCGACAGCCT GACCCACAAG ATCGAGCTGT TCCGCGAACG CGGCAAGGTC GCCCGCTACG ACGAGCAGCT ATTCGCCGAG CCCAGCTGGA TCGCCGTCTT CCTGGGGCAG GGCGTGGAGC CTCGCGATCA CGACCGCCTG GCTGACGTGC CGCCGCTCGC GGACGTCCAA CGCACGCTGT TCAACATGCG GGACAAGATG GCCCATATCG CCGACCGGCT GATCACGCAC GACGCGTTCA TCGCGCGGCA TTGCAAGGCC GATCCGATCT AA
|
Protein sequence | MTQPPPFKFV IVGGGTAGWM AAAALAQAFK GKVAEIELVE SDEIGTVGVG EATIPPIQFL NSLLGIDEIE FVKKTQATFK LGIEFRDWRR PGHSYLHPFG PIGAPIEGVG FHHYWRRLRE HGDTTDICDY SMSAVAARKG KFVIAPRELP PGVPPLAYAY HFDAGLYARF LRDYAEARGV KRTEGKIVEV KQRVSDGFIE ALKLEGGRRV EGDFFVDCSG FRGLLIEQTL KTGYEDWSHW LPCDRALAVP CDSVAPLTPY TRSTARTAGW QWRIPLQHRT GNGYVYSSPF ISDDEAAATL MANLDGAPRA EPRLLRFTTG RRKQAWNKNV VALGLASGFM EPLESTSIHL IQTGVMRLLS LLPTRHHDPA AVEEYNRLSK IEYEQIRDFI ILHYRATERD DAELWRYCRS MALPDSLTHK IELFRERGKV ARYDEQLFAE PSWIAVFLGQ GVEPRDHDRL ADVPPLADVQ RTLFNMRDKM AHIADRLITH DAFIARHCKA DPI
|
| |