Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2138 |
Symbol | |
ID | 5899593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2311507 |
End bp | 2313012 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641562627 |
Product | tryptophan halogenase |
Protein accession | YP_001683764 |
Protein GI | 167646101 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.903949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0562186 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCGAA CCAACCCCAT TCGTTCGATC CTGATCGTCG GCGGCGGCAC GGCCGGTTGG ATGGCCGCGA CCTGGCTGGC CGGACGGTTG GCGCGTCAAG ACATCCAGAT CACCGTCGTG GAGTCGCCCG ACATTCGCAC CATCGGGGTG GGCGAGGCGA CCGTGCCGGC CATTCGCGGC TATTTCCGAG ACATCGGCGT CAGCGAAGCC GAAGTCATGG CCGCCACCCA GGGCACGGTG AAGCTGGGCA TCGAATTTCG CGACTGGAAG CGGGACGGGG AGAGCTTCTT CCACCCGTTT GGTCTCTACG GCATGGCCTC GCGCGGCGTG CCCTTCCACC AGTTCTGGCT CAAGCGCCAA GCCGAGGGCG ACACCGCGCC GTTGGCCGCC TACAGCCTGT GCACCCAGTT GGCCATGGCC AACCAGATGA TGGAGCCGCC GGCTTCACCG CCCAACGATC TGGGCGTGTT CAACTGGGCG GTCCATTTCG ACGCCGGCCT CTATGCGCAG TTCCTGAGGC GCAAGGCCAC GTCCGAGCTG GGTGTCACCC ATGTCGACGG CACGGTCGTC GAAGTGGCCA AGAACGGCGA GAACGGCTTC CTGACCGGCG TGGCCCTGGC GGACGGCCGC GTTCTCGAGG CCGACCTGTT CATTGATTGC TCGGGTTTCC GCAGCCTGCT GCTCGGCCAG GCGCTGGGCG TGGACTATGA GGATTGGACC CATCTGCTGC CCTGCGACCG GGCGGTGGCC CTGCCCTGCG AGCGTGACGG TCCGCTGACG CCCTACACCC GCAGCACCGC GCTGGCGGCC GGCTGGCAGT GGCGCATCCC GCTGCAACAT CGGGTCGGCA ACGGCTATGT CTATTCCAGT CGCCATATTT CGGACGACGA GGCCACGGCC GTTCTAATGT CGCGCTTGGA GGGGCCGGCT CTGGCCGAGC CCAACCTGCT GCGTTTCCAG ACCGGGCGCC GGCGCCGCTT CTGGGAGAAG AACTGCATCG CCCTGGGCTT GGCCGCCGGC TTCATGGAGC CGCTGGAATC CACCAGCATC GTGCTGATCC AAAGCGGGTT GGAGCGCCTG GGCGCGCTGT TTCCCGATCG CGGTTTCGAC CCGGCCCTGG CCGACGAGTA CAACCGCATC ACCACGCTCG AGTACGAGCG CATCCGCGAC TTCCTGCTGC TGCATTACGT CGCCAACCGT CGCGAGGGCG AAGCGATGTG GGACCATGTG CGTCAGCTGG CCTTGCCCGA ACCCCTGGTC CACAAGATGC GAATGTTCGC CAGCCGTGGA ACCATGGTCC GCTATGAGTG GGAGTCGTTC CACGACCCCA GCTGGCTGTC GATGTACGCC GGCTTCGACA TCGCGCCGCG CGCCCACGAT CCGATGGCGG ACTATTTCAC CAAGCCCGAG CTCGACAGCG CCTTGCGCCG GATGCGCGAA GCGATCGCTC GCGCTCAGGC CTTGGCCGTT CCTCACGAGG CCTTCCTGGC GGCGCAACGC CCTTGA
|
Protein sequence | MARTNPIRSI LIVGGGTAGW MAATWLAGRL ARQDIQITVV ESPDIRTIGV GEATVPAIRG YFRDIGVSEA EVMAATQGTV KLGIEFRDWK RDGESFFHPF GLYGMASRGV PFHQFWLKRQ AEGDTAPLAA YSLCTQLAMA NQMMEPPASP PNDLGVFNWA VHFDAGLYAQ FLRRKATSEL GVTHVDGTVV EVAKNGENGF LTGVALADGR VLEADLFIDC SGFRSLLLGQ ALGVDYEDWT HLLPCDRAVA LPCERDGPLT PYTRSTALAA GWQWRIPLQH RVGNGYVYSS RHISDDEATA VLMSRLEGPA LAEPNLLRFQ TGRRRRFWEK NCIALGLAAG FMEPLESTSI VLIQSGLERL GALFPDRGFD PALADEYNRI TTLEYERIRD FLLLHYVANR REGEAMWDHV RQLALPEPLV HKMRMFASRG TMVRYEWESF HDPSWLSMYA GFDIAPRAHD PMADYFTKPE LDSALRRMRE AIARAQALAV PHEAFLAAQR P
|
| |