Gene Information |
Locus tag | Caul_3561 |
Symbol | |
ID | 5901016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3844978 |
End bp | 3846546 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564069 |
Product | tryptophan halogenase |
Protein accession | YP_001685186 |
Protein GI | 167647523 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | |
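The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 3844978
END_BP = 3846546


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1569 522
```

Under these assumptions the computed values agree with the Gene Length (1569 bp) and Protein Length (522 aa) fields.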
Plasmid Coverage Information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage Information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATCGCA GTCGAAGAAT CCTCATCGTC GGGGGCGGCA CGGCCGGGTG GCTGACGGCC GCCTATCTGG CCAAGTCCCT GCGGATCGCC GAGCAGGCGC ACCTGGAGAT CACGCTGCTG GAGTCGCCCG ACATCGGCGT CATCGGCGTC GGCGAGGCGA CGTTCCCGAC CATCCGCACG ACGCTGCGGT TTCTCGGCGT CGACGAGGCG AGGTTCATCC GCGAGACCTC GGCCACCTTC AAGCAGGGCA TCCGCTTCAA CGACTGGGCG TGGGCGCAGG GCGAGGGCGG CGATGGCCCC CAGCGTCATC AGTACTTCCA TCCGTTCGAG GCGCCGTTCT CGACCGACGG CGCCAGCCTG GCGCCCTACT GGCTGCTGCA GAGCGAGGCG ACGCGCGCGC CGTTCGCCGA GGCCATGACC ATCCAGGCCC GGGTCGCCGA CGCCCAGCGC GCGCCCAAGC GTCCGCACGA GGGCGACTTC TCCGGGCCCC TGAACTACGC CTATCATTTC GACGCGGCCA AGCTGGCCGT GGTGCTGGCC GAGCGCGCCG TCGAGCTTGG CGTGCGCCGT CTGCCGGGCC TGCTGACGGG CGTGGAGCTC GACGCGACCG GCGCCATCGA CCACGTGATC TCGCAGGAGC ATGGCCGCCT GGAGGCCGAT CTCTACATCG ACTGCACGGG ATTCCGGGCC GAGCTGATCG GCCAGGCCCT GAAGGCCCCG TTCAAGTCGG CGCGGCCCAT CCTGTTCGCC GACCGGGCCC TGGCCTGCAA GATCCCCTAC GACCGCCCCG ACGCGCCGAT CCAGAGCTTC ACCGTCGCCA CCGCCCACGA GGCCGGCTGG ACCTGGGACA TCGGCCTGAA TGGCGCGCGC GGCGTCGGCT GCGTCTATGC CAGCGACCAC ATGGATGACG ACCGGGCCGA GGCCATCCTG CGCGGCTATG TCGGGGAAGG CGTCGAGATC GCGCCCCGGT CGCTGTCGTT CGAGGCGGGC TATCGCCAGA AGCAGTGGGT CAAGAACTGC GTGGCCGTCG GCCTGTCAGC CGGGTTCCTG GAGCCGCTGG AATCGACGGG CGTGGTGCTG ATCGAGGCGG CGGTGGCGAT CATCGCCGAG CTGTTCCCGC ACAACGGTCC GATCAGCGCC CCGGCCTTGC GCTTCAACGA GCTGATGACC GCCCGCTACG ACAACATCAT CACCTTCCTG AAGCTGCACT ACTGCCTGAG CCAGCGCACC GAGCCGTTCT GGCGCGCGAA CGCCGACCCG GCCTCGATTC CGGAACGGCT GGCCGACCTG CTGGAGCAGT GGCGCTGGCG CCCGCCCACC CGCTACGACT TCATCCTGGA TCTCGAGACC TTCGCCTTCT TCAACTACCA GTACATCCTG TACGGCATGG GCTTCAAAAC CGACCTGTCG CCAGGGCGCG GCGAGTTTCC CGACGTGGCG GCGGCCGACA AGCTGTTCGC CAAGATCAAG ACCTTCGGCG ACCGCGCCAC CCAGGACCTG CCCAGCCACC GCGACCTGAT CTCGCGGATC AACCGGTTTG GCTTTGATCG GGCGGCGGAG CACGCTTGA
Protein sequence | MDRSRRILIV GGGTAGWLTA AYLAKSLRIA EQAHLEITLL ESPDIGVIGV GEATFPTIRT TLRFLGVDEA RFIRETSATF KQGIRFNDWA WAQGEGGDGP QRHQYFHPFE APFSTDGASL APYWLLQSEA TRAPFAEAMT IQARVADAQR APKRPHEGDF SGPLNYAYHF DAAKLAVVLA ERAVELGVRR LPGLLTGVEL DATGAIDHVI SQEHGRLEAD LYIDCTGFRA ELIGQALKAP FKSARPILFA DRALACKIPY DRPDAPIQSF TVATAHEAGW TWDIGLNGAR GVGCVYASDH MDDDRAEAIL RGYVGEGVEI APRSLSFEAG YRQKQWVKNC VAVGLSAGFL EPLESTGVVL IEAAVAIIAE LFPHNGPISA PALRFNELMT ARYDNIITFL KLHYCLSQRT EPFWRANADP ASIPERLADL LEQWRWRPPT RYDFILDLET FAFFNYQYIL YGMGFKTDLS PGRGEFPDVA AADKLFAKIK TFGDRATQDL PSHRDLISRI NRFGFDRAAE HA
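The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that the gene sequence is shown in coding orientation (it begins with ATG even though the strand is "-") and that translation table 11 assigns the same amino acids to internal codons as the standard code. The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the initiator-codon rules of table 11 are not modelled.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence shown in this record.
print(translate("ATGGATCGCA GTCGAAGAAT CCTCATCGTC"))  # MDRSRRILIV
```

The ten residues produced here match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` would give a figure to compare against the GC content field.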