Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1853 |
Symbol | |
ID | 5899308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1975622 |
End bp | 1977115 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562343 |
Product | tryptophan halogenase |
Protein accession | YP_001683480 |
Protein GI | 167645817 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.930503 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAAGAAC AGGTCGTGAA AAGGGTCGTG ATCGCCGGGG GCGGAACGGC CGGCTGGATG GCGGCGGCGG CGCTGGTCAA GCAACTGGGG CCGCTGCTCG ACATCAGCCT AGTCGAGTCC GACGAGATCG GCACGGTCGG GGTGGGGGAG TCCACCATCC CGACGGCCCG CACCTTCAAC GCGCTGCTAG GGATCGACGA GCCGGCGTTC ATGCGCGCCA CCCAGGCGAC ATTCAAGCTG GGCATCGCGT TCGAGAACTG GGGGCGGATC GGCGATCGCT ACATCCACTC GTTCGGCCAA GTGGGCAAAT CCACCTGGAT GGGCGGCTTC CACCATTTCT GGCTACAGGC CAAGGCGGCG GGCTTTGGCG GCGATCTGGG GGACTATTGC CTGGAGTTGA AGGCCGCCGA GGCCGACCGG TTCTCCACCG GTGACGGTCC AGAGCTGAAC TACGCCTATC ATCTGGACGC GACGCTCTAC GGCGGCTTCC TGCGCCGCAT GGCCGAGGCT TTGGGCGTCA AGCGGATCGA GGGCAAGATC AGCCAGGTCG AGCAGCAGGC CGAGACCGGC TTCATCCAGG CCTTGGTCAT GGAAAATGGC GACCGGGTCG AGGGCGACCT GTTCATTGAT TGCACAGGCT TCCGAGGGCT GCTGATCGAG CAGACGTTGA AGGCGGGTTG GGAGGACTGG GGCGACTGGC TGCCGACCAA CAGCGCGCTG GCGGTGCAGA CCAGGGCCAC GGGTCCGGCC GTGCCCTATA CCCGCGCCAT CGCCCACGAG GCGGGCTGGC GCTGGAAGAT CCCGCTGCAG AATCGGGTCG GCAACGGTCT GGTCTATTGC AGCGAGTACA TGTCGGACGA CAAGGCCCGC GAGACCCTGC TGGAGTCGCT GGACGGCGAG CGGCTGATCG AGCCTCGGCT GATCCGCTAC CGCACGGGCC GCCGCCTGAA GACCTGGCAC AAGAACTGCG TCGCCCTGGG CCTGGCCAGC GGCTTCGTCG AACCGCTGGA GTCGACCTCG ATCCACCTGA TCATGATCGG GGTGACGCGG CTGATGCAGC TGTTTCCGTT CCACGGCGTC AGCGACGCCG TCGTCGCGCG CTACAACCAG CAGGCCGTCG ACGAGCTGGA GAAGATCCGC GACTTCATCA TCCTGCACTA TAAACTGACC GAGCGGACCG ACAGTCCGTT TTGGGATCGT TGCCGGACGA TGGACATCCC GGACTCCCTG GCCCAGCGCA TCGACCTGTT CCGCGAGAGC GCCCAGGCCT ACCAGTCGCC AGGCGAGCTG TTCCAGGTCG ACTCGTGGCT GCAGGTCATG CTCGGCCAGA GGCTGGAGCC GCGCGAGCAC CATCTCATGG GCCGCCTGAT GCCGGCTGAT CAGCTGAACC GGGCGCTGAG CGACTTGAGG GGCAACATCG CGCGCGCCGT GACCCAACTG CCGAGCCATC AGGCGTTCCT CGACCGTTAC TGTCCCGCGT CAGCGGCGAT GTGA
|
Protein sequence | MQEQVVKRVV IAGGGTAGWM AAAALVKQLG PLLDISLVES DEIGTVGVGE STIPTARTFN ALLGIDEPAF MRATQATFKL GIAFENWGRI GDRYIHSFGQ VGKSTWMGGF HHFWLQAKAA GFGGDLGDYC LELKAAEADR FSTGDGPELN YAYHLDATLY GGFLRRMAEA LGVKRIEGKI SQVEQQAETG FIQALVMENG DRVEGDLFID CTGFRGLLIE QTLKAGWEDW GDWLPTNSAL AVQTRATGPA VPYTRAIAHE AGWRWKIPLQ NRVGNGLVYC SEYMSDDKAR ETLLESLDGE RLIEPRLIRY RTGRRLKTWH KNCVALGLAS GFVEPLESTS IHLIMIGVTR LMQLFPFHGV SDAVVARYNQ QAVDELEKIR DFIILHYKLT ERTDSPFWDR CRTMDIPDSL AQRIDLFRES AQAYQSPGEL FQVDSWLQVM LGQRLEPREH HLMGRLMPAD QLNRALSDLR GNIARAVTQL PSHQAFLDRY CPASAAM
|
| |