Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1841 |
Symbol | |
ID | 5899296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1956930 |
End bp | 1958375 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641562331 |
Product | tryptophan halogenase |
Protein accession | YP_001683468 |
Protein GI | 167645805 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.132292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00068545 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGGCA GGTCCTTGAA AAGCGTCGCC ATCGTGGGCG GCGGCGTCGC CGGCTGGATG ACCGCCTGCG CCCTGGCCCG CGTCCTGCCC GCCGACTGCG CGATCCGGGT CGTCGAGACG GCCGCCGCCG CCCCGCGCGG CGCGCTCTCC ACCCAGCCGA CCCTGCGCGC CTTCCACGGC CTGCTCGGCC TGGATGAACC CGCCCTGATG CGCGCCGCGC GCGGGACCTT CAAGCTGGGC TCGCGGTTCA GCGGCTGGAC GGTCGCCGAC CATGTCGAGG GCTTCAGCGA CACCGGCGCC AATCTTGATG GCGTGGCCTT CCATCACCAC TGGCTGCGGG CCCGCGAACG CGGCGAGGCC GGACGGTACG AGGACTACAA CCTGGCCGCC GTGGCGGGAC GACTCGGCCG GTTCGCCCCG CCCAGCGAGG ATCCTCGATC CGTGCTGTCG ACCCTCTCCT ACGGCCTGCA CCTCGACGCG GCAGGCTATG TCGCGGCATT GCGGGCCGCG GCGGGGCGCG TGGACCGCAT GGCCGGCGAG ATCGTCGAGG TGACCCCGAA CGCGGACGGC GGCCTCGACA CCGTGCGGTT GGCCAGCGGC GAGCGCGTCG CGGCCGACCT GTTCATCGAC ACCACGCCGG ACGGCCGGTT GATCGGATCG AACGCCGTCG GCGCCTGGAT CGACTGGTCG TCCTGGCTGC CTTGCGACCG ACTGGCCCTG CGCGAGGTCG CCGCGCGCCT CGAGCCGCCG CCCCTGACCG AGGTCGAGGC GATACCCGAG GGGTGGCTGC GGCGCATCCC GTTGCGCGGC GGCGACGCGG TGGCCCTGGC CTACAATTCG CGCCTGACGT CCGACGACGC GGCGCGGGAG ATCCTGGGCG GCGAGGCGAC GATCGCCCCG CTTAGCAATG GTCGGCGGGC CCAGGCCTGG GTCGGTAATT GTCTGGCGAT CGGTCCAGCG GCCGGCCAGC TGGAGCCCTT AAACGGCGAC GACGCCCATT TGGTCCAAAG CGGCGTCAGC CGGCTGATCG CCCTGCTGCC GACCGCCGAC GGCTCGCCCC TAGCCGCGAC CGAGTACAAC CGGTTGATGG CCGAGGAACT GGATCGGACC CGCGACACGG CTGCCTTCCG CTACGCCGTG GCCGCCCGGA CCGATCCGGT CTGGACCCTG GCCCGCCAAG CCCCGCCCTC CCCCGCCCTG GCCTACAAGC TTAGCCAGTT CGAGAGCCGC GGCCGGGTGG TGATGTACGA CGAGGAAACC TTCGTGGAAG GCGCGTGGCT CGCGGCGTTT CTCGGCCACG GGATCCTGCC CCGGCGTCAC GACCGCCTTG CCGACCGGCT GCCCGCCGAC CAGGCGGACG CCGCCTTGGC GCGATTGCGG GGCCTGATCC GCCAGGCCGC GCTCGCCATG CCGACCCAAG CGCAAGCCCT GAAGGATCCC GCATGA
|
Protein sequence | MTGRSLKSVA IVGGGVAGWM TACALARVLP ADCAIRVVET AAAAPRGALS TQPTLRAFHG LLGLDEPALM RAARGTFKLG SRFSGWTVAD HVEGFSDTGA NLDGVAFHHH WLRARERGEA GRYEDYNLAA VAGRLGRFAP PSEDPRSVLS TLSYGLHLDA AGYVAALRAA AGRVDRMAGE IVEVTPNADG GLDTVRLASG ERVAADLFID TTPDGRLIGS NAVGAWIDWS SWLPCDRLAL REVAARLEPP PLTEVEAIPE GWLRRIPLRG GDAVALAYNS RLTSDDAARE ILGGEATIAP LSNGRRAQAW VGNCLAIGPA AGQLEPLNGD DAHLVQSGVS RLIALLPTAD GSPLAATEYN RLMAEELDRT RDTAAFRYAV AARTDPVWTL ARQAPPSPAL AYKLSQFESR GRVVMYDEET FVEGAWLAAF LGHGILPRRH DRLADRLPAD QADAALARLR GLIRQAALAM PTQAQALKDP A
|
| |