Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_2733 |
Symbol | |
ID | 4286046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 3004234 |
End bp | 3005751 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638142232 |
Product | tryptophan halogenase |
Protein accession | YP_757957 |
Protein GI | 114571277 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAA CACCGAAGGG CCCGGCGCCC GGATTGCCCC GCATTGCCAT TATCGGCGGA GGCTCGGCAG GATGGATGAC GGCGGCCGCG ATCATCAATG CCACGAAGGG CGCGGCCTCG CTCACCCTGG TTGAGTCAGA ACAGATCGGT GTGGTCGGGG TGGGGGAAGC GACTATCCCG CCGATCAAGC TGTTTAACCA GATGCTCGGG ATCGACGAAA ATGACTTCGT GCGAGCCACC AATGGTTCTT TCAAGCTGGG CATCGAGTTT GTCGACTGGT CCCGCAAGGG GCAGCGCTAT TTCCATCCGT TCGGCACACA TGGCCGGGAT TTCGACTCCG TCCCGCTCTA TCAATACTGG TTGCGCGAGC GTAAGCGGGG CGATGACACC CCGCTCGACG CCTATTCAAT GGCCTGGGAG ATAGCCCGGC AAAACCGCTT CTCACCCCCC GCCAGGGACC CTCGCCTGGT GCAGTCTACC TTCGACTACG CCTATCATTT CGATACCATC TTGTACGGTC AGTTCCTGCG GCGTTATGCC GAAACGCGTG GCGTGGTGCG CCAGGAAGGC CGTGTGGTCG ATACGCGCCG GACCGAGACG GGCGACGTGG AGGCGGTGAT GCTCGAAGGC GGGCGCGCCG TGGAGGCCGA CTTCTTTATT GATTGCACCG GGTTTTTCGG CCTCTTGATC GAGCAGGTCC TGGAGACCGG CTATGAGGAC TGGACCCATT GGCTGCCTTG CGACCGCGCC GTCGCAGTGC CCTGTGAGGG CGTCGGTGAT TTCACGCCCT ATACCCGGTC GACCGCCCGC GAGGCCGGCT GGCAGTGGCG CATCCCGCTT CAGCACCGGA CGGGCAATGG CCATGTCTAT GCCAGCCAGT TCATCAGCGA TGAAGCGGCA ACCGACACCC TGCTCGCCAA TCTGGACGGC GAGCCGCTGG CCGATCCGCG GCTCTTGCGC TTCACCACTG GCCGGCGGCG AAAATTCTGG AACCGCAATG TCGTCGCGCT CGGTCTTTCA GCCGGGTTCA TGGAACCGCT TGAGTCGACC AGCCTCCATC TAATCCAGAC AGGCATCAAT CGCTTGTTGG CTTTGTTTCC TGGCACCGGA GACACACAGA AGGAGGCTGC CGAATTCAAT CGGCTGACCG GAGAGGAATA CGAGCGCATC CGCGACTTCC TGATCCTCCA CTATCATGCC ACGACCCGGG ATGACGCGCC GCTCTGGCGC CACACAGCCA ACATGGCGAT CCCCGACAGC CTCGCCTGGC GCATGGAACA CTATCGCGCC AATGGTCGCC TCGTATCGCC CGGGACCGAA CTGTTTCTCA ATCCGTCCTG GATGGCCGTC TATGCCGGTC AGGAGATCGA ACCGGCCGGC CTGGACCCGT TGGCGGCCGC CAGTCCGGTG GACGGTGCGC AACGACTCGC CGGATTGCGC AGGGTGATGG CGGAGGCCAC GGCGCCGGTT CCCGATCACC GGGACTATAT CGAGCGCTTC TGCACAGCTG CGGTCTAG
|
Protein sequence | MSTTPKGPAP GLPRIAIIGG GSAGWMTAAA IINATKGAAS LTLVESEQIG VVGVGEATIP PIKLFNQMLG IDENDFVRAT NGSFKLGIEF VDWSRKGQRY FHPFGTHGRD FDSVPLYQYW LRERKRGDDT PLDAYSMAWE IARQNRFSPP ARDPRLVQST FDYAYHFDTI LYGQFLRRYA ETRGVVRQEG RVVDTRRTET GDVEAVMLEG GRAVEADFFI DCTGFFGLLI EQVLETGYED WTHWLPCDRA VAVPCEGVGD FTPYTRSTAR EAGWQWRIPL QHRTGNGHVY ASQFISDEAA TDTLLANLDG EPLADPRLLR FTTGRRRKFW NRNVVALGLS AGFMEPLEST SLHLIQTGIN RLLALFPGTG DTQKEAAEFN RLTGEEYERI RDFLILHYHA TTRDDAPLWR HTANMAIPDS LAWRMEHYRA NGRLVSPGTE LFLNPSWMAV YAGQEIEPAG LDPLAAASPV DGAQRLAGLR RVMAEATAPV PDHRDYIERF CTAAV
|
| |