Gene Information |
Locus tag | Mmar10_2722 |
Symbol | |
ID | 4286087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 2986370 |
End bp | 2987650 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638142221 |
Product | tryptophan halogenase |
Protein accession | YP_757946 |
Protein GI | 114571266 |
COG category | |
COG ID | |
TIGRFAM ID | |
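
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the listed gene length is consistent with that) and that the last codon of this unspliced CDS is a stop codon. The function names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
START_BP = 2986370
END_BP = 2987650


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1281 426
```

Both results agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields.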
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0792075 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.00766625 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGGCTGAAC CAATCCGGTC GATATGCATT CTGGGAAACG GTCTGGAAGC CTGGCTTACC GCGCATGTGC TGTACAAGGC ACTCGGGGGA GAGGCGGTCT CAATCCATAT CCAACCGCTC GACTCAAGGA TCCATGACGA CTGCGCCTAT ACGATCTTGC GCCCATCGGC GCTGGTCGCT CTGGACCAGT TCGGTCTGAA CCTGCAGGGC CTTCTGCGTC TGCCGGCGAC CCTGCCCAGC CTGGGTCAGG TCCTTCGTTC TGCAAGCGGC GTCGACACGC TCCTGCCCTA TGGCGGGCAA GGTGTCGATT GGGCCGGGAC CAGTTTTCAC CATCATTGGC TTCGCGCACG CCAGGCCGGT TTGCGCCATC CCTATTTCGC CTTTTCGCCC GGCTATCACG CCATGGCGTC AGACCGCTTT GCCCCGCCCG ACAGACGCAA CGCGATAGGG CCGATGCAGC ATGAAAGCGG CTTGCACGTC AGTACTCGGG AGTTGACGGA AAACCTGCGG CTCAATTTGC AGGCCAACAT TATCGTCCTC GATCCGACAG CCAATTGCGA ACAGGCTGAT CTTGTCATCC ATGCGCCCGG GGCTCCGGAC CGGAATTCTA CGCTGGACCC GGTTCACGCT GGCCCGCCCA AACCCTACGC CGTGCGAGTC GAGGCTGGCG GCGAATCGCG CCTGCAAATC CCGCTGCGTT CCGGCTGGTT GGACCTGCCG ACGCCTGCGG GACCGGCCAA ACGTCATTGT GGCAACTCAC CCTGGGGCGA GGATGGCCTG GTCGTCGGGC TCGCGGCCGC CCACCTGCCC GGCCTTGAAG ACCGGTCCAT GGACCGGCTT CTCTTCGAGC TCGAAACCTT GCTGGAACTC TGGCCACGGT CCGGTGTCCA TTCCGCGGAG GCCCTCGAAT ACAACCGCCT TTGGGCGCAG GAGGCCGATG AATGGACAGC ACTGAGCGCG CTGTCGGCCG ACCAGGAAGA CCAACGTTGT CAGGCGCGGA AGGCTGTGTT CCGCCGACGC GGTTATATCG AGCCGCTGGA AAGCCGCACG ATCACTCCGG AGGACTGGGT GGAGGCCTTT ATTGGGCGCG GTGTGATCCC TGCCCATTAC GACCGGTTGA GTGAGCGTTT GACCGACCCG CAGCTCAAGT CCGAGCTGCA AAAATTCACC GATGCGGTCG GTCGGACCGT CCGCGAGTTT CCCAGTTTCC CGAGCTATCT ACGGGCGATC GACCGAGCGG TCGGGCCGGC CACCGATGCC AAGACGGAGC CATCCGCATG A
Protein sequence | MAEPIRSICI LGNGLEAWLT AHVLYKALGG EAVSIHIQPL DSRIHDDCAY TILRPSALVA LDQFGLNLQG LLRLPATLPS LGQVLRSASG VDTLLPYGGQ GVDWAGTSFH HHWLRARQAG LRHPYFAFSP GYHAMASDRF APPDRRNAIG PMQHESGLHV STRELTENLR LNLQANIIVL DPTANCEQAD LVIHAPGAPD RNSTLDPVHA GPPKPYAVRV EAGGESRLQI PLRSGWLDLP TPAGPAKRHC GNSPWGEDGL VVGLAAAHLP GLEDRSMDRL LFELETLLEL WPRSGVHSAE ALEYNRLWAQ EADEWTALSA LSADQEDQRC QARKAVFRRR GYIEPLESRT ITPEDWVEAF IGRGVIPAHY DRLSERLTDP QLKSELQKFT DAVGRTVREF PSFPSYLRAI DRAVGPATDA KTEPSA
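
To check that the protein sequence corresponds to the gene sequence, the nucleotides can be translated codon by codon. The following is a minimal sketch using only the Python standard library. It assumes the space-separated 10-character groups are display formatting only, and it relies on translation table 11 assigning the same amino acid to each codon as the standard code (the two tables differ in which codons may act as start codons). The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, and only the first 30 bases of the gene are used here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino-acid
# assignments are those shared by NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character groups and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three groups of the gene sequence and first group of the
# protein sequence, copied from the record above.
GENE_PREFIX = "ATGGCTGAAC CAATCCGGTC GATATGCATT"
PROTEIN_PREFIX = "MAEPIRSICI"

print(translate(GENE_PREFIX) == PROTEIN_PREFIX)  # True
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow the same comparison against the complete Protein sequence and GC content fields.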