Gene Information |
Locus tag | Mmar10_2721 |
Symbol | |
ID | 4286086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 2984854 |
End bp | 2986377 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638142220 |
Product | tryptophan halogenase |
Protein accession | YP_757945 |
Protein GI | 114571265 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000284489 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0465531 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGTCAAG ATGCCCCAAG GCGGATCGTC GTTGTCGGCG GAGGCACCGC AGGATGGATG GCTGCGGCCG CGCTGGTGTC GGTTTTGCCC AGCCAGCGCG TCCAGGTCAC CCTGGTCGAA TCCGAAGCAA TCGGCATCAT CGGTGTGGGC GAGGCGACGC TTCCCCATCT GCGCCATTTC AATGAAACCC TTGGCATCAA CGAGGCCGAT TTCATCAAGG CGACGTCCGC CACGCTGAAG CTCGGCATCG AGTTCGTGAA CTGGGCCCGA AAGGGCGACA GCTATGTGCA CCCATTCGGC GATTTCGGGA CCGAGATTGC CGGCCTGCCC TTTCATCAAG CCTGGACCCG GATGCGGGCC GCCGGCAAGG CCCGGGATAT CGGTGCCTAC TCGCTTCCCG TCCGCATGTG CGCGGCAAAC CGGTTCGACA GACCCGCAGA AGACCCGGCC GATTTTGCAT CCCGCTTCGG CTATGCCTAC CAGTTCGACG CCACCCGTTA TGCGCCCTTC CTGCGCCAGC ATGCCGAAGC CCGCGGCGCG ACCCGAATTG AGGGCATTGT CGACACGGTT CATTGCGATC CTGAAACCGG CGATATCGAG CGGCTCGACC TGAAGGACGG GCAAGAGATC GAAGGCGATT TCTTCTTTGA CTGCACCGGA TTCCGAGGCG TTCTGATCGA GCAGGCGTTG AATGTGGGTT ATGAGGACTG GTCACATTGG CTGCCGTGCA ACCGGGCTAT CGCCCTGCCT AGCGAAAAAT CCGGACCAAC CCCGCCCTAC ACGCGTGCAA CCGCACATCA GGCGGGCTGG CTATGGCGGA TTCCGCTGCA GCACCGCACC GGGAACGGGC ATGTCTATGC CAGCGATTTC ATCGATGATG AGACGGCCCG TCAGACGCTG CTCGACAATC TGGAAGGCGC GCCCCTGGCT GATCCTCGAC CGCTGCGTTT CACAACCGGC AGGCGTAAAC AATTCTGGGC TCATAATTGC GTCAGCATCG GGCTGGCTGG CGGTTTCCTT GAACCGCTTG AATCGACGAG CATCCATCTA ACGCAGATCG CAATCACGCA ATTCATTGAA CTGTTTCCGG TAGATAACGA TTACACGCTT GAGCGAGAAA GCTACAACGC GCACATGACG CGGGAATTCG AGCGCGTGCG CGACTTCCTC ATCCTCCACT ATCATGCGAC CGAACGGACT GATTCCGAGT TCTGGAACTA CGTCCGCACC ATGCCGGTAC CGGATTCGCT GACGGAAAAA ATGGCCCTGT TTCGGCAAAC CGGGCGTGTC GGTCGATATC AGCAAGGGCT ATTTTTGGAG CCCAGCTGGC TCGCCGTCTA TCTCGGTCAG CGAATCGTCC CGCAGAGCTG GGATGGGCGA TTGGACACGA TCCCGGAAGA ATCCCTCGGT CAGTCGCTGA CAACAATCGA GTCGCAGATC GATCAGGCGA TGTCCCGTAT GCCCGACCAT GACACCTGGC TGGCCAATCT GGCCGGCGTC GAGGCGGAGA CGCGTCATGG CTGA
|
Protein sequence | MGQDAPRRIV VVGGGTAGWM AAAALVSVLP SQRVQVTLVE SEAIGIIGVG EATLPHLRHF NETLGINEAD FIKATSATLK LGIEFVNWAR KGDSYVHPFG DFGTEIAGLP FHQAWTRMRA AGKARDIGAY SLPVRMCAAN RFDRPAEDPA DFASRFGYAY QFDATRYAPF LRQHAEARGA TRIEGIVDTV HCDPETGDIE RLDLKDGQEI EGDFFFDCTG FRGVLIEQAL NVGYEDWSHW LPCNRAIALP SEKSGPTPPY TRATAHQAGW LWRIPLQHRT GNGHVYASDF IDDETARQTL LDNLEGAPLA DPRPLRFTTG RRKQFWAHNC VSIGLAGGFL EPLESTSIHL TQIAITQFIE LFPVDNDYTL ERESYNAHMT REFERVRDFL ILHYHATERT DSEFWNYVRT MPVPDSLTEK MALFRQTGRV GRYQQGLFLE PSWLAVYLGQ RIVPQSWDGR LDTIPEESLG QSLTTIESQI DQAMSRMPDH DTWLANLAGV EAETRHG
|
| |
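The coordinate and length fields in the Gene Information table are mutually consistent, and this can be checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names (`span_length`, `protein_length`, `gc_fraction`) are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 2984854
END_BP = 2986377
GENE_LENGTH_BP = 1524
PROTEIN_LENGTH_AA = 507


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

To compare against the reported GC content, `gc_fraction` can be given the Gene sequence field as is, since it strips the spaces between the 10-base blocks. Note that the gene starts with TTG, an alternative initiation codon under translation table 11, which is why the protein sequence begins with M.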