Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3158 |
Symbol | |
ID | 5706107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3641729 |
End bp | 3643483 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641272590 |
Product | tryptophan halogenase |
Protein accession | YP_001537957 |
Protein GI | 159038704 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000257719 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGGTAGAG AAATAGGCTC TGAGTACGAC GTCGTCGTCA TGGGTGGTGG TCCGGCCGGC TCGACACTCG GAGCGCTGCT GGCCCGGCGG ACGACGTTAA GCGTGGCGAT ATTCGAGAAG GAAGAAATGC CGCGCGAGCA CATCGGCGAG TCGTTTGCCC ATCAGATGAT CCCGGCGATC GAGCAGAGCG GTGCGCTGGA GAAGGTTCTG GCCAGCAAGT GTTGGGTGAA GAAGTACGGC GGCGTGTTCA ATTGGGGCGA GGTTCCGATG GTCGCCTTCT TTGACCACCG CAACTACCTG AACGACGGCG TGCCCCGCTT TGCCATGCAC GTCAACCGCG CCGAGTTCGA CCATATCCTG CTCCAACACG CGGCCGACAG CGGCGTCAAG GTGTTCGAGG ACACGCCGGT CAAGCAGTTC GAGTCACACG CCGACGGCTG CACGATCACC CTGGGCGACG GCACGGTCGT TCGGTCGAAG TACTTCGTAG ACGCCTCGGG CAGGCGCAAC AGCATCGCCG CCAAGCAGAG ACGCGAGTGG CTGTCGACCT ACCGCAACAT CGCCATCTGG CAGCACTTTC TCGGCGGCGA GCGTGTGCAG GACCTGCCCG GCGATTGGAA CATCTTTCGG GAGGGGAACC AGTCGCCGAT CGGCTGTTTC GCCTTCCAGG ACGGCTGGTG CTGGTACATT CCGGTCCCGA AGATTATCGA TGGAAAACGC AGGTTGACCT ATTCGGTCGG CATCGTGACC ATTCCGGAGA TCCTCAAGCA GACGGGGTCC GACTTCACCG ACCAGAAGAC GTTTATCGAC ACGATACGTC GGGTGCCGTA CCTGAAGGAC CTCATCGCCG ACGCAGAGCC GATCGCCGAC AAGATGCTCA CGGCCACAAA CTACTCGATG GTCAACGGCC GGTTTTCCGA CTACGACGAG CGTTGGCTGC TCGTGGGTGA CTCGGCCTAC TTCGTCGACC CACTGTTCTC CTCTGGCGTC GCGTTCGCCA CGAACCAGGC GGTGAACGCC GCGATGCTGC TGGAGCACAC GCTCACCGGC GAACTTAACG AGCAGGGCTG CCGCGACCTG TGGCGCGACT ACGACGAGGG ATGGCACGGC ATGGCCGAAA CCTTCGCGCT CTCGATCGAC CAGTGGTACC ACGCGCTGGG CGGGCAGGAC CCGGAGAGCA TCTACTGGCG GCACCGCAGC AGCAGCCCGG ACCTGGATAT CCAGGAGCGG ACCTTTGACG TGCTACTCAA CACCTCGGTC ACGCCCAACC TCATGCAGCT GATCACGGGG GCGCCGATGC AGGGCGAGGG CCCGCTCACC CGGGCGAACG AGCGGGCCGA ACCAGCGGCG ATCGATGTCG ATGCGACGCT GACCCTGGCA CCTGGCGTGG TCGTCCGCGA AACGGTCGGG CTCGACGTGC CGGGATTCAA GGGGCACCTG CCCCCGCCGC CATTCGACGA CGAGGTGAGC GAGGCGACCA AGGCGGGGAT CGCGACCTAC TGGGCCGACC CAGTCACCAA CCGCGACGTC ATCGAGTCGC CAAGCGCCCT GCCCGTCCCG GCGCACCGCA TCGGCTTCGC CGACGGCGCC ATCGACGTCG AGATCCGGGG CCTGGCTCGG GAAGGTACAG CCGAGTTGCT CGGCTATCTC GCGGCAGGCG TGACGATGCG CAAGCTCGAT GGCCAGCTCA CCATGTCGCA GCATCAGCTC CTGAAGCGAC TCGTCCGCGC CGGGTTGGTC GTGGCCGCCG GTTGA
|
Protein sequence | MGREIGSEYD VVVMGGGPAG STLGALLARR TTLSVAIFEK EEMPREHIGE SFAHQMIPAI EQSGALEKVL ASKCWVKKYG GVFNWGEVPM VAFFDHRNYL NDGVPRFAMH VNRAEFDHIL LQHAADSGVK VFEDTPVKQF ESHADGCTIT LGDGTVVRSK YFVDASGRRN SIAAKQRREW LSTYRNIAIW QHFLGGERVQ DLPGDWNIFR EGNQSPIGCF AFQDGWCWYI PVPKIIDGKR RLTYSVGIVT IPEILKQTGS DFTDQKTFID TIRRVPYLKD LIADAEPIAD KMLTATNYSM VNGRFSDYDE RWLLVGDSAY FVDPLFSSGV AFATNQAVNA AMLLEHTLTG ELNEQGCRDL WRDYDEGWHG MAETFALSID QWYHALGGQD PESIYWRHRS SSPDLDIQER TFDVLLNTSV TPNLMQLITG APMQGEGPLT RANERAEPAA IDVDATLTLA PGVVVRETVG LDVPGFKGHL PPPPFDDEVS EATKAGIATY WADPVTNRDV IESPSALPVP AHRIGFADGA IDVEIRGLAR EGTAELLGYL AAGVTMRKLD GQLTMSQHQL LKRLVRAGLV VAAG
|
| |