Gene Sare_3158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3158 
Symbol 
ID5706107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3641729 
End bp3643483 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content65% 
IMG OID641272590 
Producttryptophan halogenase 
Protein accessionYP_001537957 
Protein GI159038704 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000257719 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGGTAGAG AAATAGGCTC TGAGTACGAC GTCGTCGTCA TGGGTGGTGG TCCGGCCGGC 
TCGACACTCG GAGCGCTGCT GGCCCGGCGG ACGACGTTAA GCGTGGCGAT ATTCGAGAAG
GAAGAAATGC CGCGCGAGCA CATCGGCGAG TCGTTTGCCC ATCAGATGAT CCCGGCGATC
GAGCAGAGCG GTGCGCTGGA GAAGGTTCTG GCCAGCAAGT GTTGGGTGAA GAAGTACGGC
GGCGTGTTCA ATTGGGGCGA GGTTCCGATG GTCGCCTTCT TTGACCACCG CAACTACCTG
AACGACGGCG TGCCCCGCTT TGCCATGCAC GTCAACCGCG CCGAGTTCGA CCATATCCTG
CTCCAACACG CGGCCGACAG CGGCGTCAAG GTGTTCGAGG ACACGCCGGT CAAGCAGTTC
GAGTCACACG CCGACGGCTG CACGATCACC CTGGGCGACG GCACGGTCGT TCGGTCGAAG
TACTTCGTAG ACGCCTCGGG CAGGCGCAAC AGCATCGCCG CCAAGCAGAG ACGCGAGTGG
CTGTCGACCT ACCGCAACAT CGCCATCTGG CAGCACTTTC TCGGCGGCGA GCGTGTGCAG
GACCTGCCCG GCGATTGGAA CATCTTTCGG GAGGGGAACC AGTCGCCGAT CGGCTGTTTC
GCCTTCCAGG ACGGCTGGTG CTGGTACATT CCGGTCCCGA AGATTATCGA TGGAAAACGC
AGGTTGACCT ATTCGGTCGG CATCGTGACC ATTCCGGAGA TCCTCAAGCA GACGGGGTCC
GACTTCACCG ACCAGAAGAC GTTTATCGAC ACGATACGTC GGGTGCCGTA CCTGAAGGAC
CTCATCGCCG ACGCAGAGCC GATCGCCGAC AAGATGCTCA CGGCCACAAA CTACTCGATG
GTCAACGGCC GGTTTTCCGA CTACGACGAG CGTTGGCTGC TCGTGGGTGA CTCGGCCTAC
TTCGTCGACC CACTGTTCTC CTCTGGCGTC GCGTTCGCCA CGAACCAGGC GGTGAACGCC
GCGATGCTGC TGGAGCACAC GCTCACCGGC GAACTTAACG AGCAGGGCTG CCGCGACCTG
TGGCGCGACT ACGACGAGGG ATGGCACGGC ATGGCCGAAA CCTTCGCGCT CTCGATCGAC
CAGTGGTACC ACGCGCTGGG CGGGCAGGAC CCGGAGAGCA TCTACTGGCG GCACCGCAGC
AGCAGCCCGG ACCTGGATAT CCAGGAGCGG ACCTTTGACG TGCTACTCAA CACCTCGGTC
ACGCCCAACC TCATGCAGCT GATCACGGGG GCGCCGATGC AGGGCGAGGG CCCGCTCACC
CGGGCGAACG AGCGGGCCGA ACCAGCGGCG ATCGATGTCG ATGCGACGCT GACCCTGGCA
CCTGGCGTGG TCGTCCGCGA AACGGTCGGG CTCGACGTGC CGGGATTCAA GGGGCACCTG
CCCCCGCCGC CATTCGACGA CGAGGTGAGC GAGGCGACCA AGGCGGGGAT CGCGACCTAC
TGGGCCGACC CAGTCACCAA CCGCGACGTC ATCGAGTCGC CAAGCGCCCT GCCCGTCCCG
GCGCACCGCA TCGGCTTCGC CGACGGCGCC ATCGACGTCG AGATCCGGGG CCTGGCTCGG
GAAGGTACAG CCGAGTTGCT CGGCTATCTC GCGGCAGGCG TGACGATGCG CAAGCTCGAT
GGCCAGCTCA CCATGTCGCA GCATCAGCTC CTGAAGCGAC TCGTCCGCGC CGGGTTGGTC
GTGGCCGCCG GTTGA
 
Protein sequence
MGREIGSEYD VVVMGGGPAG STLGALLARR TTLSVAIFEK EEMPREHIGE SFAHQMIPAI 
EQSGALEKVL ASKCWVKKYG GVFNWGEVPM VAFFDHRNYL NDGVPRFAMH VNRAEFDHIL
LQHAADSGVK VFEDTPVKQF ESHADGCTIT LGDGTVVRSK YFVDASGRRN SIAAKQRREW
LSTYRNIAIW QHFLGGERVQ DLPGDWNIFR EGNQSPIGCF AFQDGWCWYI PVPKIIDGKR
RLTYSVGIVT IPEILKQTGS DFTDQKTFID TIRRVPYLKD LIADAEPIAD KMLTATNYSM
VNGRFSDYDE RWLLVGDSAY FVDPLFSSGV AFATNQAVNA AMLLEHTLTG ELNEQGCRDL
WRDYDEGWHG MAETFALSID QWYHALGGQD PESIYWRHRS SSPDLDIQER TFDVLLNTSV
TPNLMQLITG APMQGEGPLT RANERAEPAA IDVDATLTLA PGVVVRETVG LDVPGFKGHL
PPPPFDDEVS EATKAGIATY WADPVTNRDV IESPSALPVP AHRIGFADGA IDVEIRGLAR
EGTAELLGYL AAGVTMRKLD GQLTMSQHQL LKRLVRAGLV VAAG