Gene Sala_0918 details


Gene Information

Locus tag: Sala_0918
Symbol:
ID: 4083128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 930297
End bp: 931808
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 61%
IMG OID: 638009279
Product: tryptophan halogenase
Protein accession: YP_615969
Protein GI: 103486408
COG category:
COG ID:
TIGRFAM ID:
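
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 930297
END_BP = 931808
GENE_LENGTH_BP = 1512
PROTEIN_LENGTH_AA = 503


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: 931808 - 930297 + 1 = 1512 bp, and 1512 / 3 - 1 = 503 aa.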


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGGCA TGGACAGCAG CACCGCCAGG CTGAAAATCG TCATCGCCGG CGGCGGCACC 
GCCGGCTGGA TGGCGGCGGC GGCCTTGTCC GGCACCTTGG GCGCGGCAAT CGATCTGACC
CTGATCGAAT CCGATGCCAT TGGCACGATC GGCGTTGGAG AAAGCACGAT CCCGCCGATC
GTGCTGTTCA ATCGATTGAT GGGGATCAAT GAAGCCGCAT TCATGCGCGC GACCCAGGCG
ACGTTCAAGC TGGGGATCCA GTTTGAAAAC TGGAAGCATG TCGGCGAAAG CTATTTCCAT
TCTTTTGGCA CCACGGGAAA GGATCACTGG TCGGCGGGGT TCCAGCATTT CTGGCTTCAC
GGCCTGACCT GCGGGCACAG CCGGAGTTAT GACGATTACT GCCTGGAGCT GAAGGCTGCG
CACGAGGGCA AGTTCGCGCA TCTGCCCGAC GATCGCATGA ATTATGCCTA TCAACTTGAT
TCGTCGCTTT ACGCCGCCTT TTTGCGCGAG CGTGCGGAAG GCGACGGCAC GAGGCGAATC
GAGGGGCGGA TCGCCGCCGT CGAACTCGAC GGTGCAAGCG GCAATATCGC GGCGCTGATG
CTCGACGGCG AGCGGCGGAT CGAAGGCGAT CTTTTCATCG ATTGTACCGG TTTTCGTGCG
CTGCTGATCG AAGGCGCCCT GCACGCGGGG TTCGACGACT GGACACACTA TCTTCCCTGC
GATTCCGCGA TTGCCGTGCA GACTGCAAGC GTCGCCCCGC CCGTGCCCTA TACGCGGGCG
ATCGCGCACG ATGCCGGATG GCAATGGCGC ATTCCGTTGC AGCACCGCCA GGGAAACGGG
ATCGTCTATT GCAGTCGTTA CCTTGCCAAA GACGATGCGC TCGACCGGTT GCTCGGTTCC
ATCGAGGGTG ATGTGCTGAC CGAGCCGAAC TTCATCGGCT ATCGCACGGG AGCGCGGCGC
AAACAGTGGT ACCGAAACTG CGTGGCAGTC GGTCTGTCGG GCGGGTTCAT GGAGCCGCTT
GAATCGACCT CGATCCACCT CATCCAGCGG GCCGTGCTGC GTCTCATCCG TATGCTGCCT
TCGGGGCCGG TCAGCGAGCG CGACATCGCC GAGTTCAACG ATCAGCAATT TGCCGACATG
GAACAGATCC GCGACTTTCT CATTCTCCAT TACAAGGTGA CCGAGCGGCG TGATTCACCC
TTTTGGCGGC AGTGTGCGGC GATGCCGATC CCGGCGAGCC TTGAACAGAA AATCGAACTG
TTTCGCGAGA CGGGCCGGGT GTTCCGCAGA AATGAGGAGC TTTTTGTCGA AAACAGCTGG
GTGCAGGTGA TGATGGGCCA GGGCATCATG CCACAGCGCT ATCATCCGAT CGCGGCAAAG
CTCCGTCCCG ACGAACTCGA GGCATTTCTG TCGATGCTGC GCGACGGCGT CGAGCGAACG
GTTGCAAGCC TGCCTGCGCA CGGGGCCTAT ATCGCCCGAT ATTGCGCGGT CGGCGGGCGG
AACGACGCAT GA
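
A GC content figure such as the 61% listed in this record is normally the share of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the sequence is supplied as plain text in the 10-base blocks used above; `gc_percent` is a hypothetical helper name, and only the first block of the gene is used here as sample input.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 10-base block of the gene sequence above: 6 of 10 bases are G or C.
first_block = "ATGACGGGCA"
print(gc_percent(first_block))  # 60.0
```

Passing the full 1512 bp gene sequence to the same function would give the whole-gene value.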
 
Protein sequence
MTGMDSSTAR LKIVIAGGGT AGWMAAAALS GTLGAAIDLT LIESDAIGTI GVGESTIPPI 
VLFNRLMGIN EAAFMRATQA TFKLGIQFEN WKHVGESYFH SFGTTGKDHW SAGFQHFWLH
GLTCGHSRSY DDYCLELKAA HEGKFAHLPD DRMNYAYQLD SSLYAAFLRE RAEGDGTRRI
EGRIAAVELD GASGNIAALM LDGERRIEGD LFIDCTGFRA LLIEGALHAG FDDWTHYLPC
DSAIAVQTAS VAPPVPYTRA IAHDAGWQWR IPLQHRQGNG IVYCSRYLAK DDALDRLLGS
IEGDVLTEPN FIGYRTGARR KQWYRNCVAV GLSGGFMEPL ESTSIHLIQR AVLRLIRMLP
SGPVSERDIA EFNDQQFADM EQIRDFLILH YKVTERRDSP FWRQCAAMPI PASLEQKIEL
FRETGRVFRR NEELFVENSW VQVMMGQGIM PQRYHPIAAK LRPDELEAFL SMLRDGVERT
VASLPAHGAY IARYCAVGGR NDA
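
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of translation table 11 (the table listed in the record), which are the same as those of the standard genetic code; table 11 differs mainly in which codons may act as start codons, and that does not matter here because this gene begins with ATG. The `translate` function is a hypothetical helper written with the standard library only.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout with the third base
# varying fastest; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last two blocks of the gene sequence above.
print(translate("ATGACGGGCA TGGACAGCAG CACCGCCAGG"))  # MTGMDSSTAR
print(translate("AACGACGCAT GA"))                     # NDA
```

The two outputs match the first ten and last three residues of the protein sequence listed above; the final TGA codon is the stop and contributes no residue.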