Gene Saro_1604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1604 
Symbol 
ID3918712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1671027 
End bp1672568 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content67% 
IMG OID640444344 
Producttryptophan halogenase 
Protein accessionYP_496878 
Protein GI87199621 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGAGG GTGAGGTCAG GCGGATCGTG ATCGTCGGCG GCGGGACCGC CGGCTGGCTG 
TCGGCCTGCT TCCTTGCCGC GGAATTGCGT TCGCGCAGGG TCGACGTCTC GATAACCCTG
ATCGAGGCAC CCGACATTCC CACGATCGGC GTGGGCGAGG GGACCTGGCC GACGATGCGC
GGTACCTTGC GCGCCATCGG CATCGCAGAG GACAATTTCC TTGCCGCCTG CGATGCCTCG
TTCAAACAGG GATCGGTCTT CTCCGGTTGG GTCACCGGAG AGCCGGGGGA CCGGTACCAG
CATCCGTTCA CGCCACCACC CGCCGGGGCG CTCGACGACA TCGTCGCGGC CTGGGCGGGC
AGGGGCGACC AGCCTTTCGC AAGCGCAATG ACCGCGCAGG CGGCGGTGAT CGACCGGGAC
CTTGCGCCGC GCCAGCCGAA CATGCCGTTC TATGCCGGCG CCCTGAACTA CGCCTACCAC
CTCGACGCCG GAAAGTTCGC CGCACTTCTT TCCTCCCGTG CGACCGGACA CCTTGGCGTC
ACCCACGTCC GCGATCGTGT CCGCAACACG GTCGCGGACG ACAGGGGGCA TCTGCTGGCG
GTGGAACTGG CCTCTGGCGA CCGGGTCGAG GGCGATCTCT TCATCGATTG TTCGGGCCAT
GCCGCGCTGC TGATTGGCGG CGCCTGCGGC GTCCCCTTCA TTGACCGCAG CGACGTCCTG
CTCAATGATC GCGCGCTCGC GGCGCAAGTG TCCGTGCAGC CGGGCACGCC GGTAGCTTCT
GCCACCGTCG CAACCGCACA CCGCGCCGGC TGGCTGTGGG ACATCGGGCT TCCCGGCAGG
CGCGGCATCG GCTGCGTCTA TTCATCCCGC TTCCTGTCCG ACGACGATGC GCTGGCAGTG
CTGCGCGCCT ATGTCTCGGC AAACGTCCCG GGCGCAGATC CTGCGGCGGT AGAGCCTCGG
CGCATCGCCT TTCCGACCGG GCACAGGGAG CGTTTCTGGC ATGGCAACTG CATCGCCATT
GGCCTTTCGG CGGGCTTTCT CGAACCGCTC GAGGCCTCGG CCATCGTGAT GATCGAGCTT
TCACTCCGAG CGCTGGCCGA GAATTTCCCC CGCCAGCGCC AGGCCATGCC GTTCCTCGCC
GGCCGCTTCA ACGACCTGTT TCGCTATCGC TGGGACCGGG TGATCGAGTT CCTCAAGCTT
CACTACCTGC TCTCGAAGCG GCAGGAGCCC TACTGGCAGG CACAGCGCGA TCCGGCCGCT
GTGCCGCAGA GGTTGGCCGA ACTTGTCGCG CTCTGGCGTC ACCAGCCGCC GAGCGCATGG
GATTTCCCGC AGGTGGACGA GATATTCTCG GCCGAGAGCC ATGCCTACAT TCTCTATGGC
ATGGGCTTTC CACCGCCTGC CTCCACCGCG CGCGATCCGC TTGCCGCCCG CGCGCTAGAT
GACGTGGCGC AGCGCACCCG CGCCTTGTGT GCCGCGCTGC CGACGAACCG CAGCTACCTG
TCTTCCCTCA TGCCCGCGCA AGGAGCCGCC GCCCTCCCAT GA
 
Protein sequence
MDEGEVRRIV IVGGGTAGWL SACFLAAELR SRRVDVSITL IEAPDIPTIG VGEGTWPTMR 
GTLRAIGIAE DNFLAACDAS FKQGSVFSGW VTGEPGDRYQ HPFTPPPAGA LDDIVAAWAG
RGDQPFASAM TAQAAVIDRD LAPRQPNMPF YAGALNYAYH LDAGKFAALL SSRATGHLGV
THVRDRVRNT VADDRGHLLA VELASGDRVE GDLFIDCSGH AALLIGGACG VPFIDRSDVL
LNDRALAAQV SVQPGTPVAS ATVATAHRAG WLWDIGLPGR RGIGCVYSSR FLSDDDALAV
LRAYVSANVP GADPAAVEPR RIAFPTGHRE RFWHGNCIAI GLSAGFLEPL EASAIVMIEL
SLRALAENFP RQRQAMPFLA GRFNDLFRYR WDRVIEFLKL HYLLSKRQEP YWQAQRDPAA
VPQRLAELVA LWRHQPPSAW DFPQVDEIFS AESHAYILYG MGFPPPASTA RDPLAARALD
DVAQRTRALC AALPTNRSYL SSLMPAQGAA ALP