Gene Saro_1882 details

Gene Information

Locus tag: Saro_1882
Symbol:
ID: 3917103
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1987056
End bp: 1988561
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 64%
IMG OID: 640444626
Product: tryptophan halogenase
Protein accession: YP_497156
Protein GI: 87199899
COG category:
COG ID:
TIGRFAM ID:
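
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length does not count the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1987056
end_bp = 1988561

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.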


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0761286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGACGT CGGACCGGAC GATCCGTTCT GTGGCAATTG TCGGCGGCGG CACGGCGGGC 
TGGATGACGG CGGCTGCACT GGCGCAGGCG CTTAGGCACA ACTGCCGGAT AACGCTGGTT
GAATCGGACG ACATCGGCAC CGTGGGCGTC GGCGAGGCGA CGATCCCTCC GATCCGCACC
TTCAACGAAA CCCTGCAGAT CGACGAGCGC GAGTTCGTCC GCAAGACGCA AGGGACGTTC
AAGCTCGGCA TCGAGTTCGT CGACTGGGCC CGGGTCGGAA ACCGCTATTT TCACCCCTTC
GGCCCGCACG GGCGTGCCTT CGACATGGTG AACCTGCACC ACTATTGGCT GCGCGCCAGG
GCGGAGGGGG AGACGGCACC GCTGGACGAG CACTCGATGG CTTGGGCGCT GGCCAGGGAA
AATCGCTTTG CGCCACCAAT GCCCGACCAG CGCAACGTGT TGTCCACGTT CGATTTTGCC
TATCACTTCG ATGCCGGCCT CTATGCCCGT TTCCTGCGCG AATATGCCGA GGCGAGGGGT
GTCGTCCGCA TCGAGGGCAA AATCGGCAGC GTTCAGCAGA ATGGCGAAAC CGGCTTCGTC
ACTGGCGTCA CGCTGGAAGA CGGCCGCGCG GTCGAGGCCG AGCTTTTCGT CGATTGCTCG
GGCTTCCGCG GCCTCCTGAT CGAGGGGGCG CTGCAGGCGG GGTACGAGGA TTGGACCCAT
TGGCTGCCAT GCGACCGCGC GATGGCGGTG CCTTGCGAAA ATGCCTATCC GCTGACGCCC
TACACCCGTT CGACCGCCCG CGAGGCGGGG TGGCAGTGGC GCATTCCGCT GCAGCATCGC
ACCGGCAACG GCTACGTGTT CTGCAGCCAG TTCCTGTCGG AGGACGAAGC GGCGGAAAAG
CTGCTGTCGC GCCTCGACGG CAAGGCGCTG GCCGATCCGC GTCCGCTGCG GTTCGTCACG
GGGCGGCGCA AGAAGTTCTG GGACAGGAAC GTGATCGCCA TCGGCCTGTC GAGCGGCTTT
ATGGAGCCGC TGGAATCGAC GTCTATTCAC TTGATCCAGG CTGGAATTTC CAAGCTTCTC
GCGCTTTTCC CCGATCGCGG CTTCGACCCT ATCGTGATCG ACGAGTACAA CCGCATCGCG
GTGAGCGAGT TCGAGCGCAT CCGCGATTTC ATCATCCTGC ACTACAAGCT CACCGAGCGC
GACGATGCGG AGTTGTGGCG CTATTGCGCG GCGATGGACA TCCCGGACAC GCTCAAGACG
AAGATCGAGC ATTTCCGAAG CTTTGGCAGG CTGGTTCAGC GCGATGCAGA CCTGTTCGGG
CCGCCATCGT GGCTCGCGGT GCACATCGGT CAGCTCAATT TTCCCGAACG GACCGATCCA
CTGGCGGACT ATCGCGGCAT CGACGGGCGC GAATGGCTGG CGAAGCTGCG GGCGGCGATG
CACCACGCGG CGATGCAGCA ACCCACGCAC GAGCAGTTCA TCGCGGCGAA CTGCGCGGCG
GCATGA
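
A GC content figure such as the one reported in the Gene Information fields is the fraction of bases that are G or C. The following is a minimal sketch of that calculation, assuming the sequence is pasted as a string with the 10-base spacing used above. The function name `gc_content` is hypothetical, and the database may round or compute its value differently.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence above: 6 of its 10 bases are G or C.
first_block = "ATGGCGACGT"
print(gc_content(first_block))
```

Passing the full gene sequence to the same function gives a value that can be compared with the reported percentage.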
 
Protein sequence
MATSDRTIRS VAIVGGGTAG WMTAAALAQA LRHNCRITLV ESDDIGTVGV GEATIPPIRT 
FNETLQIDER EFVRKTQGTF KLGIEFVDWA RVGNRYFHPF GPHGRAFDMV NLHHYWLRAR
AEGETAPLDE HSMAWALARE NRFAPPMPDQ RNVLSTFDFA YHFDAGLYAR FLREYAEARG
VVRIEGKIGS VQQNGETGFV TGVTLEDGRA VEAELFVDCS GFRGLLIEGA LQAGYEDWTH
WLPCDRAMAV PCENAYPLTP YTRSTAREAG WQWRIPLQHR TGNGYVFCSQ FLSEDEAAEK
LLSRLDGKAL ADPRPLRFVT GRRKKFWDRN VIAIGLSSGF MEPLESTSIH LIQAGISKLL
ALFPDRGFDP IVIDEYNRIA VSEFERIRDF IILHYKLTER DDAELWRYCA AMDIPDTLKT
KIEHFRSFGR LVQRDADLFG PPSWLAVHIG QLNFPERTDP LADYRGIDGR EWLAKLRAAM
HHAAMQQPTH EQFIAANCAA A
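
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first 30 bases, assuming the standard codon assignments, which translation table 11 shares with the standard code for non-start codons. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard codon assignments in TCAG order. Translation table 11 uses the
# same assignments and differs only in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break  # stop codon: end of the protein
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGCGACGT CGGACCGGAC GATCCGTTCT"))
```

The result for these 30 bases is the first block of the protein sequence, MATSDRTIRS.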