Gene Saro_1889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1889 
Symbol 
ID3917110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1998001 
End bp1999515 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content64% 
IMG OID640444633 
Producttryptophan halogenase 
Protein accessionYP_497163 
Protein GI87199906 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.140819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCCTT CGAACAAGCG CAGGATAGTC ATCGCCGGTG GCGGGACGGC CGGATGGATG 
ACGGCCGCCG CGCTTGCCCG CTTTGCCATG CCGCACTGGC AGGTCACGCT CGTCGAATCC
GAAGAGATCG GGACGATCGG CGTCGGCGAA GCGACCATTC CGATGGTGCG GCTATTCAAC
CGCTCGCTGG GGATCGACGA ACGCCAGTTC CTGCGCGAGA CCCACGGTAC GTGGAAGCTG
GGCATCGCCT TCGAGGGCTG GGGCGGCTTC GACGAGCGTT ACATCCATGG CTTCGGACTG
ACAGGCCGGA GCCTTGGCGT CCTGCCCTTT CACCATTACT GGTTGCGGGG TCGCGCCATG
GGAGCCGCAG GACCGCTGGG CGATTACGTG CTTAATGCCG TCGCCTGCGA GGAAGGCCGT
TTCGCCCACA TCGACCGTCC AGAGGGCAGC GCATTGCCGC CGATGCCCTA TGCCTATCAT
TTCGACGCGT CGCTCTATGC CGCGTTCCTG CGGCGCTATG CCGAAGAGCG TGGCGTGCTG
CGGATCGAAG GTCGTATCCA GTCGGCCGCG CGCGATTGTT CCACCGGTGA CATTGCCTCG
CTGAAGCTTG CCGATGGCCA CGAGGTGGAG GGCGATCTGT TCATCGATTG CTCGGGCTTT
CGCAGCCTTC TCCTCGGACA GGAAATGGGC GTCGAGTACG TCGACTGGAG CCACTGGCTG
CGCTGCGACC GCGCGGTTGC CGTTCCGTGC GAGCGCGCCG GGCCGCTGGT TCACTATACG
CGGTCCATTG CCCAGCCCGC AGGCTGGTGC TGGCGCATCC CGCTGCAGCA CCGGACCGGC
AACGGCCACG TGTTCTGTTC CGACGCGATG GGCGAGGACG AGGCGACCGC GCGCCTTCTG
GGAAGGCTCG ACGGGCGCCA ACTGGCCGAG CCGCGCACGA TCCGGTTCAA GGCAGGGCGG
CGCGAGACGT TCTGGAAGAA CAACGTGGTT GCGGTCGGCC TGTCCTCCGG CTTCATCGAA
CCTCTGGAAT CGACTTCCAT TCACCTGATC CAGACGGCAA TCAGCCGGAT CATCGATTTT
TTGCCGAACG GGCCGGTACA GGCAGCGGAC CGCGATGCGT ACAACCGGCT TTCGGTCTTC
GAGATCGAAC GCATCCGCGA CTTCGTGATC CTTCACTACG TCGCAAATGG CCGTCACGGA
GAGCCGTTCT GGGATAGCCT CAGAGCGATG GAAATTCCCG AAACGCTGGC TAAGCGCATC
GCCATGTTCC GCGCATCGGG CCGCATCGTG CGGGAACACG AAGAGCTTTT CGACGTGCCC
GGTTGGGTGC AGGTAATGGT GGGGCAGGGG ATCATGCCGG AGCGGTGGCA CCCTCTTGCC
GACCAATTGG ACCGGACCAA CCTTTCCGCG TTCCTCGAAA CCGTCAGCGA AGCCTACAGA
AAGGACGTTG CGCGAATGCC CCTTCACGCC GACTACCTCG AACATGTCTG CGGAAAGGTG
GCTGCCCATG CGTAG
 
Protein sequence
MPPSNKRRIV IAGGGTAGWM TAAALARFAM PHWQVTLVES EEIGTIGVGE ATIPMVRLFN 
RSLGIDERQF LRETHGTWKL GIAFEGWGGF DERYIHGFGL TGRSLGVLPF HHYWLRGRAM
GAAGPLGDYV LNAVACEEGR FAHIDRPEGS ALPPMPYAYH FDASLYAAFL RRYAEERGVL
RIEGRIQSAA RDCSTGDIAS LKLADGHEVE GDLFIDCSGF RSLLLGQEMG VEYVDWSHWL
RCDRAVAVPC ERAGPLVHYT RSIAQPAGWC WRIPLQHRTG NGHVFCSDAM GEDEATARLL
GRLDGRQLAE PRTIRFKAGR RETFWKNNVV AVGLSSGFIE PLESTSIHLI QTAISRIIDF
LPNGPVQAAD RDAYNRLSVF EIERIRDFVI LHYVANGRHG EPFWDSLRAM EIPETLAKRI
AMFRASGRIV REHEELFDVP GWVQVMVGQG IMPERWHPLA DQLDRTNLSA FLETVSEAYR
KDVARMPLHA DYLEHVCGKV AAHA