Gene Saro_1607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1607 
Symbol 
ID3918715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1674284 
End bp1675801 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content65% 
IMG OID640444347 
Producttryptophan halogenase 
Protein accessionYP_496881 
Protein GI87199624 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGTC CGAACCCCGC AGTCCCCGCC CTGCCGCGCC GCATCGTCAT CGCCGGGGGC 
GGAACTGCCG GCTGGATGAC CGCCGCAGCG CTTGCCCGGA CGCTCGGCAA GGTGGCGCAG
GTCACGCTTG TCGAAAGCGA GCAGATCGGC ACCATCGGCG TGGGCGAAAG CACGATCCCG
CCGCTGGTGG CCTACAATCG CATTCTCGGC ATCTCCGAGG CCGAATTCAT GCGCGCCACG
CAGGCCACCT TCAAGCTCGG CATCAATTTC GAGAACTGGC GCGTGCCGGG CGAGGCCTAC
TTCCATTCCT TCGGCGGCAC CGGCAAGGAT CACTGGTCGG CCGGCTTCCA GCACTTCTGG
ATGCACGGCC TCGCACGCGG CCACGACGAG GCCTACGGCG AGTATTGCCT GGAGCTCAAG
GCCGCGCAGG CCGGCCGTTT CGCGCACCTT CCGGAAGACC GGATGAACTA CGCCTACCAG
CTCGATTCCG GCCTCTACGC CCGCTTCCTG CGACAGATGG CCGAAGCCGA TGGTACGGTC
CGGATCGAGG GCAAGATCGG CGGGGTTCCG CTCGATCCCG AAACCGGAGA CATCGCCGCG
CTTGTTCTCG AGGATGGTCA GCGCATCGAA GGTGACCTTT TCATTGACTG CACCGGCTTT
CGCGCGCTGC TGATCGAGGG CGCTCTCCAC GTCGGCTATG ACGACTGGAC CCACTGGCTG
CCCTGCGACG GCGCAATTGC CATCCAGACG TCAAGCGTTC GCGCTCCCGT TCCCTATACC
CGCGCAATCG CGCACGGCAG CGGATGGCAA TGGCGCATTC CGTTGCAACA CCGCCAGGGC
AACGGCATCG TCTATTGCAG CGCCTACATG GACCACGAGG CAGCGCTGGA AATGCTGCTG
TCCACGGTGG AAGGCGAAAA GCTCGTCCGG CCCAATCCGA TCCGCTTCCG CACGGGGGTG
CGGCGCAAGC AATGGCATCG CAACTGCGTT GCCGTTGGTC TTTCGGGCGG CTTCATGGAA
CCGCTGGAAT CGACGTCGAT CCACCTCATC CAGCGTGCGA TCCTGCGCAT CGTGCGGATG
CTGCCGGCAG GAACGATCAG CGCACGTGAC GTTGCCGAGT TCAATGACCA GCAGATGCTC
GACATGGAGC AGATCCGCGA TTTCCTGATT CTACACTACA AGGCGACCGA CCGCCGCGAC
ACGCCGTTCT GGCGCCACTG CGCGAGCATG GAGGTTCCGG AAAGCCTCGC TCACCGGATC
GAGCTGTTCC GCGAGACCGG CCGCGTCTTC CGTCGCAACG AGGAGCTGTT TGCCGAAAAC
TCGTGGGTCC AGGTGATGAT GGGGCAGGGA ATCGTGCCGC AGAGCTATCA CCCGGTTGCG
GCCAAGCTGC GCGACGAGGA ACTGGCGCAT CTGCTCCAGA CCCTGCGCGA GCAGGTCGAT
CGCACCGTTG CCGCGCTGCC CGCGCACGGT GATTATATCG CGCGCTACTG TGGCGCCGAA
ATGCCGGCGG CTGCATGA
 
Protein sequence
MKRPNPAVPA LPRRIVIAGG GTAGWMTAAA LARTLGKVAQ VTLVESEQIG TIGVGESTIP 
PLVAYNRILG ISEAEFMRAT QATFKLGINF ENWRVPGEAY FHSFGGTGKD HWSAGFQHFW
MHGLARGHDE AYGEYCLELK AAQAGRFAHL PEDRMNYAYQ LDSGLYARFL RQMAEADGTV
RIEGKIGGVP LDPETGDIAA LVLEDGQRIE GDLFIDCTGF RALLIEGALH VGYDDWTHWL
PCDGAIAIQT SSVRAPVPYT RAIAHGSGWQ WRIPLQHRQG NGIVYCSAYM DHEAALEMLL
STVEGEKLVR PNPIRFRTGV RRKQWHRNCV AVGLSGGFME PLESTSIHLI QRAILRIVRM
LPAGTISARD VAEFNDQQML DMEQIRDFLI LHYKATDRRD TPFWRHCASM EVPESLAHRI
ELFRETGRVF RRNEELFAEN SWVQVMMGQG IVPQSYHPVA AKLRDEELAH LLQTLREQVD
RTVAALPAHG DYIARYCGAE MPAAA