Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1882 |
Symbol | |
ID | 3917103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1987056 |
End bp | 1988561 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444626 |
Product | tryptophan halogenase |
Protein accession | YP_497156 |
Protein GI | 87199899 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0761286 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACGT CGGACCGGAC GATCCGTTCT GTGGCAATTG TCGGCGGCGG CACGGCGGGC TGGATGACGG CGGCTGCACT GGCGCAGGCG CTTAGGCACA ACTGCCGGAT AACGCTGGTT GAATCGGACG ACATCGGCAC CGTGGGCGTC GGCGAGGCGA CGATCCCTCC GATCCGCACC TTCAACGAAA CCCTGCAGAT CGACGAGCGC GAGTTCGTCC GCAAGACGCA AGGGACGTTC AAGCTCGGCA TCGAGTTCGT CGACTGGGCC CGGGTCGGAA ACCGCTATTT TCACCCCTTC GGCCCGCACG GGCGTGCCTT CGACATGGTG AACCTGCACC ACTATTGGCT GCGCGCCAGG GCGGAGGGGG AGACGGCACC GCTGGACGAG CACTCGATGG CTTGGGCGCT GGCCAGGGAA AATCGCTTTG CGCCACCAAT GCCCGACCAG CGCAACGTGT TGTCCACGTT CGATTTTGCC TATCACTTCG ATGCCGGCCT CTATGCCCGT TTCCTGCGCG AATATGCCGA GGCGAGGGGT GTCGTCCGCA TCGAGGGCAA AATCGGCAGC GTTCAGCAGA ATGGCGAAAC CGGCTTCGTC ACTGGCGTCA CGCTGGAAGA CGGCCGCGCG GTCGAGGCCG AGCTTTTCGT CGATTGCTCG GGCTTCCGCG GCCTCCTGAT CGAGGGGGCG CTGCAGGCGG GGTACGAGGA TTGGACCCAT TGGCTGCCAT GCGACCGCGC GATGGCGGTG CCTTGCGAAA ATGCCTATCC GCTGACGCCC TACACCCGTT CGACCGCCCG CGAGGCGGGG TGGCAGTGGC GCATTCCGCT GCAGCATCGC ACCGGCAACG GCTACGTGTT CTGCAGCCAG TTCCTGTCGG AGGACGAAGC GGCGGAAAAG CTGCTGTCGC GCCTCGACGG CAAGGCGCTG GCCGATCCGC GTCCGCTGCG GTTCGTCACG GGGCGGCGCA AGAAGTTCTG GGACAGGAAC GTGATCGCCA TCGGCCTGTC GAGCGGCTTT ATGGAGCCGC TGGAATCGAC GTCTATTCAC TTGATCCAGG CTGGAATTTC CAAGCTTCTC GCGCTTTTCC CCGATCGCGG CTTCGACCCT ATCGTGATCG ACGAGTACAA CCGCATCGCG GTGAGCGAGT TCGAGCGCAT CCGCGATTTC ATCATCCTGC ACTACAAGCT CACCGAGCGC GACGATGCGG AGTTGTGGCG CTATTGCGCG GCGATGGACA TCCCGGACAC GCTCAAGACG AAGATCGAGC ATTTCCGAAG CTTTGGCAGG CTGGTTCAGC GCGATGCAGA CCTGTTCGGG CCGCCATCGT GGCTCGCGGT GCACATCGGT CAGCTCAATT TTCCCGAACG GACCGATCCA CTGGCGGACT ATCGCGGCAT CGACGGGCGC GAATGGCTGG CGAAGCTGCG GGCGGCGATG CACCACGCGG CGATGCAGCA ACCCACGCAC GAGCAGTTCA TCGCGGCGAA CTGCGCGGCG GCATGA
|
Protein sequence | MATSDRTIRS VAIVGGGTAG WMTAAALAQA LRHNCRITLV ESDDIGTVGV GEATIPPIRT FNETLQIDER EFVRKTQGTF KLGIEFVDWA RVGNRYFHPF GPHGRAFDMV NLHHYWLRAR AEGETAPLDE HSMAWALARE NRFAPPMPDQ RNVLSTFDFA YHFDAGLYAR FLREYAEARG VVRIEGKIGS VQQNGETGFV TGVTLEDGRA VEAELFVDCS GFRGLLIEGA LQAGYEDWTH WLPCDRAMAV PCENAYPLTP YTRSTAREAG WQWRIPLQHR TGNGYVFCSQ FLSEDEAAEK LLSRLDGKAL ADPRPLRFVT GRRKKFWDRN VIAIGLSSGF MEPLESTSIH LIQAGISKLL ALFPDRGFDP IVIDEYNRIA VSEFERIRDF IILHYKLTER DDAELWRYCA AMDIPDTLKT KIEHFRSFGR LVQRDADLFG PPSWLAVHIG QLNFPERTDP LADYRGIDGR EWLAKLRAAM HHAAMQQPTH EQFIAANCAA A
|
| |