Gene Sde_2391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2391 
Symbol 
ID3967911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3032944 
End bp3034524 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content47% 
IMG OID637921482 
Producttryptophan halogenase, putative 
Protein accessionYP_527863 
Protein GI90022036 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000100477 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAACAA GACCAGCCGA TCAAACTGCT CCGCTAAAAA ATATTGTTAT AGTGGGTGGC 
GGTACCGCAG GCTGGCTAAC CGCAGGGCGA CTGGCAGCAC AATTTAATAC AGGCGCAGAT
GCAGTTAAAA ATACGAACAA TATTCGCGTT ACACTTATTG AATCGCCCAA CATACCTACC
GTTGGTGTAG GTGAAGGTAC TTGGCCCACA ATGCGATCAA CCTTAATTAA AATGGGCATT
CGCGAAACAG ATTTTTTAAC CCAATGCGAC GCAACCTTTA AACAAGGCGC AAAATTTGCG
CGCTGGACAA CAGGCAAGCA AGACGATTTT TACTACCACC CGCTTATGCT GCCGCAAGGT
TTTGGCAAAA CAGACCTCGC ACAACACTGG CAAACCGTTA AACAAGTAAC CGGCCAATCT
TTTTCAGAAG CAGTTTGCAT TCAACAAGCT ATTTGTGAAA AGGGACTCGC CCCCAAAACC
ATTCGCGCCC CCGAATTTAA CGGTGCGGCC AATTACGCCT ACCACCTCAA CGCGGGTAAA
TTTGCCACCT TTTTACAAAA ACACTGCACA CAAAACTTAG GCGTTAATCA TATTCTGGAT
GACGTAAGCG GTGTTAACAT CGCAGACAAC GGCGATATAG CCAGTGTAAT AACCAAAGCA
AACGGCAATA TTGAAGGCGA TTTATTTGTA GACTGCACTG GTTTTAACGC GCTGCTAGTA
GGCAAGCACT ACCAAGTACC GTTTAAAGAC TGTAGCGATG TACTTTTCAT AGACAGTGCT
TTAGCCGTAC AACTCCCCTA CTCCAAAGCA GACTCCCCCA TTGCCTCGCA CACTATTTCT
ACCGCACAAG ATGCCGGCTG GATATGGGAT ATAGGTCTTA CCCACAGACG CGGTATTGGC
CACGTGTATT CAAGCAGGCA CACTAGCGAA AGCGATGCGC TACAAGCGCT GGCAACTTAC
ACTCAAACAG ATTGCGACAA GCTAGATGTA AGAAAAATAC CCATTAAATC GGGCCACCGC
GAAAAGTTCT GGGTAAATAA TTGCGTTGCA GTGGGCTTAG CTGCGGGGTT TCTCGAGCCA
CTAGAAGCCT CTGCACTTGT GCTGGTTGAG CTTTCAGCAC AAATGATAGC CGAGCAACTA
CCGGCTAACA GAGCAACCAT GAATATAGTG GCAAAACGTT TTAACGAAAC CTTTCTTTAC
CGCTGGGATA AAATTATCGA CTTTTTAAAA TTGCACTATT GCATTAGCCA GCGCACAGAC
ACCGCCTTTT GGCGCGACAA CTGCGACCCA GCAACCATTC CACAAAGCTT GCAAGATTTA
CTAGCGCTTT GGCAACATCG CGCCCCAAGC GACCTAGACT TTACCAGTAA CAACGAAGTA
TTCCCTGCTG CTAGCTACCA ATATGTCCTG TACGGCATGG GGTTTAATAC CCAATTTAGC
AATACAGGCT TATATAACGC AGCAGTGGCA GACGCGCACT TTATGCGCAA GCAATTGAAC
GAGGACGAAG CCCTTAAGGC ATTGCCAAGC AACCGAGAAC TATTAGAAAA AATTGCCCAA
TTCGGCTTAC AGCCGGTATA A
 
Protein sequence
MQTRPADQTA PLKNIVIVGG GTAGWLTAGR LAAQFNTGAD AVKNTNNIRV TLIESPNIPT 
VGVGEGTWPT MRSTLIKMGI RETDFLTQCD ATFKQGAKFA RWTTGKQDDF YYHPLMLPQG
FGKTDLAQHW QTVKQVTGQS FSEAVCIQQA ICEKGLAPKT IRAPEFNGAA NYAYHLNAGK
FATFLQKHCT QNLGVNHILD DVSGVNIADN GDIASVITKA NGNIEGDLFV DCTGFNALLV
GKHYQVPFKD CSDVLFIDSA LAVQLPYSKA DSPIASHTIS TAQDAGWIWD IGLTHRRGIG
HVYSSRHTSE SDALQALATY TQTDCDKLDV RKIPIKSGHR EKFWVNNCVA VGLAAGFLEP
LEASALVLVE LSAQMIAEQL PANRATMNIV AKRFNETFLY RWDKIIDFLK LHYCISQRTD
TAFWRDNCDP ATIPQSLQDL LALWQHRAPS DLDFTSNNEV FPAASYQYVL YGMGFNTQFS
NTGLYNAAVA DAHFMRKQLN EDEALKALPS NRELLEKIAQ FGLQPV