Gene Sde_2807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2807 
Symbol 
ID3968286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3540956 
End bp3542467 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content46% 
IMG OID637921904 
Producttryptophan halogenase, putative 
Protein accessionYP_528276 
Protein GI90022449 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000313891 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCCAC TGCAAAAAAT AGTGATTCTA GGCGGCGGCA CCACAGGCTG GATGTGCGCG 
GCAATGCTCG CTAACAGCGT AAATACCAAG CGCATAAATA TAGAGCTGGT GGAATCTGAA
CAAATAGGCA CAATCGGTGT GGGCGAATCT ACGGTACCGC CGTTTATGGA TATGCTGGCG
TCGCTGGGTA TAGATGAGGT GGAGTTTATT CAAGCCACTC AAGCCACTTT TAAGCTGGGA
ATTCAATTTA AAGATTGGCT TAAAAAAGAC GAAACTTTTT TTCACCCCTT CGGCCGCGCC
GAAGCCGGTA TGGATGAACT TAGCCTTTAC CATTTATGGC TGCGCGCGCA ATTAAACGGC
GACACTTTTT CGCGTTTTTT AGATTACTCC CCCAACAGTG TAATGGCACA ACAAAAACGG
TTTGCCCCAT ACAAAGCCGT GCCCGGCACA CTACTGGCAG ACTCGCGCTA TGCATTACAT
TTAGATGCGG GCTTAGTAGC AAAATACTTG CGTAACTTCG CGCAACAAAA AGGCGTTAAA
CGAACAGAAG GTAAGGTAGA AAAAGTAAAT ACATCTATAG ACACAGCACC GCGCATTTTA
TCGCTGCAAC TAGAAAGCGG CCAAACAATT AACGGCGATT TTTTTATAGA CTGCTCTGGC
TTTAGGGCGC TGCTTATCGG CGATGCTTTG CAGTCTAGCT TTACCGATTG GTCTACCTAT
TTACCCTGTA ACCGTGCAGT TACCGTACAA AGTGAAGCGC TACCCGAACT GCCGCCCTAC
ACAAAAGCCA CTGCACAATT AGCTGGCTGG CAATGGCGCA TACCGCTACA ACATCGCACG
GGCAACGGTT ACGTGTATGC AAGTAAATAC ATTAGCGATG AGCAAGCTAC GCAAACACTG
CTAAGTAATA TACAAGGTAA AACATTAACT GAACCGCGTA TAATTCCTTT CACTACGGGC
ATGCGCAAAC AAGTTTGGAA AGCAAACTGC ATTGGCGTTG GGCTTGCCGC TGGTTTTATA
GAGCCTCTGG AATCCACCGC TATACATTTA GCTATGCGCG GTATTGCAGA ATTTTTACAG
CAGTTTCCAC ACAGCGATTG CAACCCAGCA TTAATTAACG AATACAATGC ACGCCTACAA
CAGGATTACG AAGAAATTCG CGATTTTATT ATTTTGCATT ACGCTGCAAC CGAGCGAAAT
GACAGTGACT TTTGGGCCTA CTGCAAAAAT GTAGACTGGC CAGCATCGCT AGTGCAAACG
GTTGAGTATT TTAAAGCTCG CGGCGACGTG CCGCGCAAGC TTGCCCCACT TTTTGAAAGT
ACAAGCTGGC GCAGCCTTTG CGAAGGCATG AACATTCGGC CTAGCGCTTA TTCTGCCTTT
ATTGAAAACG CGGATTACAA AGCCAGCAAA CAACACATGC AAAACTACAA GCAGCAGCTT
GTAAGCCTCG TAAAACAAAT ACCCACCCAC AAGGCGTTTA TTGAACAAAA CTGTGCGAGC
AGCAGCAAAT AA
 
Protein sequence
MKPLQKIVIL GGGTTGWMCA AMLANSVNTK RINIELVESE QIGTIGVGES TVPPFMDMLA 
SLGIDEVEFI QATQATFKLG IQFKDWLKKD ETFFHPFGRA EAGMDELSLY HLWLRAQLNG
DTFSRFLDYS PNSVMAQQKR FAPYKAVPGT LLADSRYALH LDAGLVAKYL RNFAQQKGVK
RTEGKVEKVN TSIDTAPRIL SLQLESGQTI NGDFFIDCSG FRALLIGDAL QSSFTDWSTY
LPCNRAVTVQ SEALPELPPY TKATAQLAGW QWRIPLQHRT GNGYVYASKY ISDEQATQTL
LSNIQGKTLT EPRIIPFTTG MRKQVWKANC IGVGLAAGFI EPLESTAIHL AMRGIAEFLQ
QFPHSDCNPA LINEYNARLQ QDYEEIRDFI ILHYAATERN DSDFWAYCKN VDWPASLVQT
VEYFKARGDV PRKLAPLFES TSWRSLCEGM NIRPSAYSAF IENADYKASK QHMQNYKQQL
VSLVKQIPTH KAFIEQNCAS SSK