Gene Sde_2498 details

Gene Information

Locus tag: Sde_2498
Symbol:
ID: 3968780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 3158965
End bp: 3160470
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 45%
IMG OID: 637921589
Product: tryptophan halogenase, putative
Protein accession: YP_527970
Protein GI: 90022143
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACAGTA AGCAAGTTAA GAAGATAGTA ATCGTAGGTG GTGGTACAGC AGGGTGGATG 
GCAGCCGCCA TGCTTGCTTG TCGCTACTCG CGAGAAGATT TAGCTATACA GCTTGTAGAG
TCTGATGCAA TTGCTACTGT TGGAGTGGGT GAGGCTACGG TACCGGGGAT AATCCAGCTG
CATCAGCACC TTGGTATTAA AGAAAGTGAG TTTGTAAGCG CAACCAATGC AACCTTTAAG
CTTGGTATAG AGTTTAAAAA TTGGAGCCAG TTGGGGGCTA CCTTTTTTCA CCCCTTTGCA
AAATACGGCG CGCCTATTGC GGGGCAGGCA TTTTTTGATT GCTGGCTGCG CTTAAAACAA
GCGGGGTATA CGGCAAAATT AGATGAGTTT TCTTTATCTA TTGCCTTGGC AAAGGCAAAT
AAATTTGTAC AACCCGACGA TAACGCCACC AATCAATTGG CAATGTTTGG CTACGCGTAT
CACTTTGATG CAACGCTTTA TGCAAAGTTT TTGCGTGCCT ACGCAGAGCA ACGCGGAGTG
CAGCGTACCG AAGGCTTAAT TACACAAACG TACTTACAGG CAGATGGCAA TATAGAATGC
GTGGAGCTGG CAAGTGGCGA AAAAATTGCG GGCGATTTAT TTTTTGATTG CTCAGGCTTT
CGCGGCCTAC TTATAGAAGA GGCACTGCAA ACTGGTTATC AAGATTGGAG CCATTGGCTA
CCCTGTAATA AAGCGGTAGC AGTGCAAACA ATAAACGAAA AACCACCTAC GCCCTATACG
CGCTCTACTG CTTTAGCCGC TGGGTGGCAG TGGACAATTC CCCTACAGAA TCGCATAGGC
AATGGTTATG TGTTTTGCGA CCGCTATATA TCGGACGACG AAGCAATAGC CACCTTAACC
CGTAATGTAG AACGCGAAAT GCTTACCGAG CCAAGAGTAA TAGGGTTTAA CGCCGGCGTG
CGCAACAAGT TTTGGAATAA AAATTGTGTG GCTATTGGTT TGGCGAGTGG GTTTATCGAG
CCATTAGAAT CCACCAGTAT TTCACTTATT CAAACCGGCG TAGAAAAAAT AATGGATGCG
ATGCCAGCGT TGGAATACAG CGAAAATACG ATAGCTTCAA CCAACTCGTT AAATCAGCAA
GAATATGAGC GCATACGCGA TTTTATTGTT TTGCATTACA AAGCCAGCGC CCGCGAAGAC
AGCGCGTTTT GGCGTGATGT GCGAGAAATG GATATACCCA CAACACTACA AAATAAAATG
AGTGCTTACT TAAAAGATGC AACATTTTTA GATTACGGCC AAGAATCTTT TAAAGATGCA
AGTTGGCAAA CCATGTATAA CGGTTTTAAT CTTTACCCGC AAATACCTCC AAGTAATGTT
GCTGATCTAG ATGTGCAGCA GCTAATGCTT GTGGCCGAGA AAATGCGTGC AGCTATTCAA
GCAGGGGTGG CTCACGCACC CAGTCATGCA GAGTTTCTTT CTACACTCGC CGACGGCAAA
TTCTAA
 
Protein sequence
MHSKQVKKIV IVGGGTAGWM AAAMLACRYS REDLAIQLVE SDAIATVGVG EATVPGIIQL 
HQHLGIKESE FVSATNATFK LGIEFKNWSQ LGATFFHPFA KYGAPIAGQA FFDCWLRLKQ
AGYTAKLDEF SLSIALAKAN KFVQPDDNAT NQLAMFGYAY HFDATLYAKF LRAYAEQRGV
QRTEGLITQT YLQADGNIEC VELASGEKIA GDLFFDCSGF RGLLIEEALQ TGYQDWSHWL
PCNKAVAVQT INEKPPTPYT RSTALAAGWQ WTIPLQNRIG NGYVFCDRYI SDDEAIATLT
RNVEREMLTE PRVIGFNAGV RNKFWNKNCV AIGLASGFIE PLESTSISLI QTGVEKIMDA
MPALEYSENT IASTNSLNQQ EYERIRDFIV LHYKASARED SAFWRDVREM DIPTTLQNKM
SAYLKDATFL DYGQESFKDA SWQTMYNGFN LYPQIPPSNV ADLDVQQLML VAEKMRAAIQ
AGVAHAPSHA EFLSTLADGK F
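
The sequences above are printed in blocks of ten characters, and the record lists a gene length of 1506 bp against a protein length of 501 aa. The following is a minimal Python sketch of how those fields relate, assuming an unspliced CDS that ends in a single stop codon (the gene sequence here ends in TAA). The function names are hypothetical helpers for illustration and are not part of any database tooling.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Demonstration on the first two blocks of the gene sequence only.
fragment = clean_sequence("ATGCACAGTA AGCAAGTTAA")
print(len(fragment))                  # 20
print(gc_percent(fragment))           # 35.0
print(expected_protein_length(1506))  # 501
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` would give a value to compare against the GC content field; the 35.0 shown here applies to the 20-base fragment only.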