Gene Sde_3053 details


Gene Information

Locus tag: Sde_3053
Symbol:
ID: 3967658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 3905506
End bp: 3907032
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 45%
IMG OID: 637922150
Product: tryptophan halogenase, putative
Protein accession: YP_528522
Protein GI: 90022695
COG category:
COG ID:
TIGRFAM ID:
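
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3905506
END_BP = 3907032
GENE_LENGTH_BP = 1527
PROTEIN_LENGTH_AA = 508


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the coordinates span 1527 bp, which is 509 codons, or 508 residues plus a stop.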


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00102959
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAGTTA AAAAAGTAGA ATCCATACTT GTTGTGGGTG GTGGTACAGC TGGCTGGTTA 
ACCGCAGGCA TTATTGCGGC AAAGCATGGT ACTTCGGTAT CCATAACCGT TGTGGAGTCA
CCCAACATTA AAACGGTGGG GGTAGGCGAA GGTACTTGGC CCACAATGAA AACAACCCTG
CAAGAAATGG GCGTTTCAGA AACAGACTTT CTAAGGCAAT GTGATGCATC GTTCAAACAG
GGCGCGAAAT TTTGCCAGTG GAAAACTGGC GAGCAATCAG ACTACTACTA TCACCCGCTA
ATGCTACCGC GCGATTTCGA CGAATTTAAT AGCGCACCTT TCTGGCTCGA CCAAAAAAGC
GGAGAATCCT TCTCTAACAG CGTGTGTTTT CAGCAAGCTT TATGCGAAAA AAACCTTGCG
CCTAAAACAT TAACAATGCC CGAATACGCA GGCGCTGCAA ATTATGCCTA CCATTTAGAC
GCGGGCAAAT TTGCACCGTT TTTAACACAC CACTGCACAA AAAAATTAAA CGTTACCCAC
GTTAAAGCCA CTGTAGAAAA TGTAAAACTA ACAGATAGCG GCGAAATAGA TTACCTACTA
ACAAAAGAAG CAGGCCAACT TGAGGCCGAC CTTTATATAG ATTGCTCTGG CTTTTGTTCA
CTACTATTAG GGCAAGCGCT AGATGTGCCC TTTGTAGATA AGAGCGATAT TTTATTTTTA
GATACAGCCA TAGCCACTCA CGTACCCTAC CCAACTGAAA ATTCAGCTAT TGCTCCCCAC
ACCCTTTCTA CCGCCCAAAC AAGTGGGTGG ATATGGGATA TAGGCCTGCA AAGCAGACGC
GGCGTAGGCC ATGTTTACTC TAGTAAGTAC ATCGATGATC AAACAGCAAA GCAGCAATTG
GCCGACTACC TGTGCACCGA TGTGCAATCG CTAGAAACCA AAACAATTCC TATGCCCTGC
GGTCACCGAG AGAAGTTTTG GCAAAAAAAC TGCGTAGCCA TTGGCCTAGC CGCTGGTTTT
TTAGAGCCAC TAGAAGCATC CGCTTTAGTA CTGGTGGAAA TGTCTGCCCA ATTTATTCGC
GACCAGTTAC CCGCGCACAC CAGTATTATG CCTATTGTTG AAAAACGCTT TAACACCACC
TTTCACTACC GCTGGCAGCG TATTATCGAC TTTTTAAAAT TACACTATGT GCTTAGCCAA
CGACGCGACT CCGAATTTTG GTGTGCACAG CAAGATGCAT GCTCCATACC CGAATCCCTA
CAAGAACTAT TAAACCTATG GCAATATCAA CCCCCGTGGC GCCACGATTT TCTGCACAAA
GATGAAGTTT TCCCCGCGGC AAGCTATCAA TATGTACTTT ACGGCATGGG TTTTAAAACA
CATTGCAGAG AAGACGAAGT AAACAAAGCG CGCTACCAGC AGCTACTAGA AGAAACAAGC
TTCACCAAAC ATCGCGCTAT TAAAGCCCTA CCGCCAACAC GTGAGTTGCT AAACACATTA
CATCAACACC GCATGCAGGT AATTTAA
 
Protein sequence
MKVKKVESIL VVGGGTAGWL TAGIIAAKHG TSVSITVVES PNIKTVGVGE GTWPTMKTTL 
QEMGVSETDF LRQCDASFKQ GAKFCQWKTG EQSDYYYHPL MLPRDFDEFN SAPFWLDQKS
GESFSNSVCF QQALCEKNLA PKTLTMPEYA GAANYAYHLD AGKFAPFLTH HCTKKLNVTH
VKATVENVKL TDSGEIDYLL TKEAGQLEAD LYIDCSGFCS LLLGQALDVP FVDKSDILFL
DTAIATHVPY PTENSAIAPH TLSTAQTSGW IWDIGLQSRR GVGHVYSSKY IDDQTAKQQL
ADYLCTDVQS LETKTIPMPC GHREKFWQKN CVAIGLAAGF LEPLEASALV LVEMSAQFIR
DQLPAHTSIM PIVEKRFNTT FHYRWQRIID FLKLHYVLSQ RRDSEFWCAQ QDACSIPESL
QELLNLWQYQ PPWRHDFLHK DEVFPAASYQ YVLYGMGFKT HCREDEVNKA RYQQLLEETS
FTKHRAIKAL PPTRELLNTL HQHRMQVI