Gene Sde_1262 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1262 
Symbol 
ID3965244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1614478 
End bp1616052 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content45% 
IMG OID637920336 
Producttryptophan halogenase, putative 
Protein accessionYP_526736 
Protein GI90020909 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.69859 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000259242 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCATGT CTAAGTCCAG TACATCTAAG CCAGTTACAT CAATTGTTAT CGTCGGTGGT 
GGTACAGCAG GTTGGATTAC TGCAGGCACC CTCGCCGCCC ACCTAAACTC AAACAGCCAA
GCGACAGTTA CCGTTACACT CGTCGAGTCG CCAACTACAC CCACAATAGG GGTAGGCGAA
GGCACGTGGC CCACTATGCG TAACACGCTT AAAAAAATGG GTGTGCGCGA AACCGATTTT
ATACGCGAAT GCAATGCAAC GTTTAAACAA GGGGCTAAAT TCTGTGGCTG GACAACCGGC
GAACAAAATG ATGGGTACTA TCATCCGCTT GTTTTGCCCC AAGGCTATGC CGATATAAAT
CTTGCTCCCC ATTGGTTAGC TAGCAATAAA GCCACGCCAT TTGCGCAATC GGTGTGCGTA
CAAGCAGCAC TGTGCGAAAC GGGTCTCGCC CCTAAATTAA TAAGCACTGC AGAGTATGCG
GCCATCGCAA ATTATGCTTA CCATTTAGAT GCGGGCGCAT TTACCCGATT TTTAACAAAG
CACTGCACAC AAAATTTACA CGTACAGCAC GTATTGGCAG ATGTAACTAG TGTGCAGGCA
AAAGAAAATG GCGATATAGA ATCCGTTAAC ACTTTGCAGG CAGGTAATAT TTATGGCGAC
CTATTTATAG ATTGCACAGG TTTCGAAGCG TTATTAATAG GCAAGCATTT CAAAGTACCG
TTTATTTCGT GCAAACATAC ACTGTTTATT GATACCGCTC TCGCTGTGCA CTTGCCTTAC
ACAGACGATC AATCAAACAT TGCTTCGCAC ACTATCTCCA CCGCGCAAAC GTCTGGCTGG
ATATGGGATA TAGGCCTACA AAACCGCAGA GGCATAGGCC ATGTATACTC CAGCGCCCAT
ACAACAGATG CCCAAGCCGA AAAAGCGCTT AGGCAATATA TCGCCAAACT ATCGCCCAAC
ACCCAAGACT TAACTGTACG AAAAATACCT ATCGAGCCAG GCCACAGGCA AACATTTTGG
CAAAATAATT GTGTGGCAAT TGGCCTGTCT GCAGGCTTTT TAGAGCCGCT AGAAGCATCA
GCATTAGTGC TTATCGAAAT GTCTGCTACT ATGCTTGCCG AACAGTTACC CACCACACGC
GCAAGCATGG CTATTATTGC TAAACGATTT AATCAAACTA ATACGTATCG CTGGCAACGC
ATTATCGACT TTTTAAAATT ACACTACACC TTAAGCAAAC GCACAGACAG CGACTTTTGG
ATAGATAACC GCGCCCCTAA CACCAACCCA GATAGCCTAA AAGAGCTGCT CGAGTTGTGG
AAATACCATT ACCCTTGGCA CAGCGATTTC GACCGCGCGG CAGAAGTATT TCCATCTGCT
AGCTATCAAT ATTTATTGTA TGGCATGCAG TACCCCACGC AGTCTAGCCA TTTGGGGATG
TCTGATAAAA ACATTGCATT AGCAAATAAA CTATTCACCC AAAACAAAGC ATTAACCCAA
AAGCTATTAA GCTCCCTTCC TAGCAACCGA GAACTAATAA ATAAAATAAA AACGTACGGA
CTAGCGCAAA TTTAA
 
Protein sequence
MSMSKSSTSK PVTSIVIVGG GTAGWITAGT LAAHLNSNSQ ATVTVTLVES PTTPTIGVGE 
GTWPTMRNTL KKMGVRETDF IRECNATFKQ GAKFCGWTTG EQNDGYYHPL VLPQGYADIN
LAPHWLASNK ATPFAQSVCV QAALCETGLA PKLISTAEYA AIANYAYHLD AGAFTRFLTK
HCTQNLHVQH VLADVTSVQA KENGDIESVN TLQAGNIYGD LFIDCTGFEA LLIGKHFKVP
FISCKHTLFI DTALAVHLPY TDDQSNIASH TISTAQTSGW IWDIGLQNRR GIGHVYSSAH
TTDAQAEKAL RQYIAKLSPN TQDLTVRKIP IEPGHRQTFW QNNCVAIGLS AGFLEPLEAS
ALVLIEMSAT MLAEQLPTTR ASMAIIAKRF NQTNTYRWQR IIDFLKLHYT LSKRTDSDFW
IDNRAPNTNP DSLKELLELW KYHYPWHSDF DRAAEVFPSA SYQYLLYGMQ YPTQSSHLGM
SDKNIALANK LFTQNKALTQ KLLSSLPSNR ELINKIKTYG LAQI