Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_1262 |
Symbol | |
ID | 3965244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 1614478 |
End bp | 1616052 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637920336 |
Product | tryptophan halogenase, putative |
Protein accession | YP_526736 |
Protein GI | 90020909 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.69859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000259242 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCATGT CTAAGTCCAG TACATCTAAG CCAGTTACAT CAATTGTTAT CGTCGGTGGT GGTACAGCAG GTTGGATTAC TGCAGGCACC CTCGCCGCCC ACCTAAACTC AAACAGCCAA GCGACAGTTA CCGTTACACT CGTCGAGTCG CCAACTACAC CCACAATAGG GGTAGGCGAA GGCACGTGGC CCACTATGCG TAACACGCTT AAAAAAATGG GTGTGCGCGA AACCGATTTT ATACGCGAAT GCAATGCAAC GTTTAAACAA GGGGCTAAAT TCTGTGGCTG GACAACCGGC GAACAAAATG ATGGGTACTA TCATCCGCTT GTTTTGCCCC AAGGCTATGC CGATATAAAT CTTGCTCCCC ATTGGTTAGC TAGCAATAAA GCCACGCCAT TTGCGCAATC GGTGTGCGTA CAAGCAGCAC TGTGCGAAAC GGGTCTCGCC CCTAAATTAA TAAGCACTGC AGAGTATGCG GCCATCGCAA ATTATGCTTA CCATTTAGAT GCGGGCGCAT TTACCCGATT TTTAACAAAG CACTGCACAC AAAATTTACA CGTACAGCAC GTATTGGCAG ATGTAACTAG TGTGCAGGCA AAAGAAAATG GCGATATAGA ATCCGTTAAC ACTTTGCAGG CAGGTAATAT TTATGGCGAC CTATTTATAG ATTGCACAGG TTTCGAAGCG TTATTAATAG GCAAGCATTT CAAAGTACCG TTTATTTCGT GCAAACATAC ACTGTTTATT GATACCGCTC TCGCTGTGCA CTTGCCTTAC ACAGACGATC AATCAAACAT TGCTTCGCAC ACTATCTCCA CCGCGCAAAC GTCTGGCTGG ATATGGGATA TAGGCCTACA AAACCGCAGA GGCATAGGCC ATGTATACTC CAGCGCCCAT ACAACAGATG CCCAAGCCGA AAAAGCGCTT AGGCAATATA TCGCCAAACT ATCGCCCAAC ACCCAAGACT TAACTGTACG AAAAATACCT ATCGAGCCAG GCCACAGGCA AACATTTTGG CAAAATAATT GTGTGGCAAT TGGCCTGTCT GCAGGCTTTT TAGAGCCGCT AGAAGCATCA GCATTAGTGC TTATCGAAAT GTCTGCTACT ATGCTTGCCG AACAGTTACC CACCACACGC GCAAGCATGG CTATTATTGC TAAACGATTT AATCAAACTA ATACGTATCG CTGGCAACGC ATTATCGACT TTTTAAAATT ACACTACACC TTAAGCAAAC GCACAGACAG CGACTTTTGG ATAGATAACC GCGCCCCTAA CACCAACCCA GATAGCCTAA AAGAGCTGCT CGAGTTGTGG AAATACCATT ACCCTTGGCA CAGCGATTTC GACCGCGCGG CAGAAGTATT TCCATCTGCT AGCTATCAAT ATTTATTGTA TGGCATGCAG TACCCCACGC AGTCTAGCCA TTTGGGGATG TCTGATAAAA ACATTGCATT AGCAAATAAA CTATTCACCC AAAACAAAGC ATTAACCCAA AAGCTATTAA GCTCCCTTCC TAGCAACCGA GAACTAATAA ATAAAATAAA AACGTACGGA CTAGCGCAAA TTTAA
|
Protein sequence | MSMSKSSTSK PVTSIVIVGG GTAGWITAGT LAAHLNSNSQ ATVTVTLVES PTTPTIGVGE GTWPTMRNTL KKMGVRETDF IRECNATFKQ GAKFCGWTTG EQNDGYYHPL VLPQGYADIN LAPHWLASNK ATPFAQSVCV QAALCETGLA PKLISTAEYA AIANYAYHLD AGAFTRFLTK HCTQNLHVQH VLADVTSVQA KENGDIESVN TLQAGNIYGD LFIDCTGFEA LLIGKHFKVP FISCKHTLFI DTALAVHLPY TDDQSNIASH TISTAQTSGW IWDIGLQNRR GIGHVYSSAH TTDAQAEKAL RQYIAKLSPN TQDLTVRKIP IEPGHRQTFW QNNCVAIGLS AGFLEPLEAS ALVLIEMSAT MLAEQLPTTR ASMAIIAKRF NQTNTYRWQR IIDFLKLHYT LSKRTDSDFW IDNRAPNTNP DSLKELLELW KYHYPWHSDF DRAAEVFPSA SYQYLLYGMQ YPTQSSHLGM SDKNIALANK LFTQNKALTQ KLLSSLPSNR ELINKIKTYG LAQI
|
| |