Gene Information |
Locus tag | Ssed_0315 |
Symbol | |
ID | 5609779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sediminis HAW-EB3 |
Kingdom | Bacteria |
Replicon accession | NC_009831 |
Strand | - |
Start bp | 366721 |
End bp | 367983 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640931147 |
Product | tryptophan halogenase |
Protein accession | YP_001472056 |
Protein GI | 157373456 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
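The coordinate and length fields in this record are related by simple arithmetic, sketched below in Python as a consistency check. The function names are hypothetical and do not belong to any database API. The sketch assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp copied from the record above.
length = gene_length(366721, 367983)
print(length, protein_length(length))  # prints "1263 420"
```

Under these assumptions the result agrees with the Gene Length (1263 bp) and Protein Length (420 aa) fields.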
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0000735026 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCAGCCT CTTCATCTAC GCCAGAGACA CTGACTGATA TTGACGTTGC AATCATAGGT GCCGGGCCAT CGGGTTGTAT CGCCGCAAGC CTCCTTCATC AGCAAGGCAA GCGGGTTATT GTCATAGAGA AGCTACACTT TCCGCGCTTT TCCATCGGTG AGAGTCTACT GCCTTGCTGT ATGCAATGGC TTGAAGAGGC AAATATGCTC GACGCGGTGA ACCAGGCCGG TTTTCAGTTC AAAAACGGCG CCGCTTTTCG TTATAAAGAG CAATACACAG ATTTCGATTT CAGCGATAAA TTCACACCGG GACCGGGGGC AACCTTCCAG GTAGAACGCG CCGACTTCGA TAAACTGCTG GCAGATACGG CTGCCGAACA GGGCGTTGAT ATTCGCTACG GTGAAACGGT TACAACCGTC GACCTGATCG GCAAACCAAG ACTGACGGTA ACCGATGCCA ACGGAGAGGC ATATCTTATC GAGGCCGAGT ATCTATTGGA TGCCAGTGGG TTTGGTCGTG TTCTGCCTAA GTTACTCGAG CTGGAAAAGC CATCGACGTT AGCCAATCGC AGTGCCATAT TTACCCACAT CCAAGATAAT ATCGGCAAAA AAGAGGTCGA TGACAGGCCC TTCGACCGTA ATAAAATATT GATCAGCGTT CACCCACAAA ACAGGGATAT CTGGTATTGG CTGATCCCAT TAAGCCCTGA CAGGTGCTCT TTAGGTGTCG TGGGTGAACC CCATCTCATG GGCAACTCAG ACCTTGACCT GGAAGTGATT TTGATGGATA TGGTCAATCA GGAACCTGGC CTGAAGACGC TATTGGCTGA TGCCGAAATT CTTCGGGAGT GTGCGGAACT TAAAGGCTAC TCCGCCAATG TCAGCACCTT GGCCACCGAC AAGTTTGCCC TCTTAGGCAA TGCCGGCGAA TTCCTGGATC CGGTCTTCTC TTCAGGCGTT ACTATCGCTA TGCAATCGGC CTCTATGGCG ACTAAATGTG TCGTGAGGCA ACTCAATGGT GAAAAAATTG ATTGGCAGAG TGAATATGCA AAGCCTTTGA TGCGGGGAGT CGATACCTTC CGCACCTATG TTGAGGCTTG GTATGATGGT CGCTTCCAGG ACGTCATCTT CTATGATGCC CCGGATAACA AGATAAAACA GATGGTCTGC TCAATCTTAG CCGGCTATGC ATGGGATGAG GCCAACCCCT TAGTCGCCGA ATCAGAAAAA AGGCTCAACC TTATTGTTGA GCTATGCCGC TAA
|
Protein sequence | MPASSSTPET LTDIDVAIIG AGPSGCIAAS LLHQQGKRVI VIEKLHFPRF SIGESLLPCC MQWLEEANML DAVNQAGFQF KNGAAFRYKE QYTDFDFSDK FTPGPGATFQ VERADFDKLL ADTAAEQGVD IRYGETVTTV DLIGKPRLTV TDANGEAYLI EAEYLLDASG FGRVLPKLLE LEKPSTLANR SAIFTHIQDN IGKKEVDDRP FDRNKILISV HPQNRDIWYW LIPLSPDRCS LGVVGEPHLM GNSDLDLEVI LMDMVNQEPG LKTLLADAEI LRECAELKGY SANVSTLATD KFALLGNAGE FLDPVFSSGV TIAMQSASMA TKCVVRQLNG EKIDWQSEYA KPLMRGVDTF RTYVEAWYDG RFQDVIFYDA PDNKIKQMVC SILAGYAWDE ANPLVAESEK RLNLIVELCR
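Both sequences are shown in space-separated blocks of 10 characters, and the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA) even though the gene lies on the minus strand. The Python sketch below is one possible way to remove the grouping, compute a GC fraction, and translate the nucleotide sequence. All function names are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares; the sketch does not handle alternative start codons, which is an acceptable simplification here only because this gene starts with ATG.

```python
from itertools import product

# Standard codon table, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(seq: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = ungroup(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = ungroup(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence from the record above.
prefix = "ATGCCAGCCT CTTCATCTAC GCCAGAGACA"
print(translate(prefix))  # prints "MPASSSTPET"
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.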
|
| |