Gene Sama_0939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_0939 
Symbol 
ID4603191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp1135041 
End bp1136564 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content52% 
IMG OID639780274 
Producttryptophan halogenase 
Protein accessionYP_926816 
Protein GI119774076 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.220698 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAA CGAAAATTGC TATCCTGGGT GGTGGTACTG CCGGTTGGTT GGCGGCCAAC 
CATTTGGGGG CGGAGCTGTG TGCCGATAAG GAGGTTGAAA TAACCCTTAT CGAATCGCCG
GAGATCCCAA CCATTGGTGT GGGGGAGGGC ACCGTGCCCT ATATCATGAA AGGCCTCAAA
CGCTTTGGCA TATCTGAGTC CGAGCTGCTG GCAAACTGTG ATACCACCTT CAAGCAGGGC
ATTAAGTTCG TCAATTGGCT CGACCCTGAG CGCCACGGCG ATAACCACTA TTACCATCCT
TTCGACTCAC CTTATCCCGG TGGCATGGAC ATCAGCCATT ACTGGCTGAC CCAAAAAGAT
AAGCGCCCCT TTGATGATGT GGGTATACAG GCCCGGATTT GTGAAAAAAA CCTGGCTCCC
AAGCGTATCA GTGCTCCCGA ATATCAGGGT GAACTGGCTT ACGCCTATCA TTTCAATGCG
GTGAAGTTTG CTGCCTTGCT GGCTAAGAAC GCCCGGGAGC GCTTTGGGGT CAAGTATCTG
AGTGCCACAG TCGCAGGCGC GACGCTGAAT GACGACGGCG CCATTGCCAG TCTGAATACC
AAGGAAGTCG GTAGCTTGGC GTTCGATTTT TATGTCGATT GCAGCGGTTT TCACTCGGTA
CTGTTGGACA AGGTGCTTAA GGTGCCCTTT GTGGATAAAG GCAAAGAGCT GTTGACCGAC
TCAGTGATAG TACAGCAGGT TCCCTTGAAG AGCGGTGAGG CGCTTTCGCC CTATACCAAG
GCGACTGCGC ATAAGGCGGG CTGGATTTGG GATATCCCTC TAACCACCCG TCGGGGTACC
GGTTTCGTGT ATTGCAGCCA ATACATGAGC GATGAAGAGG CCGTTTCCAC CTTTGCCCAA
TACCTTGGCA TGGACGTGAG CGAGATATCG CCAAGAAAGA TCCCGATGAA GATTGGTTAT
CGGGAGAAGT TTTGGGCCAA AAATTGCGCC ACCCTGGGGC TTGCTCAGGG CTTTGTGGAA
CCACTGGAAG CCACCTCGAT ACTGGTAACG GACTTTTCTG CAGAACTGCT GGCCAAAAAC
TTCCCCAGGG AAACCTCTGA TATTGAGGTA CTTAGCCCTT ACTACAATGA TGTCATTACT
TATGTATGGG AAAGGGTCAT CGATTTTATC AAGCTGCATT ACTGTCTCTC AGACAGGGAA
GATACCGGCT TTTGGGCAGC CAATCGCGAT TCCGACACCT GGTCCGAGAC CCTAAAATCC
CGACTGGCAA AGTTTGCACT CAGGCCTCCT CAGCAATCGG ACTTTTTAAG CCGTTTTGAT
TTATTCGATG ATAAAAACTT CCTGTATGTG CTCTATGGAA TGGGCTTTTC AAGCCGTATC
AAGGCGCTCG ACCCGAGGGA GATAGAGCAG AGCAGGCAGC TGTTGGAGAG TAACGACAAA
TTGGCTGACA GGGCGGAGGA GTTGTTGATG GAGCACGGAA AGTGGCTCGC AGGTCTGAAG
GCGGCCATGG CACGGGCATC ATAG
 
Protein sequence
MKITKIAILG GGTAGWLAAN HLGAELCADK EVEITLIESP EIPTIGVGEG TVPYIMKGLK 
RFGISESELL ANCDTTFKQG IKFVNWLDPE RHGDNHYYHP FDSPYPGGMD ISHYWLTQKD
KRPFDDVGIQ ARICEKNLAP KRISAPEYQG ELAYAYHFNA VKFAALLAKN ARERFGVKYL
SATVAGATLN DDGAIASLNT KEVGSLAFDF YVDCSGFHSV LLDKVLKVPF VDKGKELLTD
SVIVQQVPLK SGEALSPYTK ATAHKAGWIW DIPLTTRRGT GFVYCSQYMS DEEAVSTFAQ
YLGMDVSEIS PRKIPMKIGY REKFWAKNCA TLGLAQGFVE PLEATSILVT DFSAELLAKN
FPRETSDIEV LSPYYNDVIT YVWERVIDFI KLHYCLSDRE DTGFWAANRD SDTWSETLKS
RLAKFALRPP QQSDFLSRFD LFDDKNFLYV LYGMGFSSRI KALDPREIEQ SRQLLESNDK
LADRAEELLM EHGKWLAGLK AAMARAS