Gene Sde_2828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2828 
Symbol 
ID3968231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3567099 
End bp3568595 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content44% 
IMG OID637921925 
Producttryptophan halogenase 
Protein accessionYP_528297 
Protein GI90022470 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000300055 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0040734 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACAAA ATAAAATTAA AAAAGTGGTC ATCGCAGGTG GCGGCACAGC AGGTTGGATG 
GCCGCAGCGG CCATATCTAA GTTACTCGGT AAAAACCTCG ATATTAGTTT AGTGGAATCC
GATCAGATTG CCACGGTAGG GGTGGGTGAG GCTACCATTC CCGCGTTACA AACATTCCAT
AAGTTGTTGG GCATTAAAGA ACCGCACTTT ATGGCTGCCA CCCAAGCAAC CTTCAAACTG
GGTATCGAAT TCAAACGTTG GAAGGATACC CAGTCGAGCT ACATTCATTC GTTTGGCGCA
GTAGGTAAAG ATTGTTGGGC CGCAGGCTTC CAACATTTTT GGCGGCGGGG AGTAGACTTA
GGCGTTAACC ACGATTACGG CGATTATTGC TTAGAGCTAG AAGCCGCCAA ACAAAATAAA
TTTGCCCACC CCAATAGCGG CAATATTTTT TACGCCTACC ATTTAGATGC AACCCTGTAC
GCTAAATATC TGCGTAAATT CAGCGAAACC TTTGGCGTAA AACGTATAAA GGGTAAAATT
GTTGAAGTTA AAACACACTT ACATAACGAT TATATTAAAT CGTTAGTGTT AGAAAGCGGC
CAAGAAGTAG AAGGCGATTT ATTTATAGAT TGCACAGGTT TTATTGGCCT ATTAATAGAG
CAAACATTGC AAACCGGTTA CGAAGACTGG TCCCACTGGT TGCCCTGCGA TAGCGCAGTG
GCAGTGCAAA CAGCAGCAAC GCAAGCGCCC ATACCTTATA CCCGCTCTAC TGCTCACGCT
GCCGGCTGGC AGTGGCGTAT ACCGCTGCAG CATAGAGTGG GAAACGGTTT AGTGTATTGC
AGTAAACATA TTAGCGATGA AGAAGCTAAG CAAACACTAT TAAATAATAT TGAAGGCGAA
CTGCTAACAG AACCAAGAGT AATTAAATAT CGCACGGGCC AGCGTTTAAA ACACTGGAAT
AAAAACTGCG TTGCATTAGG GTTGGCAAGC GGCTTTATCG AGCCTCTTGA ATCTACAAGT
ATTCATTTAA TACAGCGCGG CATTTTGCGA CTGCTTTTTT TGTTCCCTTC GAATGGAATA
AACGATACCG ATGTGGCTGA ATACAATCAG CAAACTAAAG CAGAGATTGA GCATATACGC
GATTTTATTA TTTTGCATTA TCACGTAAAC CAGCGTAACG ATTCGCGGTT TTGGCGCTAC
TGCGCAAATA TGTCTATACC CGAAACACTC GCGCACCGTA TCAGTCTATT TAAAAAATCT
TCTCGATTTT ACCCAAAAGA CGATGAGCTA TTTGGCGAGT ATTCCTGGGT GCAAGTCATG
TTGGGGCAGG GCATAGAGCC AGAGGGGTAT CATCCTATTG TGGATATGAT GTCTGAGGAT
GAGCTTCATC ACTTTTTAAA AAATATTCGG AGTTCGGTTC AGCAGGCACT TGCCAGCATG
CCGCAGCACA CCGATTACAT TCAGCAGTAT TGCAAAGCGC CACCCATACC TATCTAG
 
Protein sequence
MKQNKIKKVV IAGGGTAGWM AAAAISKLLG KNLDISLVES DQIATVGVGE ATIPALQTFH 
KLLGIKEPHF MAATQATFKL GIEFKRWKDT QSSYIHSFGA VGKDCWAAGF QHFWRRGVDL
GVNHDYGDYC LELEAAKQNK FAHPNSGNIF YAYHLDATLY AKYLRKFSET FGVKRIKGKI
VEVKTHLHND YIKSLVLESG QEVEGDLFID CTGFIGLLIE QTLQTGYEDW SHWLPCDSAV
AVQTAATQAP IPYTRSTAHA AGWQWRIPLQ HRVGNGLVYC SKHISDEEAK QTLLNNIEGE
LLTEPRVIKY RTGQRLKHWN KNCVALGLAS GFIEPLESTS IHLIQRGILR LLFLFPSNGI
NDTDVAEYNQ QTKAEIEHIR DFIILHYHVN QRNDSRFWRY CANMSIPETL AHRISLFKKS
SRFYPKDDEL FGEYSWVQVM LGQGIEPEGY HPIVDMMSED ELHHFLKNIR SSVQQALASM
PQHTDYIQQY CKAPPIPI