Gene Sala_0740 details


Gene Information

Locus tag: Sala_0740
Symbol:
ID: 4081150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 745041
End bp: 746558
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 60%
IMG OID: 638009098
Product: tryptophan halogenase
Protein accession: YP_615793
Protein GI: 103486232
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.726405
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGACC ACGCCATTCG CAAAATCGTC ATAGTCGGCG GCGGAACCGC GGGATGGATG 
GCGGCCTCGG CATTTTCAGC TTTGCTGGAG CGATCGATCG AAATCATCCT GGTCGAATCC
GACGAGATCG GCACGGTCGG GGTCGGCGAG GCGACCATTC CGCCGCTGAT TGCATTCAAC
TCCATGCTCG GTATCAACGA GGATGAATTT CTCTCCGCGA CGCGCGATAC GTTCAAGCTC
GGTATTGAGT TCGTCGACTG GGGCGCGGTC GGAGAACGCT ATTTCCATCC GTTCGGCCCC
CATGGCCAGG ATTTTCGCGG TGTTGCATTC CACCAACTTT ACCTGCGCGA GACGGCGAGG
CAGCCCCTGC CCGACATTCG GCACTGGTCG ATGAGTGCGA CCGCGGCAGA GCTTGGTCGC
TTTGCCCGCC CCGGACCCAC TGCGCGCCCT CCCCTCTCGC AGCTGGCCTA CGCCTTCCAT
TTCGACGCCG GTCTCTATGC GCGATTTCTG CGCAGCTATG CGGAAAAGAA CGGGGTGATC
CGTATCGAGG GAAAGCTGGT TGACGCATCA CTGAATCCCG AAAGCGGGCA CGTCAGGTCG
GTCAAACTGG CCAATGGAAA CGATATCGCA GGCGATCTGT TCATCGATTG CTCCGGATTT
CGCGGCCTCC TGATCGAGGA GCAACTCGGT ACGGGCTATG AAGACTGGAG CCACTGGCTG
CCCTGCGATC GCGCCGTCGC CGTACCATGC GGCCTGGCAA GCCCTCCCGA GCCGTTTACC
CGCTCCACCG CGCGATCGGC CGGCTGGCAG TGGCGCATCC CCCTTCAGCA CCGCATGGGC
AATGGTCTCG TCTATTCGAG CGCACATCTG GAGCGCCGCG CAGCCGAGGA TCTGCTCGTC
GCCAACCTTG AAGGCAAAGG TCTTGCCGAT CCAAAACATC TGTCCTTCAC CGCCGGACGG
CGCCGCAAGG CCTGGAACGC CAATGTCGTT TCCCTGGGAT TGTCGAGCGG CTTTGTCGAA
CCGCTTGAAT CAACCAGCAT CCATTTCATT CAGAGCGGAA TCGCCAAGCT GCTTGCCCTG
TTTCCCGACC GGCGCTTCGA TCCCGTCGAA AGGAACGAAT ATAACCGTCA GATGGCCGAT
GTATTCGAAG ACGCCCGCGA CTTCATAATC CTGCATTACA AGGCGACCCG GCGTGACGAT
TCCGATTTCT GGAACGATTG CCGGACGATG GATGTCCCCG ATGGTCTGGC TGCAAAGTTC
GGGCTTTGGC AATCGAAAGG GCGCCTGTTT CGCGAAGGGA GAGAATTGTT CGGAACCGCC
AGTTGGGTTG CCGTGCTGTT GGGACAAGGC ATACGTCCCG CCGAAACCGA CCCGGCGGTG
AATGCGATCG ATCCCGACAT TGCACGCGAC ATGCTTGACA AAATGCGATC AAGCTATCGA
CAAATGGCCG AGCATATGCC GCCGCATTGG GATTTTATTT CGCGCGCTTG CCTTGCACCG
GAGGCGATCT CAGGCTGA
 
Protein sequence
MSDHAIRKIV IVGGGTAGWM AASAFSALLE RSIEIILVES DEIGTVGVGE ATIPPLIAFN 
SMLGINEDEF LSATRDTFKL GIEFVDWGAV GERYFHPFGP HGQDFRGVAF HQLYLRETAR
QPLPDIRHWS MSATAAELGR FARPGPTARP PLSQLAYAFH FDAGLYARFL RSYAEKNGVI
RIEGKLVDAS LNPESGHVRS VKLANGNDIA GDLFIDCSGF RGLLIEEQLG TGYEDWSHWL
PCDRAVAVPC GLASPPEPFT RSTARSAGWQ WRIPLQHRMG NGLVYSSAHL ERRAAEDLLV
ANLEGKGLAD PKHLSFTAGR RRKAWNANVV SLGLSSGFVE PLESTSIHFI QSGIAKLLAL
FPDRRFDPVE RNEYNRQMAD VFEDARDFII LHYKATRRDD SDFWNDCRTM DVPDGLAAKF
GLWQSKGRLF REGRELFGTA SWVAVLLGQG IRPAETDPAV NAIDPDIARD MLDKMRSSYR
QMAEHMPPHW DFISRACLAP EAISG
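
The fields above can be cross-checked against the sequences with a short script. The following is a minimal sketch, not part of the original record: the helper names (clean_sequence, gc_fraction, translate) are hypothetical, and it assumes the start and end coordinates are inclusive. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts; this gene begins with ATG, so no special start handling is needed.

```python
# Standard genetic code in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Strip the spaces and line breaks used to display 10-letter blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Coordinates from the record, assumed inclusive.
start_bp, end_bp = 745041, 746558
gene_length = end_bp - start_bp + 1          # 1518 bp
protein_length = gene_length // 3 - 1        # 505 aa; last codon is the stop

# First row of the gene sequence, first 30 bases only.
first_codons = clean_sequence("ATGTCTGACC ACGCCATTCG CAAAATCGTC")
print(gene_length, protein_length, translate(first_codons))
# prints: 1518 505 MSDHAIRKIV
```

Passing the full gene sequence through clean_sequence and translate should reproduce the listed protein sequence, and gc_fraction on the same string can be compared with the reported GC content.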