Gene Sala_3037 details


Gene Information

Locus tag: Sala_3037
Symbol:
ID: 4083045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 3183888
End bp: 3185327
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 68%
IMG OID: 638011423
Product: carboxylesterase, type B
Protein accession: YP_618074
Protein GI: 103488513
COG category: [I] Lipid transport and metabolism
COG ID: [COG2272] Carboxylesterase type B
TIGRFAM ID:
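
The length fields in this record agree with its coordinates. A minimal sketch of that check follows, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the Start bp and End bp fields of this record.
length = gene_length(3183888, 3185327)
print(length, protein_length(length))  # 1440 479
```

Under those assumptions, the 1440 bp span corresponds to 480 codons, one of which is the stop codon, which leaves the 479 aa reported in the record.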


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.505408
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.604257
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGGCG CTGATCTGAC GCGCCGCGCG CTGCTCGCGG CGGGCGCGGT CGCCGCATTG 
GCGCCGCCTA TTTTTGCGCA AGAAACGCGC GACGTCCGCG TTTTCAAGGG CATCCGCTAC
GCGGCGGCGC GGCGTTTCGA GGCGCCCGTC GCGCTGCCTG GCAACCGGAC CGCGCTGGGC
GACTTTGGCC CCGCCTGCCC GCAGCGCGGC GATCGCTATA CACCGCAGTC CGAAGACTGC
CTCTTCCTCA ACGTCTGGAC GCCCGCGGTG GCCGACGGTG CGAAGCGCCC GGTGATGGTC
TATTTCCACG GCGGCGCCTA TTCGACCGGC AGCGTCACCG ACCCGATCAA CGATGGCGCG
GCGCTCGCGG CGCGCGGCGA TGTCGTGGTG GTGACGGTCA ACCACCGGCT CAACGCGCTC
GGCTATCTCT ATCTCGCGCG GCTCGATCCG CGCTTTCCCG ACAGCGGCAA TGCGGGGCAG
CTCGACCTGA TCGCCGCGCT CCAATGGGTG CAGCGCAATA TCGCAGCCTT CGGCGGCGAT
CCCGCCAATG TCACTGTGTT CGGCCAGTCG GGCGGCGGCG CGAAGATCGC GACGCTGATG
GCGATGCCCG CCGCCAAGGG GCTTTTCCAC AAGGCGATCA CCATGAGCGG GCAGCAGGTT
ACCGTTTCGG GCCCGCTCAA CGCCACGAAG CGCGCCGAAG CCTTCCTCGC CCAGCTCGGC
AAGGGTGTCG ATCCCGCGAC CGCGCCCATC GGGCAACTGA TCGCCGCGCT GGAGGCGATC
GACCCTATCC TCGGCGGCTC CGTCTACATG GGCCCCGTGC TCGACATGAC GCATCTGACG
CGCCACCCCT TCTGGCCCGA CGCCGCGCCG CAGTCGCTCG CTATTCCGAT GATGCTCGGC
AACACGGTGA TGGAAACGCG CGCATTCTAC GCGCCCGATG GCAAGCAGCT TGCGGGGTTG
AACTTTGACA ATCTTGCGGC GCGCATCGCG CCCGAAATCA AGGTCGACGC GCATCCCGAA
TGGGTCGTGG GCCAGTTTCG CGCGCACTAT CCCAAGGCTG CCCCGATCGA GCTGTTCCAC
CGCATCGTCA CCGCGGGGCG GAGCTGGCGC GGGCAGGTCG AGGAGGCCGA GGCGCGTGCG
CGTGCCGGGG CACCGGCGCT CGTCTACCAG CTCGATTTCG AGGACGCCAA ACACACCGAC
GACATCGGTT TCGCCTTCGG CACCATCCCC GATCCAAGCC CGGCGCAGCA GGCGATGAGC
GACCGGATGA TGGATGCCTT CGTCCGTTTC GCGCGCACGG GCGATCCGGG CTGGCCCGCC
TACGACCTCG CCACGCGGCA GACGATGATC TTCGATCGCC TGAGCCGTGT CGAGAGCGAT
CCGCGCCGGT GGGAGCGCGA GCTGTTCGCG CGCGTGCCCT ATATCCAGCC GGGGAGTTAA
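
The gene sequence above is printed in blocks of 10 bases, so it has to be joined before it can be compared with the Gene Length and GC content fields. A minimal sketch of one way to do that follows. The helper names are illustrative, and `SEQ_EXCERPT` holds only the first line of the sequence as a stand-in for the full listing.

```python
def clean_sequence(blocked: str) -> str:
    """Strip all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence only; paste all lines to check the whole gene.
SEQ_EXCERPT = "ATGTCCGGCG CTGATCTGAC GCGCCGCGCG CTGCTCGCGG CGGGCGCGGT CGCCGCATTG"

seq = clean_sequence(SEQ_EXCERPT)
print(len(seq))                   # 60 for this one-line excerpt
print(f"{gc_fraction(seq):.0%}")  # GC content of the excerpt, as a percentage
```

Applied to the full listing, the length should match the Gene Length field (1440 bp), and the GC percentage can be compared with the GC content field.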
 
Protein sequence
MSGADLTRRA LLAAGAVAAL APPIFAQETR DVRVFKGIRY AAARRFEAPV ALPGNRTALG 
DFGPACPQRG DRYTPQSEDC LFLNVWTPAV ADGAKRPVMV YFHGGAYSTG SVTDPINDGA
ALAARGDVVV VTVNHRLNAL GYLYLARLDP RFPDSGNAGQ LDLIAALQWV QRNIAAFGGD
PANVTVFGQS GGGAKIATLM AMPAAKGLFH KAITMSGQQV TVSGPLNATK RAEAFLAQLG
KGVDPATAPI GQLIAALEAI DPILGGSVYM GPVLDMTHLT RHPFWPDAAP QSLAIPMMLG
NTVMETRAFY APDGKQLAGL NFDNLAARIA PEIKVDAHPE WVVGQFRAHY PKAAPIELFH
RIVTAGRSWR GQVEEAEARA RAGAPALVYQ LDFEDAKHTD DIGFAFGTIP DPSPAQQAMS
DRMMDAFVRF ARTGDPGWPA YDLATRQTMI FDRLSRVESD PRRWERELFA RVPYIQPGS