Gene Sala_0643 details


Gene Information

Locus tag: Sala_0643
Symbol:
ID: 4082733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 656711
End bp: 657910
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 67%
IMG OID: 638009002
Product: protein of unknown function DUF900, hydrolase-like protein
Protein accession: YP_615697
Protein GI: 103486136
COG category: [S] Function unknown
COG ID: [COG4782] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
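
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a gene length that includes the terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(656711, 657910)
    print(length, protein_length_aa(length))
```

With the Start bp and End bp values listed in this record, the result can be compared against the Gene Length and Protein Length fields.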


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.399465
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.101765
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGTG CCCGCCAGCT GGCGGCGGCG TGCGGCGCCG CGCTCGCGCT GTCGGGGTGC 
AGCATTGCGG CGGTCGACTA TGGCAAGATC CGCCACGCCG AATATGTCGC CGACCAGCGC
TGCGATGCGC AGCCGGGCGC GGTCGTCGAT GGCGCGGCGC TGCCCCACTT CTTTGTGACG
AGCCGCCTGC CCGACTGCCG CGCGAGCGAG ATCGAACTGC TGCACCACCG CGGCGACCAT
GTACGGTACG GCCGCTTCGA CGCGCCGCGC GATGTTGTGG TGGCAAAGAA AAAACGCTTC
CTGCCGCCGC TCGCCTTTCA GGCGGCGCCC GACTGGTGGC GCGCGCTGCA GGCCGAAACC
GACCGCAAGC AGGGCCGCGT GCTGCTCTAT GTCCACGGCT ATCGCGAGAG TTTCGCAACG
ACGTCGAAGG ATGCGGCGCA GATCGCGCGG ATGACAGGGT TCGACGGGCC GATCATCGAA
TATAGCTGGC CGTCGCAGGG CAAGCTCTTC AGTTATGTCG TCGACGAAAC GAACATGTAT
CACGACGTCC GCAACTTCCG CGATTTCCTG AAAACCCTCG CCGAACAGGG CTGGGTCAGG
GAGATCGTCA TCGTCTCGCA CTCGCTGGGC GCACGGCTGG TGATCCCCGC GGTCGCCTAT
GTCGATCGCG CGTCGAGCAA CGCCGACAGC AGCAATATCT CGAACATCAT CCTCGCCTCA
CCCGACTTCG ACCGCGAGAC GTTCGAGCGT GACATCGAAG AGGAAGTGCT GTCGGCGCGG
CGCGTCGCAA ACGACCGGCG CATCACCATC TATGCGTCGC GCGCGGACAG GGCGCTCGCG
GCGTCGCGCG CGATCCACGG CTATCCGCGA TTGGGCTCGC CCTATTGCTT CAATCCGTTC
GAGGCGGCGG AACTGAAGGC CAGGGGGCTT CCCGAACGCT GCTATCCCGC GCCGCGCGCC
GGGCTGACGG TGATCGACAC GACCGACGTG TCGCGCGGAT CGACGGGGCA CAGCAATATC
CTGCTGAGCG CGCCCGCCTG CCGCGACTTC ATCGACGTCG TGGCGGGCAA GCGCACCCGG
CCCGAGCGCG TCGCGACCCC GTGGACGCAT GTGTTCCGGC TGGAACCCGA CCCGGCACTG
ACCAAGGCGG AGCACGACGC AATATGTCGC CGCACCGCCG AAGCGGGCGA CGACCGCTGA
 
Protein sequence
MMRARQLAAA CGAALALSGC SIAAVDYGKI RHAEYVADQR CDAQPGAVVD GAALPHFFVT 
SRLPDCRASE IELLHHRGDH VRYGRFDAPR DVVVAKKKRF LPPLAFQAAP DWWRALQAET
DRKQGRVLLY VHGYRESFAT TSKDAAQIAR MTGFDGPIIE YSWPSQGKLF SYVVDETNMY
HDVRNFRDFL KTLAEQGWVR EIVIVSHSLG ARLVIPAVAY VDRASSNADS SNISNIILAS
PDFDRETFER DIEEEVLSAR RVANDRRITI YASRADRALA ASRAIHGYPR LGSPYCFNPF
EAAELKARGL PERCYPAPRA GLTVIDTTDV SRGSTGHSNI LLSAPACRDF IDVVAGKRTR
PERVATPWTH VFRLEPDPAL TKAEHDAICR RTAEAGDDR
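
The gene and protein sequences above can be cross-checked against each other and against the listed GC content. The following is a minimal sketch, assuming the sequences are pasted in as plain strings with the spacing shown in this record. The amino acid assignments of translation table 11 are the same as those of the standard code (the tables differ in their permitted start codons), so a standard codon table is used here. The function names are hypothetical.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the blanks and line breaks used to group residues in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First bases of the gene sequence in this record.
    print(translate("ATGATGCGTG CCCGCCAGCT G"))
```

Passing the full gene sequence to translate() and comparing the result with clean() applied to the protein sequence gives a quick consistency check; gc_fraction() can be compared with the GC content field in the same way.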