Gene Sala_0437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0437 
Symbol 
ID4082984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp444934 
End bp446769 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content64% 
IMG OID638008794 
Productputative methylase/helicase 
Protein accessionYP_615491 
Protein GI103485930 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAACC GAAGGAAACC TGCGTCTCTC GCCGCCACGC TCGACATAAA CTGCGACTGC 
AAAGAATATG TGATCGACTA TCTCGCCAAG AGCTTTCCTG TTCGTCTGAT GGCGGTGTTC
ACCGACGAAA ACGGCAATCC GCGCTCCGAG CCGATGACTG ACGAGGATGG CGCACCTGTG
CTGTGCCGCT CCGCGCTGGC CGCGCGCGAC CGGATGATCG AACAGCTCTG CGCCCTGCCG
CCGATTGCTA CTGCGCTCGA TGCCATCATC GAACGGTTCG GCGTTGACCA GGTGGCGGAA
GTCACCGGTC GCACACGTCG GCTCATCGTC GGCCGCGACG GTCGCCAGAA ACTCCAATCC
CGCTCGCCGC GCGCCAATGT GGCCGAAACC CAGGCCTTTA TGGACGGCGC GAAGCGCATC
CTGGTGTTCT CCGATGCCGG GGGAACGGGG CGCAGCTACC ATGCTGATCT TGCCGCCAGG
AACCAGGCCC GCCGGGTCCA CTTTCTGCTC GAGCCGGGCT GGCGAGCTGA CGCCGCGATC
CAGGGGCTTG GCCGGACCAA TCGCACCAAT CAGGCATCAG CCCCGCTGTT CCGGCCCGTG
ACCACCGATG TGCGCGGCGA GCGGCGGTTC ATTTCGACGA TCGCCCGCCG CCTCGACAGC
TTGGGCGCTC TGACGCGCGG GCAGCGCCAG ACCGGTGGCC AGAACCTCTT CGATCCGGCC
GACAATCTCG AAAGCATCTA CGCCAAGGAG GCGCTCCACC GCTGGTTCGG CCTCTTGTTC
ACAGGCAAGC TCGAAGCCGT CAGCCTCGGG CGCTTTCAGG AGCTGACAGG TCTCCGGATC
GAAGCGCCTG ATGGTTCGAT GGTCGATGAC CTGCCGTCGA TCCAGCGGTG GCTCAACCGC
ATTCTTGCGC TTCCCATCGC CCTGCAGAAC GCGATCTTTG ACGAGTTCAT GGGGCTGGTC
GAAGCGCGCA TCGATGCCGC CCGGCAGGCT GGCACCCTCG ATCTCGGCTT GGAGACCATC
GCGGTTGAGG ACTTCACGGT CCTGTCGGAC ACGCTGCTGC GCACAGATCC AGCATCGGGC
GCGACGACCC ATCTCCTCGA ACTGGAAATC GCCAGGGCCC TGAAGCCGCT CACGCTGACG
CGGCTCGAGG AGATTCACGG CATCACCGGG CAGCGGCAAC GCCCGGTTCG GAACGCCCGC
TCAGGTCGAG TCGCCTTACT GGTGCCCGCC CGAAGTATCC TTGCCGATGA CGGTACGCGC
GTGGCCCGCT TCGAATTGCT TCGCCCGATG AAGCACAGCC ACATCACCGA GGATCAGCTC
GCTGAGAGCA GCTGGGAGGG GATCTCCGTC GACGTTTTCC GCGAGGCCTG GGTCGCCGAG
GTGGAAGAGG CCAGGACCAG CCACAAGCGC GAGCGCCTCT ATCTCGCGAC GGGCCTACTG
CTTCCGGTCT GGGACAAGCT CCCTTCGGAT TTCGTCAGGG TTAGTCGCAT CTCGGCGGCG
GATGGCCGTT CGCTCCTTGG CCGCGAGGTT CCCGCCCATT GTGTGCCCGA ACTGTGCCGA
GCGCTGGGTC TGGAACGCGA GCAAACGCTT TCCGCCGACG ACATTGTCCA GACCGTCCTG
GCAACGGGGA GGGCCATGGA GTTCACGGGC CGCGAGCTGC TCATGGTCAA GCGCAGCCTG
GTCAATGGGT CACAGCGACT TGAGCTTACG GGATGGAGTG CTGCTCGGCT CGACTGGTAC
AAGGCCCAAG GCTGCTTTAC CGAGATCATC CGCTATCAGA CCCGGCTCTT CGTACCGATC
GAGGGCGCGG CGAGTGTGAT TGCCAGACTG GCATGA
 
Protein sequence
MENRRKPASL AATLDINCDC KEYVIDYLAK SFPVRLMAVF TDENGNPRSE PMTDEDGAPV 
LCRSALAARD RMIEQLCALP PIATALDAII ERFGVDQVAE VTGRTRRLIV GRDGRQKLQS
RSPRANVAET QAFMDGAKRI LVFSDAGGTG RSYHADLAAR NQARRVHFLL EPGWRADAAI
QGLGRTNRTN QASAPLFRPV TTDVRGERRF ISTIARRLDS LGALTRGQRQ TGGQNLFDPA
DNLESIYAKE ALHRWFGLLF TGKLEAVSLG RFQELTGLRI EAPDGSMVDD LPSIQRWLNR
ILALPIALQN AIFDEFMGLV EARIDAARQA GTLDLGLETI AVEDFTVLSD TLLRTDPASG
ATTHLLELEI ARALKPLTLT RLEEIHGITG QRQRPVRNAR SGRVALLVPA RSILADDGTR
VARFELLRPM KHSHITEDQL AESSWEGISV DVFREAWVAE VEEARTSHKR ERLYLATGLL
LPVWDKLPSD FVRVSRISAA DGRSLLGREV PAHCVPELCR ALGLEREQTL SADDIVQTVL
ATGRAMEFTG RELLMVKRSL VNGSQRLELT GWSAARLDWY KAQGCFTEII RYQTRLFVPI
EGAASVIARL A