Gene Sala_0498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0498 
Symbol 
ID4081388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp515862 
End bp517142 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content67% 
IMG OID638008856 
Producthypothetical protein 
Protein accessionYP_615552 
Protein GI103485991 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACGCA AACTTCTCGT TTTCGCCGTC GCGGCGACTC TTGCTCAGCC AGCCTTTGCT 
CAACCGTTCT GTCAAGCGGG CGCTTATCGC GGCGCCGACG GCGACTTTGT CGCGCTGGCA
AAATCGACGG TCAATCCGGC GGGCGGGCTA CGCTATCTGT TCCGTGACGG GCGCCGCGGA
TCGACGGGCG ATGCCGACGC GCCGCTCGAT TGCGCTCCCG ACGGCGTGCG CATTGGCAAG
GGCGCCGAGG CGGAGACGTG GGCGCGCATC GCCCTTCGCG AAACCCCGGC GACCTTCGAC
AGCGCGGGAT CCAAATTGTC GGGCATGTTG ATCGAACCGC CGGGCAGCGA TCCGCAGCGG
CCGCTGGTGG TGATGGTCCA TGGCTCCGAA CGCACGTCGC CGATCGGCGG CATCTATGGC
TATGCGATGG CGGCGCAAGG TCTGTCGGTG TTCGTCTATG ATAAACGCGG CACCGGCGCA
TCGGAGGGCG AATATACGCA GAACTTCGAA TTGCTCGCGC GCGACGCCGC TGCGGCACTC
GGGCAGGCGC GCGCGATGCT GCCCGGACAT GCCGGGAGGG CGGGCTTTTT CGGAGGCAGC
CAGGGCGGGT GGGTCGCTCC GCTCGCCGCG ACGCTGACCC CCGCCGATTT TGTCGCGGTC
GGTTTCGGCC TCGTCGCCTC GCCGATCGAG GAGGACCGCG AGCAGATGAT CTCCGAAGTG
CGCGCGGCGG GGCTGGGCGC CGATGCCGAA GCGCTCGTTA ACCGCCTGTC GGCAGCAACG
GCCAGGCTGC TGCTGTCGAA CTTCAAGGAT GGTTATGTCG AACTCGACGC CGCGCGCGCC
GCGCTCGCGG ACAAGCCGTG GGCCACGCAG ATACGGGGCG AGCACAGCGG GATGATGTTG
CGGATGTCCA ATGCTGAGCT GCGCCGGATC GGGCGGGCGC GCTTCGACAA TCTGGAACTG
ATCTGGGATT ATGACGCGGT GGCGGCGCTG CGCCGGCTTC GCACGCCGCT CTTGTGGGTG
CTCGCGGGCG AGGATCGGGA AGCCCCGATC GAAACGACAC GCGCCGCGCT GGCCGAATTG
CGGGCGGCAG GGCAACCGAT CGACGTCTAT CTGTTTCCCG GCACCGACCA TGGCATGATC
GAGTTCACGA CCGGCCCCGA CGGCAAGCGG TCCTACACGC GCATCACCGA CGGCTATCTG
AAGCTGCTTG GCGACTGGAT GAAGGGCGAG GCGCGCGGCA CTTACGGCCG CGCCGAGACG
CTGACGCCTA CCCCGCGCTG A
 
Protein sequence
MIRKLLVFAV AATLAQPAFA QPFCQAGAYR GADGDFVALA KSTVNPAGGL RYLFRDGRRG 
STGDADAPLD CAPDGVRIGK GAEAETWARI ALRETPATFD SAGSKLSGML IEPPGSDPQR
PLVVMVHGSE RTSPIGGIYG YAMAAQGLSV FVYDKRGTGA SEGEYTQNFE LLARDAAAAL
GQARAMLPGH AGRAGFFGGS QGGWVAPLAA TLTPADFVAV GFGLVASPIE EDREQMISEV
RAAGLGADAE ALVNRLSAAT ARLLLSNFKD GYVELDAARA ALADKPWATQ IRGEHSGMML
RMSNAELRRI GRARFDNLEL IWDYDAVAAL RRLRTPLLWV LAGEDREAPI ETTRAALAEL
RAAGQPIDVY LFPGTDHGMI EFTTGPDGKR SYTRITDGYL KLLGDWMKGE ARGTYGRAET
LTPTPR