Gene Sala_2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2007 
Symbol 
ID4079944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2116639 
End bp2118801 
Gene Length2163 bp 
Protein Length720 aa 
Translation table11 
GC content69% 
IMG OID638010383 
Productpeptidase S9, prolyl oligopeptidase active site region 
Protein accessionYP_617051 
Protein GI103487490 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00565221 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCAAAC TGGCCTTGTT TGGAACCGCC CTTATCGCCC TCACCGCCCA GCCGCTCTGG 
GCGCAGCAGG AAGAGGCCCC GGCGCAGCAG CCGCCCGCAT CCGAAACCCC CGCCGCTACC
CCCACCCCCA TGCCCGCCGC CGCGGTGGGC AGCGCGCGCA CCGGCCCCGA ACGCCGCTTC
ACCGGCGCCG ACCTCTTCGA CCTCGCGATC GCCGCCGATC CGCAGATCAG CCCCGACGGC
CGCCACATCG CCTATGTCCG CCGCGCGAAC GACATCATGA CCGACCGCGC GGTCAGCTCG
ATCTGGCTGA TCGACACCGC GACCGGGCGC GAAACCCCGC TTGCGGGACA GGATGGTCCG
GCCTTTTCGC CGCGCTGGTC GCCCGACGGC TCGCGCCTCG CCTATGTATC GGCCGCGGGC
GGCAGCGCGC AGCTCTGGGT GCGCTGGATG GACGGCGGCG AAGCAGTGCG GCTGACCGGC
CTGCCGACCA GCCCGTCGAG CCTGACCTGG GCGCCCGACG GCCGTTCGAT CGCCTATACG
ATGCTGGTGA AGGACGAGGG CGCCAGGCTC GGCAGCGCGC CCGCGAACAA GCCGGAGGGC
GCCAAATGGG CCGAACCGCT CGACGTCCGC ACCCTCCTCA CCTACCGCGC CGACGGACAG
GGCTATGTCG AGCCGGGGTT CGAAAAAATC TTCCTCATCC CCGCGACCGG CGGCGCGCCG
CGCCAGTTGA CCTTCGGCCC GTATCACGAT GGCGGCCCCT TGAGCTGGTC GCGCGATGGT
CGCACGCTTT ATTTCAGCGC CAACCGCCAG GCCGAGTGGG AAACCGATCC GCTCGAAAGC
GAGATCCACG CGCTCGACGT CGCCAGCGGC GCGATCGCCA CGCTCACCGA CCGCAACGGC
CCCGACGCCA ATCCCCTGGT GTCGCCCGAC GGCCGGCTGA TCGCCTATCT GGGCTTCGAC
GACGCGCTGC GCGCCTATGA GCAGACCGAA CTCTATGTGA TGAACCGCGA CGGGTCGGGC
CGCCGCCGCA TCGCCGCCAA CTGGGATTAC AGCGTCGATG CCGTGCAGTG GGGCGCCGAC
AGCCGCAGCC TCTATGTCCA ATATGACGAT CATGGCGAGA CGAAGGTCGC GCGCGTCACC
CTCGACGGGT CGGTGCGCGA CGTGGCGAAG GGGCTGTCGG GCGGCGGGCT CGACCGGCCC
TATACCGGTG GCAGCTTCAC CGTCGCGGGC AACGGCGCGA TCGCCTTCAC CGGCGGCACC
GCCACCCGCC CCGCCGAGGT GCAGCTCGCG CGCGGCGGCG GCGAGGCGCG GATGCTGACC
GACCTCAACC GCACGTTGCG CGAGGTCAAA TCGCTGGCGC AGGTACGCAA GATCACCGTG
GCGTCGAGCC ACGACGGCCT GCCGATCGAG GGCTGGCTGA CCCTGCCGCC CGGCTATGTC
GAGGGACAGC GCGTGCCGCT GATCCTCGAA ATCCATGGCG GGCCCTTCAC CGCTTATGGC
CCGCATTTTT CGACCGACAA TCAGCTTTAT GCCGCTGCGG GCTATGCGGT GCTGTCGGCG
AACCCGCGCG GCTCGACCAG CTATGGCGAG GCCTTCGCGC AACAGATCGA CAAGGCCTAT
CCGGGCAATG ATTATTTCGA CCTCATCTCG ATCGTCGATC AGGCGATCGC GCTCGGCATC
GCCGACCCCG ACGCCTTGTT CGTCACCGGC GGGTCGGGCG GCGGCGTGCT CACCAGCTGG
ATCGTCGGCA AGACGAACCG CTTCAAGGCC GCGGCGACGC AGAAACCCGT CATCAACTGG
CAGACGCAGG CGCTGACCGC CGACGGCCCC GCCTTTTTCG GCCCCTATTG GCTCGGTGCG
CAGCCATGGG AAGACCCCGA ACGCTACTGG GCACGCTCGC CGCTGTCGCT CGTCGGCAAT
GTCGAAACCC CGACGCTCGT CGTCGTCGGC GGCGAGGATT ATCGCACCCC GGTCAGCGAA
TCCGAACAAT ATTACACCGC GCTTCGCCTG CGCGGCGTGC CCACCGCGCT CGTCAAGGTG
CCCGGCGCCA GCCACGGCGG CATCGCCGCG CGTCCCTCGC AATCGGCGGC CAAGGCTGCC
GCGATCCTCG CCTGGTTCGA CAAATACCGG AAAGGCTGGA CGCGGCCCGC GGCATCGGAT
TAA
 
Protein sequence
MRKLALFGTA LIALTAQPLW AQQEEAPAQQ PPASETPAAT PTPMPAAAVG SARTGPERRF 
TGADLFDLAI AADPQISPDG RHIAYVRRAN DIMTDRAVSS IWLIDTATGR ETPLAGQDGP
AFSPRWSPDG SRLAYVSAAG GSAQLWVRWM DGGEAVRLTG LPTSPSSLTW APDGRSIAYT
MLVKDEGARL GSAPANKPEG AKWAEPLDVR TLLTYRADGQ GYVEPGFEKI FLIPATGGAP
RQLTFGPYHD GGPLSWSRDG RTLYFSANRQ AEWETDPLES EIHALDVASG AIATLTDRNG
PDANPLVSPD GRLIAYLGFD DALRAYEQTE LYVMNRDGSG RRRIAANWDY SVDAVQWGAD
SRSLYVQYDD HGETKVARVT LDGSVRDVAK GLSGGGLDRP YTGGSFTVAG NGAIAFTGGT
ATRPAEVQLA RGGGEARMLT DLNRTLREVK SLAQVRKITV ASSHDGLPIE GWLTLPPGYV
EGQRVPLILE IHGGPFTAYG PHFSTDNQLY AAAGYAVLSA NPRGSTSYGE AFAQQIDKAY
PGNDYFDLIS IVDQAIALGI ADPDALFVTG GSGGGVLTSW IVGKTNRFKA AATQKPVINW
QTQALTADGP AFFGPYWLGA QPWEDPERYW ARSPLSLVGN VETPTLVVVG GEDYRTPVSE
SEQYYTALRL RGVPTALVKV PGASHGGIAA RPSQSAAKAA AILAWFDKYR KGWTRPAASD