Gene Sala_2389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2389 
Symbol 
ID4080542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2521467 
End bp2523137 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content70% 
IMG OID638010769 
Productleucyl aminopeptidase 
Protein accessionYP_617431 
Protein GI103487870 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0738051 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGCC GATGTCGTTC GTCGAGCCCG ATTTGCGGAC GGGCTGGAGG CAGCTATGCA 
GGACGCAGGT CGTCAAACAA CGGGGATTAT CGAACGATGC GCATGAAAAC ACTGCTTCTT
TCCGCTTGCC TGTCGCTCGC CTTCACGCCC GGCGCGATGG CGCAAACCGT GATCGGGTCG
GGCGTTGTCC CGGCGAACGC CGCGAACAGC GCCGAACGCG CGATCGGCTT CGCATCGCGC
GCGCCGACGG GCGCCGCGCT CGTCATTGTG ATGACCGACG CCGCGCTGCC GCCGCTCGAT
GGCGTCGCGC TCTCCGCGCC CGAGCGGCAG GCAGTCGAGG CCGCGATCGC CGCCGCGAGC
TTCGACGGCA AGGCGGAATC GACGCTGTCG CTGCGCGGCA TCGGCGCGCA TCCGCGCATC
CTGCTCGTCG GCGCCGGGCC GGCGCCCTCG TCGCTCGCGC TCGCCGAAGC GGGCGGCAAG
GCGGCGCAGG AGATGAAGGG CGAGGCGCAT CCCGTGGCGA TCGCCGGCGC CTTTGGCGAC
ACCTCCGCCG CCGAGGTCGC TTATGGCTTC GCGCTCGGCC AATATCGTTT CGACCGCTAC
AAGACGGTCG ACCGCAAGAC GCCGCCCTCC GCCGCGGTCA CGCTCGTCGG CGCCAATCCC
TCGACCGCCG AGACCGCCTT TGCGACGCGC TGGCAGCCGC TCGTCGACGG CGTGCGCCTG
TCGCGCGATC TGGCCAACGA GCCCGCGAAC GTCATCTACC CCGAAAGCTT CGTTGCGCGC
GTGCGTGCGG CGTTCGCGGG CGTTCCGGGC GTCAGCATCG AGGTGCTCGA CGAAGCGGCG
ATGCGGCGGC TCGGCATGGG CACGCTCGTC GGCGTGGGCC AGGGCAGCCC GCGCGGCTCG
CGCCTGCTCG CGGTGCGCTA CCGCGGCGTG GGTGCGCCCG CCGCACCGCT GGCGTTCGTC
GGCAAGGGCA TCACCTTCGA TTCGGGCGGC ATTTCGCTCA AACCCGGCAC GGGCATGTGG
AACATGAAGG GCGACATGTC GGGCGCCGCG TCGGTCGTCG GCGCGGCGCT GTCGCTCGCC
AAGTCGCGCG TGCCGGTGCA TGTCGTCGCG GTCGCGGCGC TTGCCGAGAA TATGCCCGAC
GGCAACGCGC AGCGTCCGGG CGACGTCGTG CGCACCCTGT CGGGCAAGAC GATCGAGATG
CTGAACAGCG ACGCCGAGGG CCGCCTCGTC CTCGCCGACG CTAATGAATA TGTCGCGCGC
GAATATAAGC CGCGCGCGAT CGTCAATATC GCGACGCTCA CCGGGTCGAT CGTCGGTGCA
CTCGACGACC GATATGCGGG CCTCTTTTCG CGCGATGACG AGCTTGCCGC CGCGCTGCTC
GCCGCCGGAA CCGCCAGCGG CGAGGAGCTG TGGCGGATGC CGCTGCACCG GGATTATGCC
GACAAGCTCA AATCGGACAT CGCCGACATC CGCAACATCG CGGCGGGCCA GGGGCCGGGC
GCGAGCCTCG GCGCGCATTT CATCGGCTTC TTCGTCGATG AGGACATGCC ATGGGCGCAT
CTCGACATCG CAGGCGTCAA CCGCAGCGAA TCGGCAAGCC CGCTCGTGCC TAGGGGGATG
ACGGGCTTCG GCGTGCGTCT GCTCGACCAG CTGGCGCGCG GCGGGGAGTA G
 
Protein sequence
MASRCRSSSP ICGRAGGSYA GRRSSNNGDY RTMRMKTLLL SACLSLAFTP GAMAQTVIGS 
GVVPANAANS AERAIGFASR APTGAALVIV MTDAALPPLD GVALSAPERQ AVEAAIAAAS
FDGKAESTLS LRGIGAHPRI LLVGAGPAPS SLALAEAGGK AAQEMKGEAH PVAIAGAFGD
TSAAEVAYGF ALGQYRFDRY KTVDRKTPPS AAVTLVGANP STAETAFATR WQPLVDGVRL
SRDLANEPAN VIYPESFVAR VRAAFAGVPG VSIEVLDEAA MRRLGMGTLV GVGQGSPRGS
RLLAVRYRGV GAPAAPLAFV GKGITFDSGG ISLKPGTGMW NMKGDMSGAA SVVGAALSLA
KSRVPVHVVA VAALAENMPD GNAQRPGDVV RTLSGKTIEM LNSDAEGRLV LADANEYVAR
EYKPRAIVNI ATLTGSIVGA LDDRYAGLFS RDDELAAALL AAGTASGEEL WRMPLHRDYA
DKLKSDIADI RNIAAGQGPG ASLGAHFIGF FVDEDMPWAH LDIAGVNRSE SASPLVPRGM
TGFGVRLLDQ LARGGE