Gene Sala_1031 details


Gene Information

Locus tag: Sala_1031
Symbol:
ID: 4082314
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1065202
End bp: 1066893
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 62%
IMG OID: 638009391
Product: peptidase M28
Protein accession: YP_616081
Protein GI: 103486520
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
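
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Protein length for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1065202, 1066893)
print(length)                     # 1692
print(protein_length_aa(length))  # 563
```

Both results match the Gene Length and Protein Length fields of this record.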


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.418542
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTATCC CCCCCCCGCT CAAGGTCGCC GGCCTTGCCG CGCTGGCGTT GCTCGCCGCG 
TGCAACGGCA GCGCCGACCC ACAGAAAAAT GCCGCCACCG CGATCCCCGA TGTCGAGATT
CCCCAACTAT CGCTCGCAAC CTTGCAGGAA GTAACGAGAG AATTGTCGTC CGACGCCTAT
GAAGGGCGTG CGCCCGGCAC CGCGGGCGAA GAAAAGACCG TCGCCTACAT CATCAAGAAA
TACAGGGAAG CGGGCTTGCA GCCCGGCAAC AACGGCCGCT GGACACAGGA CGTGCCGCTG
GTTGAAATCA CCGCGAAGAA CGCCACCCCG CTGACCTTCA CCGGCGGCAA GACGCCCGTG
ACGGCACAAT ATGCCAAGGA TTATGTCGCG TTCAGCTACC GCGTCCAGCC GAGGACCGAA
GTCAAGGACA GCGACGTCGT GTTCGTGGGA TATGGCATCA ATGCCCCCGA AAAGGGCTGG
AACGACTATG CCGGGCTGGA TGTGAAGGGC AAGACGGTCG TTGTCCTGGT GAACGATCCC
GACTGGGAAA ACAGGGAAAC CGAAGGCCCG TTCAACGGCC GCGCCATGAC CTATTACGGC
CGCTGGTCAT ACAAATATGA GGAAGCTGCG CGGCAGGGAG CCGCCGCCGT GCTGATCGTT
CACGACACCG AGCCTGCCGC CTATGGCTGG AACGTCGTCG AATCGAGCAA TACGGGCACG
CAATATCTGG CCGAAAGCAA GAATGGCGGC GCCGACCAAA CGATTGCCAA TGGCTGGATC
CAGTTGGCCA AGGCGAAGGA ACTCTTCGCA AGCGCGGGAC AGGATTTCGA CAAGCTGCGC
GAGGCGGCGA AACAAAAGGG GTTCAAGCCC GTGCCGCTGG CCGGGGTGAA GGCGAGCTTT
GCCTTCGACA ATGACATCGC CAAGAAAATG TCGCGCAACG TCATCGGTGT GCTGCCGGGC
GCCAAGCGGC CCGACGAATA TGTGCTTTAC ACGGGTCATT GGGATCATCT GGGCCGCTGC
ACGCCCGTCG ACGGCGACGA CATCTGCAAC GGCGCGGTCG ACAATGCGAG CGGTATCGCA
GGGCTCGTGA CGCTGGCGAA GGCGTTCAAG CAGGCGGGCG CGCCCGATCG CAGCATCGTC
TTTCTTGCCG TCACCGCCGA GGAATCGGGC CTGCTCGGAT CGAAATACTA CGCCGAAAAC
CCGGTCTTCC CATTGTCGCA GACGGTCGGC GGCGTGAATA TGGATGCGCT GAACGCGGTC
GGGCCGGCGA AGGACATCGT CGTGGTCGGG GCCGGCAAGT CCGAACTTGA CGCCTATGTC
GAGAAACTCG CCCGGATGGA GGGTCGCACG GTCAAGCCCG AACCGACCCC CGAAAAGGGT
TTCTATTACC GGTCGGATCA TTTCAGCTTC GCCAAGCTGG GCGTCCCGAT GTTCAATTTC
GGCAGCGGCG ACGATCTGGT CGATGGCGGC GTCGAGGCGG GTCAGAAAGC GGCCGAAGAC
TATGAAAAGA ATCGCTATCA CGCCCCCGAC GACGAATATG AGGCGATCAC CAACTGGGAG
GGCATGATGT CGGACCTGCG CCTCTATTAT GCGGCGGGGC GGATGCTCGC GATGACCGAT
GCGTGGCCGA ACTGGAACGA AGGCGACGAG TTCCGCGCCG CCCGCGACAA GTCGCGCGCC
GCAGCAAAAT AA
 
Protein sequence
MPIPPPLKVA GLAALALLAA CNGSADPQKN AATAIPDVEI PQLSLATLQE VTRELSSDAY 
EGRAPGTAGE EKTVAYIIKK YREAGLQPGN NGRWTQDVPL VEITAKNATP LTFTGGKTPV
TAQYAKDYVA FSYRVQPRTE VKDSDVVFVG YGINAPEKGW NDYAGLDVKG KTVVVLVNDP
DWENRETEGP FNGRAMTYYG RWSYKYEEAA RQGAAAVLIV HDTEPAAYGW NVVESSNTGT
QYLAESKNGG ADQTIANGWI QLAKAKELFA SAGQDFDKLR EAAKQKGFKP VPLAGVKASF
AFDNDIAKKM SRNVIGVLPG AKRPDEYVLY TGHWDHLGRC TPVDGDDICN GAVDNASGIA
GLVTLAKAFK QAGAPDRSIV FLAVTAEESG LLGSKYYAEN PVFPLSQTVG GVNMDALNAV
GPAKDIVVVG AGKSELDAYV EKLARMEGRT VKPEPTPEKG FYYRSDHFSF AKLGVPMFNF
GSGDDLVDGG VEAGQKAAED YEKNRYHAPD DEYEAITNWE GMMSDLRLYY AAGRMLAMTD
AWPNWNEGDE FRAARDKSRA AAK
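
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks have to be removed before any calculation. The following is a minimal sketch of how the blocks could be joined and the GC percentage computed for comparison with the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(text):
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCCTATCC CCCCCCCGCT CAAGGTCGCC GGCCTTGCCG CGCTGGCGTT GCTCGCCGCG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60 bases (six blocks of 10)
print(round(gc_percent(seq), 1))  # GC percentage of this line only
```

Passing the full gene sequence through the same two functions would give a value to compare with the reported GC content, assuming that field is computed over the gene itself.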