Gene Sala_1646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1646 
Symbol 
ID4080724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1729080 
End bp1730288 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content65% 
IMG OID638010019 
Producthypothetical protein 
Protein accessionYP_616692 
Protein GI103487131 
COG category[S] Function unknown 
COG ID[COG3876] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATCT CCTTCGGTAT CGACCGCCTG CTCGCCGACC CGGATCTCCG CAAACCGCTC 
GAAGGCAGGC GCGTCGCGCT GCTGGCGCAT CCGGCATCGG TTACCGCCGA CCTGACGCAC
AGCCTCGACG CCCTCGTTGC GGCCGGGGTG AAGGTCAGCG CGGTGTTCGG GCCGCAGCAT
GGGGTGCGCG GCGACCTTCA GGACAATATG ATGGAGTCGC CCGATTTCAC CGATCCGGTG
TATGGGGTTC CATGCTTTAG CCTATATGGC GAAGTGCGCC GACCGACCGG CCAGTCGATG
CACACATTCG ACGTGATGCT GGTCGATCTC CAGGATCTCG GCTGCCGCAT CTACACCTAT
GTGACGACCC TGCTCTATGT GCTCGAAGCC GCGGCGCAGC ATGGCAAGGC GGTGTGGGTG
CTCGACCGTC CCAATCCCGC GGGCCGTCCC GTCGAGGGGA CGCGCCTGCG CCCCGGCTGG
GAGAGTTTTG TCGGCGCCGG GCCGATGGTG ATGCGCCACG GGTTGACGAT GGGCGAGATG
GGGCACTGGT TCGTCCGTCA CTTCGGCCTC GACGTCGATT ACCGCGTGAT CGAAATGGAA
GGGTGGGCGC CCGAAGGCCC CGGCTTCGGC TGGCCCATGG AGCGCGTCTG GATCAACCCC
AGCCCCAATG CCGCGAACGT CAACATGGCG CGCGCCTATG CCGGGACGGT GATGGTCGAG
GGAGCGACCT TGAGCGAGGG CCGCGGCACG ACGCGCCCGC TCGAACTCTT CGGCGCGCCC
GACATCGACG CCAAGGCGGT GATCGCGGAG ATGCAGCGCC TTGCGCCCGA ATGGCTAAGC
GGATGCAAGC TGCGCGACAT CTGGTTCCAG CCGACCTTTC ACAAGCATGT CGGCCAGTTG
AGCAGCGGCG TCCATATCCA CGCCGAGGGT GCATGGTACG ATCATAGCTC GTTCCGCCCG
TGGCGCGTGC AGGCGCTGGG CTTCAAGGCG ATCCGCTCGC TCTATCCCGA CTATCCAATC
TGGCGCGGGC TCGATTTCAA ATATGAATAT ACCGACGATG TACTGGCGAT TGATGTGATC
AACGGCGGGC CGGGTTTGCG CGAATGGGTC GATGATGCGC GCGCTGGCCC CGGCGACCTC
GACGCGCTGG CGCTGCCCGA CGAGGCGGCG TGGCAAGAAG AAATCGCGGA TCTGCTGATC
TACAACTGA
 
Protein sequence
MTISFGIDRL LADPDLRKPL EGRRVALLAH PASVTADLTH SLDALVAAGV KVSAVFGPQH 
GVRGDLQDNM MESPDFTDPV YGVPCFSLYG EVRRPTGQSM HTFDVMLVDL QDLGCRIYTY
VTTLLYVLEA AAQHGKAVWV LDRPNPAGRP VEGTRLRPGW ESFVGAGPMV MRHGLTMGEM
GHWFVRHFGL DVDYRVIEME GWAPEGPGFG WPMERVWINP SPNAANVNMA RAYAGTVMVE
GATLSEGRGT TRPLELFGAP DIDAKAVIAE MQRLAPEWLS GCKLRDIWFQ PTFHKHVGQL
SSGVHIHAEG AWYDHSSFRP WRVQALGFKA IRSLYPDYPI WRGLDFKYEY TDDVLAIDVI
NGGPGLREWV DDARAGPGDL DALALPDEAA WQEEIADLLI YN