Gene Sala_2549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2549 
Symbol 
ID4081464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2687221 
End bp2688693 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content67% 
IMG OID638010926 
Productsecretion protein HlyD 
Protein accessionYP_617588 
Protein GI103488027 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG0845] Membrane-fusion protein
[COG5569] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCACG CACTTTCCGC GCGGGCGCGG CTCCTTCTGG CCGCAGCGGC GATTGCCCTG 
GTCGGGGGTG CCGTTGGCTA TGGCGTGGCC AGCTTGAGCG GCACCGGGCC GGCAGCCGAG
AGGACCGAAG GGGACCGCAA GGTGCTCTAC TGGTATGACC CCATGTATCC CAACCAGCAT
TTCGACAAGC CCGGCAAGTC GCCGTTCATG GATATGGAGC TCGTCCCCAA ATATGCCGAT
GAGGCAACAA GCGAGGCAGG GGTGCGCATC GATCCGGCGC TCGTCCAGAA TCTGGGCGTG
CGCACGGCCG AGGTCCGCCG CGGCACGCTG GGCGGCGGCC TCTCGGCCAC AGGCGTCATC
GGCTATAACG AGCGCGAGAT AGCGATTGTC CAGCCGCGCG CCGGCGGCTT TGTCCAGCGG
ACCTATGGGC GGGCGCCCGA TGACGTGATC GAAGCAGGCG CGCCGCTGGT CGATCTGCTT
GTGCCGGACT GGGGCGGCGC ACAGGCCGAG TTCCTGGCGG TGCTGCGCAC GGGCGACAGG
GCCCTGGCAA GCGCCGCGCG CCAAAGGCTG GTGCTGCTCG GCATGCCGCA ATCCACCATT
GCCGCCGTCG AACGCAGTGG CAGGCAGCGC AATGTCATTA CCATCAACTC GCCGATTGGC
GGGACGATCA AGTCGCTCGG CGTGCGGCAG GGTATGAGCG TGATGGCGGG GCAAACGCTC
GCCGAAGTAA ACGGGCTCGG CACGGTGTGG CTCGATGCCG CGGTCCCCGA AGCCATGGCG
GGACGATTGC GGCCGGACAT GCCGGTGACG GCGACGCTGG CAGCCTATCC GGGCGAGAGT
TTTGCGGGCC GGATTCGGGC GATCCTGCCG CAGGCTGAAA CCGAAAGCCG GACCATCACC
GCGCGCGTCG AGATCCCCAA TCGCGGCGGG CGGCTGCGGC CAGGCATGTT CGCCACCGTC
AGCTTCGCTG GCGAGCAGCG CCCCGCGCTG CTCGTACCCT CCGAGGCGCT GATCCGCACG
GGCAAGCGCA CGCTCGTCAT GCTGGCGTTG GATAAGGGAC GCTATCAGCC GGCCGAAGTC
AGAACGGGAA TGGAAGCGGA CGGCCAAACC GAAGTGCTCG CCGGACTTGC CGAGGGTGAG
AAGGTCGTCA CCTCGGGCCA GTTCCTGATC GATTCCGAAG CCAGTCTTTC GGCAATGCAG
GCGCGGCCGA TCGCGGGTGG TTCGCCGCAG GCGCCGTCCA AGGCTCAGGG CCACAGGGCT
ATCGGCACAA TCGAAAAGAT CGAGCCCGGC AGCGTGACGC TGAGGCATGG GCCGGTCCCG
TCGGCGAGTT GGCCCGCAAT GACGATGCGC TTCCGGCTTG CCGATCCTGC AACGGTGCGC
GGGTTCAAAC CCGGCGACAA GGTGAACTTC ACCTTCGACC AGCCTGCGCA AGGCCCGACC
GTCCGCTCGA TCACGCGCGA GAACGGCCGA TGA
 
Protein sequence
MKHALSARAR LLLAAAAIAL VGGAVGYGVA SLSGTGPAAE RTEGDRKVLY WYDPMYPNQH 
FDKPGKSPFM DMELVPKYAD EATSEAGVRI DPALVQNLGV RTAEVRRGTL GGGLSATGVI
GYNEREIAIV QPRAGGFVQR TYGRAPDDVI EAGAPLVDLL VPDWGGAQAE FLAVLRTGDR
ALASAARQRL VLLGMPQSTI AAVERSGRQR NVITINSPIG GTIKSLGVRQ GMSVMAGQTL
AEVNGLGTVW LDAAVPEAMA GRLRPDMPVT ATLAAYPGES FAGRIRAILP QAETESRTIT
ARVEIPNRGG RLRPGMFATV SFAGEQRPAL LVPSEALIRT GKRTLVMLAL DKGRYQPAEV
RTGMEADGQT EVLAGLAEGE KVVTSGQFLI DSEASLSAMQ ARPIAGGSPQ APSKAQGHRA
IGTIEKIEPG SVTLRHGPVP SASWPAMTMR FRLADPATVR GFKPGDKVNF TFDQPAQGPT
VRSITRENGR