Gene Sala_0004 details


Gene Information

Locus tag: Sala_0004
Symbol:
ID: 4083135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 3205
End bp: 4959
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 64%
IMG OID: 638008364
Product: putative inner membrane protein translocase component YidC
Protein accession: YP_615063
Protein GI: 103485502
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0706] Preprotein translocase subunit YidC
TIGRFAM ID: [TIGR03592] membrane protein insertase, YidC/Oxa1 family, C-terminal domain
            [TIGR03593] membrane protein insertase, YidC/Oxa1 family, N-terminal domain
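
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3205, 4959)
print(length, protein_length(length))  # prints "1755 584"
```

Under these assumptions the result agrees with the record: 1755 bp and 584 aa.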


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0662479
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACGACA AGCGTAACCT GATCGCGGCA ATCCTGTTGT CGGTGGCGAT TTTGATCGGC 
TGGAACTTCG TCGCCGAGCG GTTTTTCCCG ACCCCCGACC AGCCCGATGT GACCAAGACC
GTCGCCGGCG CCAACGGCGC GCCCGCGACC GCGCCCACGG CGCAGGGCCA GCCGAGCGCG
CTGCCCGCCC CGACGGCGGC CACCCCCGCG GCGGCGCAGG CGATCCGCCC GGTCGACGTC
GTGCTCGCCG AAGGGCAGCG CATCCCGATC GAAACCCCGG CAATCCGCGG CTCGATCAAC
CTCGTCGGCG CGCGCATCGA CGACATCACG CTCACCAAAT ATCGCCAGAC GATCAAGAAG
GACTCGCCCG CGGTGCGGCT GTTCGCGCCG GGCGGCACCC CCGCCGCCTA TTTTGCCAGC
CTCGGCTGGA CGGCGCAGGG CATTCAGCTT CCGGGCGCGG GCACCGTCTG GACTGCGAAC
GGAACGAAGC TGACCCCGAC CACCCCGGTA ACGCTGAGCT GGACGAACAC GACCGGCCAG
ACCTTCCGCA TCGAATATAG CATCGACGCC AATTACCTGA TCACCGCGAA GCAGACGGTG
GTGAACGCCG GCACCGCGCC GGTGTCGCTC AGCAGCTTTG CGCTCATCGA CCGCCTCGGC
AAGCCGACCG ACCCGCACGA GGTCGACAGC TGGACGATCC ATGTCGGGCC GACCGGCTAT
CTCGACGGCA AGTCGGTGTT CGACATCGAC TATGACGACC TCGAGGAAGC GCCGAACCGC
AGCGTGCGCT ACACTTCGGC AGGCTGGCTG GGCTTCACCG ACAAATATTG GCTCGCCGCG
ATCGTCCCGG CGAAGGGCGA ACGCGTGACC GCGGCGATCA GCTCGCCTGC GACGAACAAT
TACCAGACCT TGTTCGCGCG TGATTTTACG CAGGTCGCGC CGGGCCAGCA ACTGACGGCG
ACGAGCCGCA TCTTTGCGGG CGCCAAGGAA GTCGAGATTC TCGAAACCTA TCAGGACGAC
CAGGGCATCA CCCGCCTGTC GAACGCGATC GACTGGGGCT GGTTCGAGTT TTTCGAGGTG
CCGATTTTCA AGCTGCTCCA CTGGCTGTTC GAGAAGGTCG GCAATTTCGG GCTCGCGATC
ATGGCGCTGA CGCTGATCAT CCGCCTGCTG ATGTTCCCGA TCGCCAACCG GCAGTTTTCG
TCGATGGCAC AGATGCGCGT CGTCCAGCCG AAGATGAAGG CGCTGCAGGA GCGCTACAAG
GACGACAAGC CCAGGATGCA GCAGGAGCTG ATGAAGCTCT ATAAGGACGA GAAGATCAAT
CCGCTCGCGG GCTGCCTGCC GATCGTCATC CAGATCCCGA TCTTCTATGC GCTGTACAAG
GTGCTGATGC TGGCGATCGA AATGCGCCAC CAACCGTTCA TCCTGTGGAT CAAGGATCTG
TCGGCGCCCG ATCCGCTGCA CATCCTCAAC CTCTTCGGCC TCTTGCCCTT CACCCCGCCG
TCGATCCTGG CGATCGGGCT GCTCGCGGTG ATCCTGGGCG TGACGATGTG GCTGCAGTTC
CGTCTGAACC CCGCCCCCGC CGACCCGGTG CAGGCGCAGG TGTTCAAGAT CATGCCGTGG
CTGTTCATGT TCATCATGGC GCCCTTCGCG GCGGGCCTGC TGCTTTACTG GATCACCAAC
AATATCCTGT CGATCGGGCA GCAGCAGTGG ATGTATCGCA AGTTCCCGGC GCTGAAGGCG
GCGCCGGCGA AGTGA
 
Protein sequence
MDDKRNLIAA ILLSVAILIG WNFVAERFFP TPDQPDVTKT VAGANGAPAT APTAQGQPSA 
LPAPTAATPA AAQAIRPVDV VLAEGQRIPI ETPAIRGSIN LVGARIDDIT LTKYRQTIKK
DSPAVRLFAP GGTPAAYFAS LGWTAQGIQL PGAGTVWTAN GTKLTPTTPV TLSWTNTTGQ
TFRIEYSIDA NYLITAKQTV VNAGTAPVSL SSFALIDRLG KPTDPHEVDS WTIHVGPTGY
LDGKSVFDID YDDLEEAPNR SVRYTSAGWL GFTDKYWLAA IVPAKGERVT AAISSPATNN
YQTLFARDFT QVAPGQQLTA TSRIFAGAKE VEILETYQDD QGITRLSNAI DWGWFEFFEV
PIFKLLHWLF EKVGNFGLAI MALTLIIRLL MFPIANRQFS SMAQMRVVQP KMKALQERYK
DDKPRMQQEL MKLYKDEKIN PLAGCLPIVI QIPIFYALYK VLMLAIEMRH QPFILWIKDL
SAPDPLHILN LFGLLPFTPP SILAIGLLAV ILGVTMWLQF RLNPAPADPV QAQVFKIMPW
LFMFIMAPFA AGLLLYWITN NILSIGQQQW MYRKFPALKA APAK
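
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG can act as an alternative start codon and is then read as methionine. The sketch below illustrates this on the first and last codons of the sequence. It is a simplified illustration: the helper name `translate_cds` and its `start_as_met` flag are hypothetical, the codon table is built from the standard amino-acid assignments (which table 11 shares for non-start codons), and the first codon is treated as the initiator without checking that it is a valid start.

```python
BASES = "TCAG"
# Amino-acid assignments for the 64 codons in TCAG order (standard code;
# table 11 uses the same assignments and differs only in allowed start codons).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str, start_as_met: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        if pos == 0 and start_as_met:
            residue = "M"  # simplification: the initiator codon is read as Met
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence shown above.
print(translate_cds("GTGGACGACA AGCGTAACCT GATCGCGGCA"))        # MDDKRNLIAA
print(translate_cds("GCGCCGGCGA AGTGA", start_as_met=False))    # APAK
```

Both fragments match the corresponding ends of the listed protein sequence.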