Gene Sala_0316 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0316 
Symbol 
ID4082347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp324333 
End bp325826 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content68% 
IMG OID638008675 
Producttype II secretion system protein E 
Protein accessionYP_615372 
Protein GI103485811 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.218046 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGACA GCGAACCGGC CGCGCCCGCA GCGGCGCCGG TCGACATCCC CTATGGCTTT 
GCGCGCGCAC ACGGTGTCGT CATCGCGCCG GGCGAGGAGG GCGTCTGGCT CGCGACCCTG
CGCGAAGGCA GCGATCCGGC GGTGCTGATC GAGGTCAAGC GCCACCTCGC GCAGCCGCTG
CGCGTCGCGA CCGCCGATGC GGCCGATTTC GACCGGCTGC TGTCCGATCA TTATGCGGTC
GACAGCTCGG CCGCGGCGAT GGCGGGGTCG GTCGGCAGCG ACGGGCTCGA CCTCGGCATC
CCCAGCGCCG AAGACCTGCT CGACAGCGCC GACGACGCCC CCGCGATCCG CCTCATCAAC
GCGATCATCG CCGAAGCGGT GCGGCAGGGG GTCAGCGACA TTCATATCGA ACCCTATGAA
AGCGGGCTGG TCGTGCGGAT GCGCGCCGAC GGCGTGCTGC GCGAGCATCT CAGGATGCCG
CCGCACGTCG CCCCCGTCGT CGTCAGCCGT ATCAAGGTGA TGGCGCGGCT CGACATCGCC
GAACGCCGCG TGCCGCAGGA CGGCCGCATC GGGCTGACGC TCGCGGGGAA AGCGGTCGAT
GTGCGCGTCT CGACCTTGCC GAGCCGCGCG GGCGAGCGCG TGGTGATGCG TATCCTCGAC
AAGGACGCCG CCGGGATCGA CTTCGACATA CTCGGCCTGT CTGGCGAGGC GGACCGGATC
TTGCGCGAGG CGCTGGCCGA ACCCAATGGC ATCATCCTCG TCACCGGGCC GACCGGATCG
GGCAAGACGA CGACGCTCTA TGCGGCCTTG AAGCAATTGA ACGACGGGCA GCGCAATATC
CTGACCGTCG AAGACCCGGT CGAATATGCC GTTGACGGCG TGGGTCAGAC GCAGGTGAAC
AGCAAGGTCG GGCTCGACTT TGCCGCGGGT CTGCGCGCGA TCCTGCGCCA GGACCCCGAT
GTGGTGATGG TCGGCGAAAT CCGCGACCGC GAAACCGCCG ACATCGCGGT GCAGGCCTCG
CTCACCGGCC ATCTCGTGCT CTCGACCGTC CACACCAATG ATGCGGTGGG GGCGATCACG
CGCCTGAAAG ATCTGAAGGT CGAACCCTTC CTGCTCGCCT CGACGCTGCG CGCGGTGATC
GCGCAGCGGC TGGTGCGCAA GCTCTGTGAC AATTGCCGCG AGCCGGTGCA GGCCGACAAC
AGCATTGCCG CGATGCTGGG GCTCGACATC GGCACCGTGA TCTGGCGGCC CAAGGGCTGC
GAGGCGTGCG GGGGCACGGG CTTCAAGGGC CGCATCGGCG TGTTCGAGGC GATCAAGGTC
GACGACACGG TGCGCCGCTA TATCTATGCG GGTGGCGACG AGGCGATGAT CGCGAAGCAC
GCCTTTCTGA AATCGCCGAC GCTGGCGTCC GCAGCGCGGA CGATGGTCGC GAAGGGACTG
ACGACCGCCG AGGAAGCGAT CCGCGTCGCG CGGCGCGAGG ATGTCGATGC CTGA
 
Protein sequence
MSDSEPAAPA AAPVDIPYGF ARAHGVVIAP GEEGVWLATL REGSDPAVLI EVKRHLAQPL 
RVATADAADF DRLLSDHYAV DSSAAAMAGS VGSDGLDLGI PSAEDLLDSA DDAPAIRLIN
AIIAEAVRQG VSDIHIEPYE SGLVVRMRAD GVLREHLRMP PHVAPVVVSR IKVMARLDIA
ERRVPQDGRI GLTLAGKAVD VRVSTLPSRA GERVVMRILD KDAAGIDFDI LGLSGEADRI
LREALAEPNG IILVTGPTGS GKTTTLYAAL KQLNDGQRNI LTVEDPVEYA VDGVGQTQVN
SKVGLDFAAG LRAILRQDPD VVMVGEIRDR ETADIAVQAS LTGHLVLSTV HTNDAVGAIT
RLKDLKVEPF LLASTLRAVI AQRLVRKLCD NCREPVQADN SIAAMLGLDI GTVIWRPKGC
EACGGTGFKG RIGVFEAIKV DDTVRRYIYA GGDEAMIAKH AFLKSPTLAS AARTMVAKGL
TTAEEAIRVA RREDVDA