Gene Sala_2290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2290 
Symbol 
ID4080786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2414708 
End bp2416204 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content64% 
IMG OID638010669 
Producttype II secretion system protein E 
Protein accessionYP_617332 
Protein GI103487771 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCAT TCGGACGCCG ACCCGGAACC GCTGGCCGCC CCGCCTTTGG CGTGGCAAAG 
CCGATGCAGG GTGGGCCGGG TGTGCCCGGC GGCGCGCAGT TCCCCGCGAT CGAAACGCCG
CCGGCGATCG ACATTCCGCA AATGCCGTCG AACCTGTCAC CCGAAGGCGA GGCGATGGAA
CGGCTGAACC AGCGTTCGAC CGCCGAGGCG GTGGAGCCCG AAAAGGCGCA AGGGTTCGAG
GCGAGCGTCC ACAAGATCAA GGAACAGGTG CTGCCGCGCC TGCTGGAGCG CGTCGACCCC
GAAGCCGCAG CGACGCTCAG CAAGGACGAG CTGACCGAAG AGTTCCGTCC GATCATCCTC
GAAGTGCTCG CCGAACTGCG CATCACGCTC AATCGCCGCG AACAATTCGC GCTCGAAAAG
GTGCTCGTCG ACGAGCTGCT CGGCTTCGGC CCGCTCGAGG AGTTGCTCGC CGATCCCGAC
ATCAGCGACA TCATGGTCAA CGGCCCCTAT CAAACCTATG TCGAGCGCAA GGGCCAGCTC
GTCCTAGCGC CGATCCAGTT CCGCGACGAA CAGCATCTGT TCCAGATCGC GCAGCGCATC
TGCAACCTCG TCGGCCGCCG CGTCGACCAG ACGACGCCGC TCGCCGACGC GCGCCTCAAG
GACGGCAGCC GCGTCAACGT GATCGTGCCG CCGCTCTCGC TTCGCGGCAC CGCCATCTCG
ATCCGCAAAT TCTCGGCCAA GCCGATCACG CTCGACATGC TCTGCCAATG GGGCGCGATG
AGCCAGAAAA TGTGTACCGC GCTGAAAATC GCGGGCGCCA GCCGCTTCAA CATCGTCATT
TCGGGCGGCA CGGGTTCGGG CAAGACCACC ATGCTCAACG CGCTCTCCAA GATGATCGAC
CCCGGCGAGC GCGTGCTGAC GATCGAGGAC GCCGCCGAAC TTCGCCTGCA ACAGCCGCAC
TGGCTGCCGC TCGAAACGCG CCCCGCGAAC CTCGAGGGCA ATGGCGCGAT CCATATGGGC
GACCTCGTCA AAAACGCGCT GCGTATGCGC CCCGACCGCA TCATCATGGG CGAGGTTCGC
GGCGCCGAAT GTTTCGATCT GCTCGCCGCG ATGAACACCG GTCACGACGG GTCGATGTGT
ACGCTGCACA GCAACAGCCC GCGCGAATGC CTCGGCCGCA TGGAGAACAT GGTTCTCATG
GGCGACATCA AGATTCCGAA GGAAGCCATC TCGAAACAGA TCGCCGATTC GGTCGACCTG
ATCGTCCAGA TCAAGCGCCT GCGCGACGGT TCGCGCCGCG TCACCAACAT CACCGAGGTG
ATCGGCATGG AAGGCGACGT CATCGTCACG CAGGAATTGT TCAAGTTCGA ATATCTGGAC
GAGGACAAGG ACGGCAAGAT CGTCGGCGAA TATCGCTCGA TGGGCCTCAG GCCCTATACG
CTCGAAAAAG CGCGGCAATA CGGGTTCGAT CAGCCCTATC TGGAGGCGTG TCTTTGA
 
Protein sequence
MSAFGRRPGT AGRPAFGVAK PMQGGPGVPG GAQFPAIETP PAIDIPQMPS NLSPEGEAME 
RLNQRSTAEA VEPEKAQGFE ASVHKIKEQV LPRLLERVDP EAAATLSKDE LTEEFRPIIL
EVLAELRITL NRREQFALEK VLVDELLGFG PLEELLADPD ISDIMVNGPY QTYVERKGQL
VLAPIQFRDE QHLFQIAQRI CNLVGRRVDQ TTPLADARLK DGSRVNVIVP PLSLRGTAIS
IRKFSAKPIT LDMLCQWGAM SQKMCTALKI AGASRFNIVI SGGTGSGKTT MLNALSKMID
PGERVLTIED AAELRLQQPH WLPLETRPAN LEGNGAIHMG DLVKNALRMR PDRIIMGEVR
GAECFDLLAA MNTGHDGSMC TLHSNSPREC LGRMENMVLM GDIKIPKEAI SKQIADSVDL
IVQIKRLRDG SRRVTNITEV IGMEGDVIVT QELFKFEYLD EDKDGKIVGE YRSMGLRPYT
LEKARQYGFD QPYLEACL