Gene Sala_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_3042 
Symbol 
ID4083050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3191320 
End bp3192933 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content66% 
IMG OID638011428 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_618079 
Protein GI103488518 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.351967 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.568346 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC GCCGCGATTT GCTCAAACTG GCCGGTACCG GCCTCGTCGC GACGGGGCTT 
TCGGGCGTTG CGGGTCCGGC CGCGCTGGGG CAGGCCGCGT GTGCATCCCG GCCGTCGGTG
CCGGCGTGGG CCAAGGGTTT CGATGGCCAG CGTATCGCCG ACCTTGGCGA CGGGCGCTTC
CTCAATCCGA TCATGGCGGG CGATCATCCC GACCCGTCGA TCCTGAAGGA CGGCGCCGAT
TATTATATGA CCTTTTCGAC CTTCGACAGC TATCCGGGGC TGGTCATCTG GCATTCGCGC
GATCTGGTGA ACTGGCGCCC GGTCGGGCCC GCGCTGCACC GGAATATCGG GTCGGTCTGG
GCACCCGAAC TCTGCAAGCA TGGCGGGCGC TTCTATCTCT ACATTCCCAC GAAAGGGCCG
AACACGAGCT GGGTGATCTG GGCCGACCGG ATCGAGGGAC CGTGGAGCGA TCCGGTCGAC
CTGAACCTGC CCAACCACAT CGACCCCGGC CATGCGGTCG GCGAGGACGG GTCGCGCTGG
CTGTTCCTGT CGGGCGGCGA CCGCGTGCGG CTGTCGGACG ATGGGTGCAA GCGCATCGGC
GAACCCGAAC ATGTCTATGA CCCGTGGCGC TATCCATCCG ACTGGGTGGT CGAAGGCTTC
GCGCCCGAAG GGCCAAAGAT CACGCGGCAT GGCGACTATT ATTATATGAT CACCGCGGTC
GGCGGCACCG CAGGCCCGCC GACCGGCCAT ATGGTGATCG CCGCGCGCTC GACATCGATC
GACGGGCCGT GGGAAAATTG CCCGGCGAAC CCGCTCGTCC GCACGGTGTC GGCCGCCGAA
AAATGGTGGT CGCGCGGCCA TGCGACGCTT GTCGAGGGGC CTGCGGGCGA CTGGTGGGCG
GTCTATCACG GCTATGAGAA TGGCTTTTGG ACCCTGGGGC GGCAGGCCTT GCTCGACCCC
GTCGAATGGA CCGACGATGG CTGGTTGCGC ATGACGGGCG GCGATTTGTC GCAGCCGATC
GCCAAGCCGA AAGGCGGCAG CGTCGCGGGG CCGCATGGTA TGGCGCTGTC GGACGATTTT
TCGTCGCTCG CGCTCGGCGC CAAGTGGAAC TTCTTCAAGC CCGCGTCGGA CGAACACCGT
CGCGCGCGTG TCGAGGACGG CGCGCTGGTG CTGACGGCGC GCGGGGAGGC ACCGGTCGAT
TCATCGCCGC TGCTGCTGAT CGCGGGCGAC CATGCGTATC GTTTCGAATG CGATATCGAG
ATCGCGCCCG GCGGCACCGC GGGGCTGATC CTTTTCTATG ACGAGAAACT CTATTGCGGA
CTCGGCTTTG ACCGCGATCG CTTCGTGACG CATCAATATG GCATCGAACG CGCGCGGCCG
GTCAATCCGC ACGGGACACG GATGCGGATG CGCGTCACCA ACGACCGGCA TATCGTGACT
TATGACACCA GCGGCGACGG CGGACGGACC TGGGTGCGCT TCGACCGGGG GATGGAGGTG
TCGGGATACC ACCATAATGT GCGCGGCGGC TTCCTGATGC TGCGCCCCGG CCTCTATTCG
GCGGGCGCCG GCGAGGCGCG CTTTCGCAAT TTCACCTTCC GCGCGCTCGA TTAA
 
Protein sequence
MTDRRDLLKL AGTGLVATGL SGVAGPAALG QAACASRPSV PAWAKGFDGQ RIADLGDGRF 
LNPIMAGDHP DPSILKDGAD YYMTFSTFDS YPGLVIWHSR DLVNWRPVGP ALHRNIGSVW
APELCKHGGR FYLYIPTKGP NTSWVIWADR IEGPWSDPVD LNLPNHIDPG HAVGEDGSRW
LFLSGGDRVR LSDDGCKRIG EPEHVYDPWR YPSDWVVEGF APEGPKITRH GDYYYMITAV
GGTAGPPTGH MVIAARSTSI DGPWENCPAN PLVRTVSAAE KWWSRGHATL VEGPAGDWWA
VYHGYENGFW TLGRQALLDP VEWTDDGWLR MTGGDLSQPI AKPKGGSVAG PHGMALSDDF
SSLALGAKWN FFKPASDEHR RARVEDGALV LTARGEAPVD SSPLLLIAGD HAYRFECDIE
IAPGGTAGLI LFYDEKLYCG LGFDRDRFVT HQYGIERARP VNPHGTRMRM RVTNDRHIVT
YDTSGDGGRT WVRFDRGMEV SGYHHNVRGG FLMLRPGLYS AGAGEARFRN FTFRALD