Gene Sala_1992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1992 
Symbol 
ID4082157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2101805 
End bp2104129 
Gene Length2325 bp 
Protein Length774 aa 
Translation table11 
GC content71% 
IMG OID638010368 
Producthypothetical protein 
Protein accessionYP_617036 
Protein GI103487475 
COG category[S] Function unknown 
COG ID[COG5448] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02217] conserved hypothetical protein TIGR02217 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0145174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0070387 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCTGGG CCCTGGTGGC GGCGGCCGAG CCGCATCATC GCAAGGGGTG GATCAAGCGG 
TTCGACCCGC GCTTCTGGAC GGTCGATTTT GCGCGGCCGA TGATGGCGAG CGTGACGAGC
GCGGCGCCGG GGGCGCTGCG CGTCGAGGCG GTCTTTTACC GCAAGCAGGA TCTGGCGGGG
CTGATCTGGG AGACGGCGGA CCGGTGGGAT CACCCGCTGC TCGCCTATGA AACGCGGCGT
GATTTCCGGC ACACGCAGCT CCTTTTCCGA TGGCGGTCGG GCGGGGTGAA GCCGCTCGAC
GCGCTGCATG GCCCGACGCT GACGATCGAG GGGCGCGATG CGGGAGGCAA TCCGCGCGCC
TGGTATGTGC GGTTGTGGAA CTATGCCGAG GGGAGTGCGG AGGATGCGGT GGTGACGCTC
GATTTCGATG CGCTCGACGG CGGTTTCCTG CTTCCCGGCG AGGCGGATCC GGTGTGGGCG
GGGGATGTCG ACAGGATGTT CGTTTCGCTG GTGCCGCCGA CCTATGACGG CGGCGAGGGC
GATTTGGATG CGCCGGTCGA GGGCTGGGCC GAGATGAGCG ACATCGTCTG CACCGGATCG
GGTTCGGTGC TGGCGATCGG CGATGCGGTG CTCCCCGAAA CGGCGCTGGG CATGACCAAC
GGCTATGACG ATTGCTATCA CCTGACCCCG GCGCGCGTGG TGCGGCAGAT CGTGCAATTG
GGCTATCGCG GCGACGTCGT CCATTATGTC GGGATGAGCC ATTATATGCG GCTGGCGCCG
TCAGCCGCGG GCCTGCTCGC GGATGCGTCG GCGGGCGCGT TGAACGGTCC GTGCGCGGCC
TGGCATCGCG ATTTGGCCGC CGCGTGCGCG GCGGCGGGGC TGGGGCTGAT CTGGTCGCTG
TCCTATGAAT TGTTCGACGC CTATTGCCCC GAGGACTGGA AGCAGCGCGA CGCCGATGGC
GCGCCCGCGC TGACCGGGTG GGAGCCGCCC TCGACCCTGC TGTCGCCGGC GAATGCGGCG
GCGATGGCCT GGTTGCAGGC GGTCGCGCGG GCGTTCGTCG CGATCGGGCG CGATGCCGGG
CTGGCGTGCA GGTTTCAGGT CGGCGAGCCC TGGTGGTGGA TCGCGGACGG GGGGCGCATC
TGCGCCTATG ACGCGGCGGC GTCGGCGGCG CTGGGCGGCG CGGGCGTGGC GATTGCCGAC
GTGCGCGGGC CGCTCGACGC GGCGCAGCGG GCGATGCTCG ATGCGCTGGG CGCCTTGCTC
GCCGATTCGA CCGCCGCGCT GGTCGCGGCG GCGCGGGAGG AGGCGGGTGC GGCGGGGCTG
GTCAGCCACT TGCTCGTCTA TCTGCCGACG GTGCTCGACC CGGCGGCGGC AGAGGTGCGG
CGCGCCAATG TGCCGCTCGG CTGGGCGGCG CCGGCGTTCG ACGTGCTCCA GCTAGAAGAT
TATGACTGGG TGACCGGCGG GCGCAGCGGC GAGACCGCTG GCGCGCGCGC CGCGATGGCG
CTGCGGCTCG GTTATCCCAT TGCAGAACAG CATTATTTTT CGGGGTTCGT GCTGCTGCCC
GAACAGCGCG GCCGGTGGGC GGCGATCGCA GAGGCCGCCG ACGCGGCGCG GCGCGCGGGG
GTGGCGCGGA CCTTTATCTG GGCGCTGCCG CAGGTGGCGC GCGACGGCTT CACGGCCTTT
GATGGGGAGG ACAGGGTGCA GGCGTTCGAT GCAGTGGATT TCCCGATTGC GATCGGGCGC
GAGGCGGTGG CGCTGACCGA ATTTTCGACG CAGATCGTCA GCTCGCCATC GGGGCACGAA
CAGCGCGCGA GCGAATGGGC CGAGGCACGG ATGCGCTATG ACGCCGGGCC GGGGATCAGG
TCCGAGGCCG ATGTGCGCGC GCTGACGGCG TTTTTCCGCG CGCGGCGCGG CGCGGCGCGC
GCGTTCCGCT TTCGCGATCC GTTCGACAGC AGTTCGGCCG CCGATAACGG GCTGCCGACC
GCCGAGGATC AGTGGCTGGG GACAGGCGAC GGGGTGCGGC GGCAATTTGC GCTGGTGAAG
CGTTATGGCG AAGGCGACGC CGAAGCGGTG CGGCCGATCC GCCTGCCGGT CGCGGGGAGC
GTGCGCGTGT CGGTGGGTGG GATCGAGACG GCGGCGTTCC TGGTGACGGG CGAAGGCGAG
GTGCTGCTCG ACGCGGCGCC CGCCGAGGGC GCGGTAGTGC GGGCGGGATT CCGCTTCGAC
GTGCCGGTGC GCTTTGCCGA CGACCGGCTG GAGGTGAGCC GCGCGACCTT CCTCGCGGGC
GAGCTGGCGA GCGTGCCGCT GGTCGAGGTG CGCGCGCCAT GGTGA
 
Protein sequence
MGWALVAAAE PHHRKGWIKR FDPRFWTVDF ARPMMASVTS AAPGALRVEA VFYRKQDLAG 
LIWETADRWD HPLLAYETRR DFRHTQLLFR WRSGGVKPLD ALHGPTLTIE GRDAGGNPRA
WYVRLWNYAE GSAEDAVVTL DFDALDGGFL LPGEADPVWA GDVDRMFVSL VPPTYDGGEG
DLDAPVEGWA EMSDIVCTGS GSVLAIGDAV LPETALGMTN GYDDCYHLTP ARVVRQIVQL
GYRGDVVHYV GMSHYMRLAP SAAGLLADAS AGALNGPCAA WHRDLAAACA AAGLGLIWSL
SYELFDAYCP EDWKQRDADG APALTGWEPP STLLSPANAA AMAWLQAVAR AFVAIGRDAG
LACRFQVGEP WWWIADGGRI CAYDAAASAA LGGAGVAIAD VRGPLDAAQR AMLDALGALL
ADSTAALVAA AREEAGAAGL VSHLLVYLPT VLDPAAAEVR RANVPLGWAA PAFDVLQLED
YDWVTGGRSG ETAGARAAMA LRLGYPIAEQ HYFSGFVLLP EQRGRWAAIA EAADAARRAG
VARTFIWALP QVARDGFTAF DGEDRVQAFD AVDFPIAIGR EAVALTEFST QIVSSPSGHE
QRASEWAEAR MRYDAGPGIR SEADVRALTA FFRARRGAAR AFRFRDPFDS SSAADNGLPT
AEDQWLGTGD GVRRQFALVK RYGEGDAEAV RPIRLPVAGS VRVSVGGIET AAFLVTGEGE
VLLDAAPAEG AVVRAGFRFD VPVRFADDRL EVSRATFLAG ELASVPLVEV RAPW