Gene Sala_0354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0354 
Symbol 
ID4081226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp366220 
End bp369297 
Gene Length3078 bp 
Protein Length1025 aa 
Translation table11 
GC content69% 
IMG OID638008713 
Producthypothetical protein 
Protein accessionYP_615410 
Protein GI103485849 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.219727 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.251109 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTTC GAGTTCCGCA TTGGATGCCC GCTCTGGCGG CCGTTGTCCT GACGGTGCCG 
ACAGGCGCGC TGGCGCAGGG CGGCGATCCG ACGCTGCGCG ACAGTTTTTC GATCGGGTCC
GAAGGCGGCG CGCTGTGCGA GGTTCAGGCA ACGGTGCGCG ACCCGGTTGT CGAGGGCATG
TTCGAGCGCG CATGGACCGT CGTGTGCCGC GACGCGAGCC AGCCCGTCGG CACGATTCGT
GTGCTGCGCG CGGCGACCGA CGACGCCCGC GCGCGCATCG AACGGGCGCG GGCAGACCGC
GTGACCTGCA CGGCGGGCCG CTGCACGCTG CGCGACAGCG ACGCCGTCTG GACGACGCGG
ATCGAAACCA ACGGCGCAAA TGCCTACACC GCCGAAGGGT TCGAAGCCTA TGCCGACGCA
CTCGCCATCG CGCTCGAATC GGTGCGCCAG CGGCGCGTCG TTCCCGGCGT GATCCGCGTC
GCGACAACCT CGGTCGGCGG CAATGACGGT TTTGCCCGCA CGCTCGCGGG CGCGATCGAC
ATCGACAAGG CGCTCGCCGA GGGATACCGC CGCAACCACA GCGGCGATTA TGCCGAGGCG
GCCGAGTTTT TCGACGCGCT GTCGCGCCGC GCGCTGGAGG AGCAGGCGGC GGTCGGCGTC
GACGCGACCG AATTCACGCT CAATCGCGCG CTGCAACGTT CGAACCTCGG CGAATTTGCC
GAGGCCGAAC GCCTGTTTGC CGAAGTGGAA GCGATCCCGA CCAGCGATCC GGTCCAGCTG
CGCCTGCGTC GCAATTTCCG GGCAATCCAC GCGCTCAATC AGCGCGATCT GGACGGCGCC
GCGGCGCGAC TGCAAGCAGC GATCCCGCCG CTCGCAACGG GCGTGATCGT CGCCGACGGC
GCGGTGACGC TGGCGCCGGC GATCGTCGCC GGGGTGAACA GCGGCGGCGA TGCCCGGTTG
GTGCAGGCGG GCGAACGCGA ACGATTGTCG CCGATCGAGC GGGCACAGAT CATCGACGCG
CAGGCGGTCC ACCTGCTCGG CACCGTCGAA CGCTTGCGCG GCGATGCCGC GGCGGCCAAG
GCGGCGCAGA TCAAGGGCCT GGCCGATGCC CTCGCGGTGC GCGAAGGGCG CGTCACCTCG
ATCATCCGCC TGCGTTCGCA GATGCTGGGC GAACTGGCGC TCGCGGAGGA AGCGCTCGGC
GATATCGGCG CTGCCGACGC GCGCTTTGGC GAGGCGGTCG CGCTGCTCGC GGTCGAATAT
CCCGAAACCA CCGCCCTCGC CTCGGCGCGC GCGCGATATG CCGCCTTCCT GACCCGGCAG
GGGCAGGACG ACAAGGCGCT GGGCATCTAT CGCGAGGTTG TCGGCGCACT CGCCGAATCG
CAACGTTCGA CCGTGGGCAT GGCCAATATG ATGGCGCCCT ATTACCGGCT GCTCGCGGCG
CGCGCCGACA GCGACCCCGC CGCGTTGCAG GATTTCTTCG TCGCCAGCCA GCTCCAGATA
CGCCCCGGCG TCGCCGATAC GCAGGCCGTA CTCGCGCGCG AACTGTCGAG CGGCAGCGAT
GAGGGCGCGC GCCTGTTCCG GCAGGCGACG ACGCTGAACC GCGACATCGA ACGCGCGCGG
ATCGAGGATG CGCGGCTCGC GCAGCTCCCG CAATCGGCCG AAATTGCCGC GCTGCGCGCA
GACATCCGCA CCCAACTCGA CAATCTGGCG TTTCAGCAGG CGGAAACGAT CGTCCGCCTA
TCGGCCTTTC CGCAATATCG CGTCGTCGCG CCGGGCAAGC TGGACCTCGG CGAGCTGCAA
GCGGTATTGC GCGACGATGA AGCCTATCTG AAGATGCTGG TGGTCGGCGA CAGCGTCTAT
GCGATGCTCG TCGAATCCGG CGGCGCGATG CTCTGGAAAT CGGACATCGG CGCTGCCGAC
CTCGAACGCG CGGTCGATGC GATCCGATCG ACGATCTCGA TCGTCGAAAA CGGCCGCCGC
GTCACCTATC CCTTTGATGC GGCGACGGCG CGCCGTCTCT ACGGCCAGCT TTTCGGCCCC
GTCGCCGCGC GGCTACCGAT GGTGCCGCAT CTGATTTTCG AGCCCGACGG CGCGATGCTG
CGCCTGCCGG TCAACCTGCT CATCACGTCC GATACGGGCC TTGCCGCTTT TGAACAGCGC
GTGCTGGATC CCGAGGCCGA CCCGTTCGAC ATGCGCGACA TTGCGTGGCT CGGCCGCACG
AGCCGCCCCA GCACCGCCGT TTCCGCCCTG GCCTTTCGCA ACGCGCGGCA GGCCGCGCCG
TCGAAGGCTG CGAACCAGTA TTTCGGGCTT GGCGAGAATC TGCCGCTTGG CGACCGGCTG
CCTTCGCTCG GCACGCGCGG CGCGGCGGGC GGCATGGACG GCGACTGCCT GTGGGACGCC
TCGCAATGGG CGCGGCCGAT CTCCGCCGAT GAACTGGTCA CCGCGCGCAA CGCAATGGGC
GCAGACGCGG GCGCCTTGCT CACCGGCGGC GCCTTCACCG ATACGGCGGT CAAGACCCGC
GACGACCTTG CCGACTATCG CATCATCCAT TTCGCAACGC ACGGCCTTGT CACCGCCCCG
CGCCCCGCCT GCCCCGCGCG CCCGGCGCTT GTCACCTCCT TTGGCGGGCA GGAATCGGAC
GGACTGCTGA CCTTTCAGGA AATCTTCGAC CTCAGGATCG ACGCCGACCT TGTCATCCTT
TCGGCGTGCG ACACCGCGGG CGCGGCGAGC GTCGCGGCGA CGCGCGAGGC GGGGCTCTCG
GGCGGCGGCA ATGCGCTCGA CGGGCTGGTG CGCAGCTTCA TTGGCGCCGG CGGCCGCTCG
GTGATTGCGA GCCACTGGCC GGCCCCCGAT GATTTCGACG CGACGACGCG GCTGATCAGT
GGACTGTTCA CCGCCGACGA CGGCGCGAGC GTGGCCGATG CGCTGTGGGC GACCCAGCGG
CGGCTGATGG ACGATCAGCA GACGTCCCAT CCCTATTATT GGGCGGGCTT CGCGATCATC
GGCGACGGTG CGCAGCCGCT GCTTCGCGGC GCGCAAACGG CGCGGCACGG ACAGGCGGCC
GGCCGCGCCG CGCGCTGA
 
Protein sequence
MTVRVPHWMP ALAAVVLTVP TGALAQGGDP TLRDSFSIGS EGGALCEVQA TVRDPVVEGM 
FERAWTVVCR DASQPVGTIR VLRAATDDAR ARIERARADR VTCTAGRCTL RDSDAVWTTR
IETNGANAYT AEGFEAYADA LAIALESVRQ RRVVPGVIRV ATTSVGGNDG FARTLAGAID
IDKALAEGYR RNHSGDYAEA AEFFDALSRR ALEEQAAVGV DATEFTLNRA LQRSNLGEFA
EAERLFAEVE AIPTSDPVQL RLRRNFRAIH ALNQRDLDGA AARLQAAIPP LATGVIVADG
AVTLAPAIVA GVNSGGDARL VQAGERERLS PIERAQIIDA QAVHLLGTVE RLRGDAAAAK
AAQIKGLADA LAVREGRVTS IIRLRSQMLG ELALAEEALG DIGAADARFG EAVALLAVEY
PETTALASAR ARYAAFLTRQ GQDDKALGIY REVVGALAES QRSTVGMANM MAPYYRLLAA
RADSDPAALQ DFFVASQLQI RPGVADTQAV LARELSSGSD EGARLFRQAT TLNRDIERAR
IEDARLAQLP QSAEIAALRA DIRTQLDNLA FQQAETIVRL SAFPQYRVVA PGKLDLGELQ
AVLRDDEAYL KMLVVGDSVY AMLVESGGAM LWKSDIGAAD LERAVDAIRS TISIVENGRR
VTYPFDAATA RRLYGQLFGP VAARLPMVPH LIFEPDGAML RLPVNLLITS DTGLAAFEQR
VLDPEADPFD MRDIAWLGRT SRPSTAVSAL AFRNARQAAP SKAANQYFGL GENLPLGDRL
PSLGTRGAAG GMDGDCLWDA SQWARPISAD ELVTARNAMG ADAGALLTGG AFTDTAVKTR
DDLADYRIIH FATHGLVTAP RPACPARPAL VTSFGGQESD GLLTFQEIFD LRIDADLVIL
SACDTAGAAS VAATREAGLS GGGNALDGLV RSFIGAGGRS VIASHWPAPD DFDATTRLIS
GLFTADDGAS VADALWATQR RLMDDQQTSH PYYWAGFAII GDGAQPLLRG AQTARHGQAA
GRAAR