Gene Sala_1585 details

Gene Information

Locus tag: Sala_1585
Symbol:
ID: 4083022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1660411
End bp: 1662372
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 60%
IMG OID: 638009954
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_616631
Protein GI: 103487070
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
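
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable and function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1660411
end_bp = 1662372
gene_length_bp = 1962
protein_length_aa = 653

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1


def record_is_consistent():
    """Return True if the length fields agree with the coordinates."""
    return (
        span_bp == gene_length_bp
        and span_bp % 3 == 0
        and expected_protein_aa == protein_length_aa
    )


print(span_bp, expected_protein_aa, record_is_consistent())
```

Under these assumptions the span is 1962 bp, which gives 654 codons, or 653 residues once the stop codon is excluded, in agreement with the listed lengths.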


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.216403
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.968528
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTTGAAC GATTCTTCAC TGATGCTTCG ACCCAAATGA TCACGTTCGC TGCGCGTATG 
CTTTTCTGGC CGCGTTGGGG AAAGCTGCTG CTTGTTCTCG TTTGCGACGC GATTTTGGGA
CTGGTAGCGT CCGCGATGGC CTTTTCCATT CGGTTGGGCG AATGGAGTTC CGATGACTGG
CCGGTCCTGC GCTTTGGCCT CACCATGTTG TTGCTATGGT TTCCGATTGC CTTTTGGCGC
GGGATCTATT CGGCCATATT TCGATACGCC GGTCGCGGGG TATTGATCTC GCTGGCGGTG
GCCGTCGCCA TGATGACGGT GCCGCTGATC GTCATCTATA TGTATGTCGG TTATCCAGGT
GTGCCTCGGA CGATCGCCAT TCTCGGCCCG ATCCTGTTCC TGCTGCTTAT GGGAGTCGCG
CGGATCGTCG GCCGCTATGT GCTGGTGGAT CTTTTCCATT CGCGTGACTT TGTGGGGCGC
GAGCGGAACG TGCTCATCTA TGGTGCCGGC ACGACCGGTC AGCGATTGGC AGCATCTTTG
AGTTCCGAAG GCGGCCTGAG GGTTGCGGGC TTTCTTGACG ATGATCGTGA CAAGCGGGGC
AAGCGAATAG ACGGTGCCCG CATCTTTCAC AGCGACGATA TCGAAAGCGT GCTGAATCAG
CTCGGCGTCA CCGACATCGT TCTGGCGATG ACGCAGGTCG GCGATGAGCG GCGCAAGCAG
ATCATCCGGA ATCTGGCGCG GTTCAGCATA AACGTCCAGA TGCTTCCACC CGTCAGGGAC
ATTCTTGAAG GCAAGATTTC GGCAAGCGCC ATCCGTCCGA TAGAGGTCGA GGATCTGTTG
GGTCGCCCGC CGGTGGCACC CGACCGGGAA TTGTTGTCGC GGTCCGTCAA GGGCAAGCAT
GTGATGGTAA CCGGCGCCGG CGGTTCGATC GGCAGCGAAT TGTGCCGACA GATACTACGG
CTGGCACCGC ACTCGCTCAC CCTCGTCGAA TCCAGTGAGT TTTCCCTCTT CAGGCTTCAA
AACGAACTGG AAGCGATCCT CGATCGGCTC CCCGACGGTA TCCGGCCCCT TCTTCGGGCG
AAATTGTCGA ATGTTGCGGA TGCCGCAGCG GTGGAGCGCC TGTTCGCTGA TGAGGCGCCC
GACACCATCT ATCATGCCGC GGCGTACAAG CATGTGCCGC TGCTGGAGGA GAACCCGCTT
GACGGCGTCG CCAACAATAT CCGCGGCACA CGCAACATGG CCGAGATGGC CGTCAAAAAG
GGTGTCGGTC GCTTTATTCT GATCAGCACC GACAAGGCCG TGCGTCCTCC GAACGTGATG
GGGGCCAGCA AACGCGTCTG CGAGATGCTG CTGCAGGACA TGAGCCGGTC GCGGAAGCCC
GACGGCACCA TTTTCTCGAT GGTTCGCTTC GGCAACGTGC TGGGTTCGAG CGGATCGGTC
GTTCCGACCT TTCGACAACA GATCGAACGC GGCGGCCCGG TGACGGTGAC CCACCGCGAT
GTTACACGCT ATTTCATGAC GATCCCCGAA GCGGCCGAGC TGGTGATCCA GGCAGGCAGC
ATGGCGACGG GCGGCGAAGT GTTCCTGCTC GATATGGGGG AGCCGGTGCG CATCTGGGAC
CTCGCCGAAA CCATGGTGCG GCTGTCCGGC CTTACGATCC GTTCGTCCGC CAACCCCGGT
GGCAGCATCG AGATCGTCGA GCGCGGGTTG CGCAAGGGCG AAAAATTGTT CGAGGAACTT
CTGGTCGGCG AGGAATCGCA GCCCACCGCG CATCCGCGGA TCATGCAGGC GCGCGAAGAA
TGTGTCAGCC ACGAGTGCCT TTATGAATAT CTGACGGCAA TCGAGGCCGC GATTGCGGCG
GCCGATCCCC GTGGTTGCCG AGCCGCATTG AAACGGCTAG TGCCCACCCT GCACGATCAG
TCCCCAGCAA CCGTCCCCGC GCCGGAACAT TCAACTAGAT GA
 
Protein sequence
MFERFFTDAS TQMITFAARM LFWPRWGKLL LVLVCDAILG LVASAMAFSI RLGEWSSDDW 
PVLRFGLTML LLWFPIAFWR GIYSAIFRYA GRGVLISLAV AVAMMTVPLI VIYMYVGYPG
VPRTIAILGP ILFLLLMGVA RIVGRYVLVD LFHSRDFVGR ERNVLIYGAG TTGQRLAASL
SSEGGLRVAG FLDDDRDKRG KRIDGARIFH SDDIESVLNQ LGVTDIVLAM TQVGDERRKQ
IIRNLARFSI NVQMLPPVRD ILEGKISASA IRPIEVEDLL GRPPVAPDRE LLSRSVKGKH
VMVTGAGGSI GSELCRQILR LAPHSLTLVE SSEFSLFRLQ NELEAILDRL PDGIRPLLRA
KLSNVADAAA VERLFADEAP DTIYHAAAYK HVPLLEENPL DGVANNIRGT RNMAEMAVKK
GVGRFILIST DKAVRPPNVM GASKRVCEML LQDMSRSRKP DGTIFSMVRF GNVLGSSGSV
VPTFRQQIER GGPVTVTHRD VTRYFMTIPE AAELVIQAGS MATGGEVFLL DMGEPVRIWD
LAETMVRLSG LTIRSSANPG GSIEIVERGL RKGEKLFEEL LVGEESQPTA HPRIMQAREE
CVSHECLYEY LTAIEAAIAA ADPRGCRAAL KRLVPTLHDQ SPATVPAPEH STR
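
The protein sequence starts with M although the gene sequence starts with GTG. This is what one would expect under translation table 11, where GTG is among the permitted start codons and an initiating codon is read as methionine. The following minimal sketch illustrates this on the first and last blocks of the gene sequence. The function name `translate_cds` and the constants are hypothetical, and the code is not the method the database used to derive the protein.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine.
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "GTGTTTGAAC GATTCTTCAC TGATGCTTCG"
print(translate_cds(first_blocks))  # prints MFERFFTDAS
```

The result matches the first block of the protein sequence. Applied to the final twelve bases (TCAACTAGAT GA), the same function returns STR and stops at the TGA codon, which matches the end of the protein sequence.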