Gene Sala_1918 details

Gene Information

Locus tag: Sala_1918
Symbol:
ID: 4082775
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2019746
End bp: 2020930
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 70%
IMG OID: 638010295
Product: hypothetical protein
Protein accession: YP_616963
Protein GI: 103487402
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5653] Protein involved in cellulose biosynthesis (CelD)
TIGRFAM ID:
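
The gene and protein lengths listed above follow from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the numbers in this record, but neither is stated on the page. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2019746, 2020930)
print(length_bp)                  # 1185
print(protein_length(length_bp))  # 394
```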


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.618119
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTCC ATCCTGCCTT TCCGGTCAGC AGCATCGACG CCGCGCCGCT GACGGTGCGC 
GTGGTCGATC CGCTGGCGCT GTCGGAGGGG CTGGCGGCGG CGTGGGACGG GCTGGCGGGC
GAGGCAAGCG AGCCCAATCC CTTTGCCGAA CGCTGGTGCC TGCAATCGGC ACTGCACCTG
CTCGATCCCG AACGCCAAGC GCGGCTTATC GTCGTGCAGG GCGGCGCGGA CGGGCCGCTG
ATCGGCGTGA TGCCGCTCGC ACCCGCGGCG CGTTACAGCC GCCTGCCGCT GCGTCACGCC
GTGGGCTGGG CGCACCCCAA TCATTTTCAC GGCGCGCCTC TGGTGCGCGC AGGTTTTGAA
AGCCTGTTCT GGTCGATCCT GCTCGGCTGG TGCGATGCGG CGCCCTGGGC GCGCACCTTG
CTGCATGTGC CGCGATTGAC CGAGGACGGG CCGCTCCACC GCGCGCTGAT CGATGCGGCG
CGGGGGCGTG GTGGCGAGGC CGTGGTCGTC CACCGCGAGG AGCGTGCGCT GCTCGCAAGC
GACCTCTCGC CCGCCGCCTA TTGGGACGCA GCGGTGCGCG CGAAGAAGCG CAAGGAATTG
AGGCGGCAGG CGAACCGGCT CGCCGATGAG GGTGTGGTGC AATTTCGCCG GTGGCAGGCG
GGCGATCCGC CGGGTCCGTG GATCGACGCC TTCCTCGCCC TGGAGGCGCG CGGCTGGAAG
GGGCGCGCGG GATCGGCGCT TGCGAGTAAC AGCGACACCC AGGCCTGGTT CCGCGCCATC
GTGCCCGCCG CCGCCGCGGC GGGGCGGCTC GACATGCGCG CGCTCGACCT CGATGGCCGC
CCGCTGGCAA TGCTCGTCAA CTTCCTGTGC CCGCCCGGCG GCTTTTCGTT CAAGACCGCG
TTCGATGAGG ATTATGCACG CTTTTCGCCG GGCGTCCTGT TGCAACAGGC GAATCTGGAC
CTGCTCGACG ACCCGCGCAT CGAATGGGTC GACAGCTGCG CCGCGCCCGG CCATCCGATG
ATCGACAGCG TCTGGCGCGA ACGCCGTGCG CTCGTCTGGG TCAACGTCCC GCTGACAGGG
CGCTCCGACC GGCTGCGTTT TGCGATGCTG ATGCGCGCCG AGCGAATGTG GCGGCGCTGG
AAGGGTGCCG CTCAGCACGC CGATGAAGTG GAAAGCCCGA CATGA
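
The GC content reported under Gene Information can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as plain text with the spacing and line breaks shown here. `gc_fraction` is a hypothetical helper name, and only the first line of the sequence is used for brevity.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 60 bases of the gene sequence, copied with the page's spacing.
first_line = "ATGACGGTCC ATCCTGCCTT TCCGGTCAGC AGCATCGACG CCGCGCCGCT GACGGTGCGC"
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full 1185 bp sequence instead of `first_line` gives the value to compare against the GC content field.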
 
Protein sequence
MTVHPAFPVS SIDAAPLTVR VVDPLALSEG LAAAWDGLAG EASEPNPFAE RWCLQSALHL 
LDPERQARLI VVQGGADGPL IGVMPLAPAA RYSRLPLRHA VGWAHPNHFH GAPLVRAGFE
SLFWSILLGW CDAAPWARTL LHVPRLTEDG PLHRALIDAA RGRGGEAVVV HREERALLAS
DLSPAAYWDA AVRAKKRKEL RRQANRLADE GVVQFRRWQA GDPPGPWIDA FLALEARGWK
GRAGSALASN SDTQAWFRAI VPAAAAAGRL DMRALDLDGR PLAMLVNFLC PPGGFSFKTA
FDEDYARFSP GVLLQQANLD LLDDPRIEWV DSCAAPGHPM IDSVWRERRA LVWVNVPLTG
RSDRLRFAML MRAERMWRRW KGAAQHADEV ESPT
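
The protein sequence can be cross-checked against the gene sequence. The sketch below assumes that the codon-to-amino-acid assignments of translation table 11 match those of the standard code (table 11 differs in which codons may act as start codons). It translates the first 60 bases of the gene sequence. The codon table is built from the conventional TCAG ordering, and `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, copied with the page's spacing.
first_line = "ATGACGGTCC ATCCTGCCTT TCCGGTCAGC AGCATCGACG CCGCGCCGCT GACGGTGCGC"
print(translate(first_line))  # MTVHPAFPVSSIDAAPLTVR
```

The result matches the first 20 residues of the protein sequence listed above.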