Gene Sala_2863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2863 
Symbol 
ID4080656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3014196 
End bp3016034 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content70% 
IMG OID638011247 
Producthypothetical protein 
Protein accessionYP_617901 
Protein GI103488340 
COG category[R] General function prediction only 
COG ID[COG1287] Uncharacterized membrane protein, required for N-linked glycosylation 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.440889 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGG CGGCGGTAAC GCCGCCGGAC GGGATCCGGC TCGGCCGCTG GTTCACTCCG 
GCGCGGATCG CCCTGGTCGT CTGGGGGCTG ATGAGCCTGA TCGCGATCGC GGCGAACTGG
CAGGCGATTG GCGCGCTCAA GCTTGGCGAC ACCGACGACG CGATGCGCAT GGCGCAGGTG
CGCGACCTGC TCGCGGGGCA GGGGTGGTGG GATCTCACGC AATATCGGGT CAATCCGGCG
GGCGGCGGTG TGGTGATGCA CTGGTCGCGC CTGGTCGATG CGCCGCTGGC GGCGGGCATC
CTGCTCTTGA AACCCCTGTT CGGGCAGGTG ACGGCCGAAC GCATCGTGAT GGCCGTCTGG
CCGCCGCTGC TCGGCGCGGC GTTGAGCATC GCGTGCGCGC TCGGCTATCG CAACCTGTCC
GACCGCCGCA TCGCCTATGT TGCGCCGCTG TTCCTGATCA TGTCGGCTTA TATTCTTGTC
CAGTTCCGGC CGCTGCGTGT CGATCATCAC GGCTGGCAGA TATTGCTCGC GATGCTGATG
ATGGGACAGG CGCTGCGCCC AGCGTCGTGG CAGGCGGGAC TACTCGGCGG ATTCTTCGCG
GCCGCGCTGC TCGCGGTGTC GATTGAGGGG CTGCCGATCG TGACGCTGTT CGCCGCGCTC
GCGGCACTGC GCTGGGCGCT GCATGGCCGC GGCGACGAAC GTGCGCGACT TTGCGGCTAT
ATGGGCGCGC TCGCCGTCGG CGCGATCCTG TTCCAGTTCG CCACGCGCGG GCCTGCGGGC
CTTATGGGCA CCTGGTGCGA TTCGCTTTCG GCGCCCTATA TGGCGGCGTT CGTTGCGGCG
GCAATGGTGG TTTTCGCTGC GGTGCGCGTC GGCCCGACGC ATGTGCCCGC GCGTTTGCTG
TTGCTCGCTT TCGGTGGCGT CGCCGCGGCG GGCGCGCTGG TGTGGACCGA ACCGCTCTGC
GCGAAAGGCC CCTTTGCCAC GCTCGATCCC ATCGTCTTTG ACCTCTGGTA TCGCAACGTC
ACCGAAGGCC GGCCGCTCTG GACCGCGAAG GCGCACGACA TGATTCATGT CCTTGTGCCC
TCGCTCGTCG GGATCGCGGG CGCGCTGCTC GCGTGGCGCA GTGCGAAGGC GGCGGACGAC
CGGCGGAACT GGGCGACGGT GATTGCCGCG CTGTCGGCCG CCGCCACGCT GTCGCTGTTC
GTTTTCCGCA GCGTGTCGAC CGCGCATCTT TTCGCGCTGC CCGGCTGCGC GTGGCTTGGC
CTGCGCGCCT GGGCGTGGGC GCGCACCATT CGCTCGATCG GACCGCGCAT CCTTGCCTCG
GCGATGGCCG CGCTGACGTT GCCGCTGCTC GGCGGCATGG CGGTGGCGGC GCTGCTCAGC
CTTGCCGTGC CCGCTCTGCA AAAGGCGGAG GGGCGTGCCG CGGCCAGGGT CGCCGGCACC
GACGGCTATC GCACCGACTG CCTTGACCCA ACGGCCATCG CCGACCTCAA CCGCCTGCCG
CCCGCGACAT TGTTGACCCC GATCGACCTT GGCGCCCCGC TCGTCTTCTG GACGCCGCAC
CGGCTGGTCG CGACGCCGCA TCACCGCAAC AGCGAGGCGA TGGCCGACAC GATCCGCGCC
TTTGCGGGCG ACCCGGCGCG CGCCGAGGCG CTGGTGCGGC GGCAGCGGGC GACGCTGATC
GTCGTTTGCC GCACCGCCAA TGATTTCAAC AAATATCGCC ATGCGCGCCA GGATGGGCTC
GCCGCGCAGC TTTATGCCGG AACCCCGCCG GCGTGGCTCG AAGCGGTGCC GATCACCTCG
CGCGCGGGGC TGGCGCTCTG GCGCGTGAAG CCCGAGTGA
 
Protein sequence
MSEAAVTPPD GIRLGRWFTP ARIALVVWGL MSLIAIAANW QAIGALKLGD TDDAMRMAQV 
RDLLAGQGWW DLTQYRVNPA GGGVVMHWSR LVDAPLAAGI LLLKPLFGQV TAERIVMAVW
PPLLGAALSI ACALGYRNLS DRRIAYVAPL FLIMSAYILV QFRPLRVDHH GWQILLAMLM
MGQALRPASW QAGLLGGFFA AALLAVSIEG LPIVTLFAAL AALRWALHGR GDERARLCGY
MGALAVGAIL FQFATRGPAG LMGTWCDSLS APYMAAFVAA AMVVFAAVRV GPTHVPARLL
LLAFGGVAAA GALVWTEPLC AKGPFATLDP IVFDLWYRNV TEGRPLWTAK AHDMIHVLVP
SLVGIAGALL AWRSAKAADD RRNWATVIAA LSAAATLSLF VFRSVSTAHL FALPGCAWLG
LRAWAWARTI RSIGPRILAS AMAALTLPLL GGMAVAALLS LAVPALQKAE GRAAARVAGT
DGYRTDCLDP TAIADLNRLP PATLLTPIDL GAPLVFWTPH RLVATPHHRN SEAMADTIRA
FAGDPARAEA LVRRQRATLI VVCRTANDFN KYRHARQDGL AAQLYAGTPP AWLEAVPITS
RAGLALWRVK PE