Gene Sala_1102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1102 
Symbol 
ID4082040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1131983 
End bp1133611 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content70% 
IMG OID638009464 
Producthypothetical protein 
Protein accessionYP_616152 
Protein GI103486591 
COG category[S] Function unknown 
COG ID[COG4655] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATCC GCTCGCGTCT CGCATCGCTG ATCCGCTGCC GCCGCGCGGG AATCAGCATC 
GCTGCGGCGA TCGGTATGCC GATGCTGATC GGCGCGGCGG CGCTCGCGGT CGATGTCGGC
TCGCTTTATC TCGACCGGCG CAAGCTTCAG GGCATCGCCG ATGCCGCGGC GCTGGCGGCG
GCGGGTCGCC CCGGCGAAGA GCGCGCGGCG GTCGAGCGGA TCATCGCCGC CAATTGCACC
TGCGTCATCC GTATCGAGGC GCTGACCGTC GGCACCTATA CCGCCGATCC GGCGCGCCCG
GCCGAGGCGC GCTTCGCAGC GGGCGGGGCG GCGCCTAACG CGGTGCGGAT CACGCTGTCG
CAGGACCGGC CGCTGTTTTT CGGCGGCTTC CTGACAGGAC GGCCTGACAG CATCATCCGC
GCGACCGCGA CGGGCGCGCG GCGCGGTTAT GCCGCCTTTT CGCTGGGGTC GCGCGTCGCG
GCGCTGAACG GCGGCGTGGC GAATGCGCTG CTGTCGGGGC TGACCGGCAG CGAGGTCAAC
CTGTCGGTGA TGGACTATAA CGCGCTTGCC AGCACCGACA TCGACCTGCT CGCCTTTTCC
GACGCCCTCA GGACCGAGAT CGACGCCGAT GTGCTGACCT TTGGCCAGAC GCTCGACAGC
CAGGTGACGC TGCCGCAGGT GGTGTCGGCG CTCGCGAGCG CTTCGAGCGG TGATGCGGCG
GCGGCGCTCG AGCGGATCGC CGATACCGCC TTGCCACGCG GCCTGATCCC CTCGCGCGCG
ATCGACCTTG GCCCGCGCGC GTCGAGCGTC CGCGTCGATG CCGCGAACCC GGTGAGGGTC
AATGCGCTGA GCCTGCTGCG CACGATGTTG CTGCTCGGCA GCGCGAACCG ACAGGTCGAC
CTGTCGCTCG CGAGCGAACT GCCCGGCGGA TCGGGGATCG ACGTCGCGCT GCTGATCGGC
GAACCGCCTG CGGATTCGCC GCTGATCGCC GTCACCGATA CGAACGATGT GATCGTCCGC
ACCGCGCAGG TGCGGCTCAA AATCGATACG CGCATAGCGA CACCGCTGGC GAGTGTTCGC
GTCCCGCTGC TCGCCGAACT GGGCTCCGCC TCTGCGCGGA TCACCGATAT CGATTGCGCC
CCGAACAGCA GTGCCGCGGT GACGCTCGGC GTCGTCACGT CGCCCGCCAT GGTGGCGATC
GGCACGGTCG ACGATGGCGA TTTCGCCGAC ATGCGGCGCC GGCTCGACCC GATGCCCGCG
CGTCTCGTCA AACTGCCGCT CGTCAGCATC GACGCGCAGG CTGAAATGAC GCTGTCGGAC
CTCAACGAAA AGCCCGTCGC CTTTTCGCGC GGCGAGATCG ACGACGGGAG GGTGAAGACG
GTGTCGAGCA GCGGGCTGGT CGCGGGCGCG GCCGAATCGC TGTCCGACGA GCTGGAGCTC
GACGTCAATG TGGTGGGGCT GGGGCTCAAT CTCGGCGCGC TGACCTCTGC GGTCGGCGAC
ACGGTCGCGC TCGCTGCGCC CGTCATTGAC GGTGTCCTCG GCGACCTCAC CGGTCTGCTC
GGTCTGCATG TCGGGCAGGC CGATACGCGG ATCAATGCGC TGCGCTGCGG CCGTGCGCGG
CTGGTGTGA
 
Protein sequence
MAIRSRLASL IRCRRAGISI AAAIGMPMLI GAAALAVDVG SLYLDRRKLQ GIADAAALAA 
AGRPGEERAA VERIIAANCT CVIRIEALTV GTYTADPARP AEARFAAGGA APNAVRITLS
QDRPLFFGGF LTGRPDSIIR ATATGARRGY AAFSLGSRVA ALNGGVANAL LSGLTGSEVN
LSVMDYNALA STDIDLLAFS DALRTEIDAD VLTFGQTLDS QVTLPQVVSA LASASSGDAA
AALERIADTA LPRGLIPSRA IDLGPRASSV RVDAANPVRV NALSLLRTML LLGSANRQVD
LSLASELPGG SGIDVALLIG EPPADSPLIA VTDTNDVIVR TAQVRLKIDT RIATPLASVR
VPLLAELGSA SARITDIDCA PNSSAAVTLG VVTSPAMVAI GTVDDGDFAD MRRRLDPMPA
RLVKLPLVSI DAQAEMTLSD LNEKPVAFSR GEIDDGRVKT VSSSGLVAGA AESLSDELEL
DVNVVGLGLN LGALTSAVGD TVALAAPVID GVLGDLTGLL GLHVGQADTR INALRCGRAR
LV