Gene Sala_2201 details


Gene Information

Locus tag: Sala_2201
Symbol:
ID: 4080159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2312828
End bp: 2314078
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 64%
IMG OID: 638010579
Product: hypothetical protein
Protein accession: YP_617243
Protein GI: 103487682
COG category: [S] Function unknown
COG ID: [COG2311] Predicted membrane protein
TIGRFAM ID:
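
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Consistency check for the coordinate and length fields of the record.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS ends with exactly one stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2312828, 2314078)
print(length_bp)                  # 1251
print(protein_length(length_bp))  # 416
```

Both results agree with the Gene Length and Protein Length fields listed above.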


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.964455
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.524788
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCGGG GAACCACGAC GCGCTACGAA AGCCTCGACG CGATCCGCGG GGTCGCGGTG 
ATGGGCATCC TTGCGATGAA CATCGTCGCC TTTGCCCTGC CCTTCCCCGC CTATGGCAAC
CCCGCCGCGG GCGGACCGCC CACCGACAGC GACGTCGCGA CATGGTTCTT CAACTTCGTT
TTCGTGGATT CGAAGATGCG CGGCATGTTT TCGATGCTGT TCGGGGCGAG CACCCTGCTG
GTGATCGAAA GCGCCGCCGT CGCGGGACGC AGCGGCGCGG GCGCGCATTA TTCGCGCATG
TTCTGGCTCG CGATCTTCGG CCTCGCGCAT TTCTATCTCA TCTGGTTCGG CGACATATTG
TTCCTTTATG CAATCTGCGG GCTGCTCATC TTCCTGTTCC GCAACCTGTC GGTGCGTGCG
CTCCTGCTCT GGGCGATCCC CTTTTTCCTC ATCGCTATCG GTCTGCACAC GAGCCTCTGG
GCGATGATGT CGATGGCACA GGCGGGAACG CTGCCGCCCG AAGCGGCCAC CGCGATGCAG
GAGGCGCTGC GGCAGATGAA CGCCGATATG GGCCCGTCCA CCCCCGTCTA TGCCGAAGAG
AAGGCGCTCT ATCTCGGCAG CTATGCCAGC ATCGTCGCAT ATCGCACCGG CGCGATGGCG
GGCGATCCGC TCTTCTTCCT CGGCCTGTTC CTGTGGGAAA CGGTGGGGCT GATGCTGATC
GGCATGGCGC TGTTCAAATC GCATATGCTG ACCGGCGAAT GGGAGGCGGC GCGCTATCGC
AAATGGGCGA TCGCCTGTTT TGCGATCGCC GTGCCGCCGC TCGTCGGGCT CGCCCTCTAT
CAGATGCGAA CGGGTTATGA CGCGGTATCG GTCTTCGGTT CGACGATCGC GCTGTCGGTG
CCCTTCGACA CGCTGATGAC GATCGGCTGG GCGGCGCTCA TCATGCTGCT GGTCAAGACA
GCGGCCAGCC ACGCCCTGCG CGCGCGGCTC GCGGCGGCGG GGCGCATGGC CTTCACCAAT
TATCTCGTCA CCTCGATCGT GATGACGACG ATATTTTACG GCTATGGGCT CGGCCTCTTC
GGCAGCATCG GCCGCCTGCC GCTCTATCTT TTCTGCATCG GCATGTGGGC GGCGATGCTG
CTGTGGTCAA AGCCCTGGCT CGACCGTTTT CAATATGGCC CGCTCGAGTG GCTGTGGCGC
AGCCTGTCGC GCGGGCAGGT GCAGCCGATG CGAAAACGCT TGCCGGGCTG A
 
Protein sequence
MNRGTTTRYE SLDAIRGVAV MGILAMNIVA FALPFPAYGN PAAGGPPTDS DVATWFFNFV 
FVDSKMRGMF SMLFGASTLL VIESAAVAGR SGAGAHYSRM FWLAIFGLAH FYLIWFGDIL
FLYAICGLLI FLFRNLSVRA LLLWAIPFFL IAIGLHTSLW AMMSMAQAGT LPPEAATAMQ
EALRQMNADM GPSTPVYAEE KALYLGSYAS IVAYRTGAMA GDPLFFLGLF LWETVGLMLI
GMALFKSHML TGEWEAARYR KWAIACFAIA VPPLVGLALY QMRTGYDAVS VFGSTIALSV
PFDTLMTIGW AALIMLLVKT AASHALRARL AAAGRMAFTN YLVTSIVMTT IFYGYGLGLF
GSIGRLPLYL FCIGMWAAML LWSKPWLDRF QYGPLEWLWR SLSRGQVQPM RKRLPG
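
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it builds the codon assignments of translation table 11 (which assigns amino acids to codons identically to the standard code), strips the spaces and line breaks used in the listing, and translates up to the first stop codon. It does not handle the alternative start codons that table 11 permits (for example GTG or TTG read as M at the first position); this gene starts with ATG, so that case does not arise here. The helper names `clean`, `gc_percent`, and `translate` are illustrative.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record (60 bases, 20 codons).
first_line = "ATGAACCGGG GAACCACGAC GCGCTACGAA AGCCTCGACG CGATCCGCGG GGTCGCGGTG"
print(translate(first_line))  # MNRGTTTRYESLDAIRGVAV
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.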