Gene Sala_1603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1603 
Symbol 
ID4082755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1679910 
End bp1681100 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content66% 
IMG OID638009972 
Productphospholipase D/transphosphatidylase 
Protein accessionYP_616649 
Protein GI103487088 
COG category[I] Lipid transport and metabolism 
COG ID[COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.420349 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.113302 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGATC CCTGCCCCAC TGCCGATCCG ATGCAAGGGC TGACGGCGGA CGTGGCGGAT 
CATCGGATCG AGTTGATCTT CGACGGCGGC GAGCGGATGA CGCGCCTGCT CCGACTGATC
GATCGGGCGG CGCACAGCAT CGACCTCATC ATCTATATAT TTGAAGGCGA CGCCGCAGGA
CTCAGCGTCC TGCGCGCGCT GACCGCCGCG GCGCGGCGCG GCGTGCGCGT GCGCGTGTTG
ATCGACAGCT TCGGGTCGGG CGACACGCCC GACGCGCTGT TCGCGCCGCT GCGCGAAGCG
GGCGGCGGCG CCACCTTTTT CTCGCGCCGC TGGCGTTCGT CCTATCTCAT CCGCAATCAC
CAGAAGCTGA TCCTGATCGA CAATGCGGTT GCCATGACGG GGGGCTTCAA CATCGCCGAC
GATTATCTGA GCGCGCCGCG CAGCGATGGC TGGCTCGACA TCGGCATGAT CGTCGAGGGG
CCGAGCGTCG CGCGCGCGGC CGACTGGTTT GCCGAAATCC ATGATTATAC GGTCAGCAAC
GACGGTAAGC TCCTGATATT GCGGCGGCTG ATCCGCGAAT GGCCGGTCGA TGGCGGCGCA
GTGTCATGGC TTGTCGGCGG GCCGACGCAG CGGCTTTCGC CCTGGGCACG CGCCGTGCGC
GCCGACCTGA ACGATGCGCG TCAACTCGAC ATGGCAATGG CCTATTTTTC ACCCGGCCAG
GGCCTGCTCC GCCGCCTCGG CCGCGTCGCG CAGCGCGGTC GCGCGCGCTT CATCATGGCG
GGCAAGTCCG ACAATGGCGC CACGATCGGG GCCTCGCGCC TTCTCTATGG CTATCTGCTT
CGCAAGGGCG CCGACGTGTG GGAATATCGG CGGTGCAAGC TGCACATGAA GCTGATCGTC
GTCGACGATG TGGTCTATAT CGGCTCGGCC AATTTCGACG TGCGCAGCCT GTTCGTCAAT
GTCGAGCTGA TGGTGCGGAT CGCCGATGCC GGCTTTGCGG CGCAGATGCG GCGCTTTGTG
GCGGGCCTCC AGCCCGATTG CGACATCATC ACCGCCGAGG CGCACAAGGC ACGCGCAAGC
TGGTGGACGC GGCTGCGCTG GACACTCGCC TGGTTCGTCG TCGGCGTCGC CGATTATACG
GTGTCGCGCA AACTCAACTT CGGCCTCGGT GACCCCGATC CCGATGTCTA G
 
Protein sequence
MADPCPTADP MQGLTADVAD HRIELIFDGG ERMTRLLRLI DRAAHSIDLI IYIFEGDAAG 
LSVLRALTAA ARRGVRVRVL IDSFGSGDTP DALFAPLREA GGGATFFSRR WRSSYLIRNH
QKLILIDNAV AMTGGFNIAD DYLSAPRSDG WLDIGMIVEG PSVARAADWF AEIHDYTVSN
DGKLLILRRL IREWPVDGGA VSWLVGGPTQ RLSPWARAVR ADLNDARQLD MAMAYFSPGQ
GLLRRLGRVA QRGRARFIMA GKSDNGATIG ASRLLYGYLL RKGADVWEYR RCKLHMKLIV
VDDVVYIGSA NFDVRSLFVN VELMVRIADA GFAAQMRRFV AGLQPDCDII TAEAHKARAS
WWTRLRWTLA WFVVGVADYT VSRKLNFGLG DPDPDV