Gene Sala_1984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1984 
Symbol 
ID4081028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2092258 
End bp2093478 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content64% 
IMG OID638010360 
Productpolyhydroxyalkanoate depolymerase, intracellular 
Protein accessionYP_617028 
Protein GI103487467 
COG category[I] Lipid transport and metabolism 
COG ID[COG4553] Poly-beta-hydroxyalkanoate depolymerase 
TIGRFAM ID[TIGR01849] polyhydroxyalkanoate depolymerase, intracellular 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.309579 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTATC ACGCCTTTGA CATGCAGAAG AACTGGCTGG CAGGCGCCAG CGCGCTGGCG 
ACCGCGGGCG CGCAGGTGAT GCAGCACCCC GCGAATCCAC TCGGCTATTT CAGCGGCAGC
CCGATGTTTG CCTCGGCGCT CGAGGTCTTT GCCCACGCCG CTGCGCCGCG CGGCAAGCCG
GGGTTCGAGC TGTACGAAAC GACCGTCGAT GGCGAGACGG TGCGCGTGAC CGAGGCGATC
GAGGCGCGGA GGCCCTTTGG CCAGCTCAAG CATTTCAAGC ACAAGGGCGC GAAAGATGCG
CCCAGGCTGC TGATCGTCGC ACCGATGTCG GGCCATTATG CGACGCTGCT GCGCGGGACG
GTCGAACGGA TGCTGCCCGG CCACGACGTG TGGATCACCG ACTGGCGCGA TGCGCGCAAC
GTGCCGCTGG AAGCGGGCCG GTTCGACCTC GACGATTATA TCGACTATCT GATCGCCTGG
CTGGAGCATA TCGGGCCGGG CGCGCACATG CTTGCGGTGT GCCAGCCATC GGTGCCGAGC
CTGGCCGCCG CGGCGATCAT GGCGGCGAAC AGGCATCAAT GCCGTCCGAA GACGCTGACG
ATGATGGGAG GGCCGATCGA CACGCGCAAG GCGCCGACCG CGGTGAACGA ACATGCGACC
ACGCGCCCCT ATGCCTGGTT CCAGGAAAAT GTCATCGCGA CCGTTCCGGC CTATTATCCG
GGAGCGGGGC GGTGCGTCTA TCCGGGCTTC CTGCAGCTTG CCGGTTTCAT GTCGATGAAC
CTTGGCAACC ATATGATGAG CCATTGGGAG ATGTTCAAAC ATCTGGTCGA TGGCGACGGC
GAAAGCGCCG ACAAGACCAA GGAATTCTAT GACGAGTATC GCGCGGTGTG CGACATGACC
GCCGAATTTT ATTTGCAGAC GGTCGATGCC GTGTTCCAGC GCCACCTGCT GCCCAGGGGC
GCGTTCGAGC ACCGCGGCGA GCTGGTGGAC ATCGCAGCGA TCGAGGATAT TGCGATCCTG
GCGATCGAGG GCGAGCGCGA CGATATTTCG GGGATCGGCC AGACCAAGGC GGCGCTGACG
CTGGCCAGGG CACTGCCTGC GGAACGGAAG AAATATCTGC TGGCCAGGGG AGTGGGCCAT
TATGGCATTT TCAACGGCCG CAAGTGGCGC GAGGAGATTG CGCCGGTGGT CGAGCAATGG
ATCCGCGCGC ACGGCGGCTG A
 
Protein sequence
MLYHAFDMQK NWLAGASALA TAGAQVMQHP ANPLGYFSGS PMFASALEVF AHAAAPRGKP 
GFELYETTVD GETVRVTEAI EARRPFGQLK HFKHKGAKDA PRLLIVAPMS GHYATLLRGT
VERMLPGHDV WITDWRDARN VPLEAGRFDL DDYIDYLIAW LEHIGPGAHM LAVCQPSVPS
LAAAAIMAAN RHQCRPKTLT MMGGPIDTRK APTAVNEHAT TRPYAWFQEN VIATVPAYYP
GAGRCVYPGF LQLAGFMSMN LGNHMMSHWE MFKHLVDGDG ESADKTKEFY DEYRAVCDMT
AEFYLQTVDA VFQRHLLPRG AFEHRGELVD IAAIEDIAIL AIEGERDDIS GIGQTKAALT
LARALPAERK KYLLARGVGH YGIFNGRKWR EEIAPVVEQW IRAHGG