Gene Sala_1386 details

Gene Information

Locus tag: Sala_1386
Symbol:
ID: 4081858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1441559
End bp: 1442467
Gene Length: 909 bp
Protein Length: 302 aa
Translation table: 11
GC content: 63%
IMG OID: 638009752
Product: 5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase
Protein accession: YP_616433
Protein GI: 103486872
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID:
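
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1441559, 1442467)
print(length)                  # 909
print(protein_length(length))  # 302
```

Both values agree with the Gene Length and Protein Length fields of this record.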


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.221337
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTTCG TAACATATCG CACGGTCGAA ACCGAACCGC GGCTGGGCCT TCTCCACGAC 
GGCCTTGTGA TCGATGTCGA TTATTTCGGC GATGCGATCG GGCAGGATTT GCCATCGACG
ATGCTCGATT TCATCGACCT GGGACCGATC GGCCCGCGCT TCCTGCGCGA AGCGGTCGAA
AGCGCGACGC CCGCCGACCT GCTCGGCACT TCGCTGCCCC AGGGCAATGT CACCTTGCTC
GCGCCGATCC CGCGCCCGCG CAAGAATATC TTCGGCATCG GGCTTAACTA TACCGAGCAT
GTCGCCGAAT CCGCCCGCGC GCTCGATACC GCGCACGAAC TGCCGCAGCA GCCGGTGATC
TTCTCGAAGC CGCCCACGGC CGTCGCCGCG TGGAACGACC CGATCCGTCA CAATGCGAAA
GTGACACAGC AACTCGACTG GGAAACTGAA TTGGCGGTGA TCATCGGCAG TACCGCGCGC
GGCGTGGCCG AGGCCGACGC GCTGAACCAC GTGTTCGGCT ATACCGTCAT CAACGATGTG
TCGGCGCGTG ACTGCCGCCG CGCCGGGCAA TGGATCGTCT CGAAAGGGCA GGACAGCTTT
GCCCCCATGG GGCCATGCAT CGTCACCGCC GACGAGATCG GCGACCCGCA TAATCTCAAT
ATCCTCACCC ATGTGAACGG AGTGGAAAAG CAGAACAGCA ACACGCGCTT CATGCTGTTC
AACGTGCCCC AGCTGATCGC TGACATTGCC CGTGTGATGA CGCTCGAACC CGGCGACATC
ATCGCGACCG GAACGCCCGC CGGGGTCGGC GCGGGGCGCG ATCCGCAGGA GTTTCTGTGG
CCCGGCGATG TCGTCGAATG CACCGTCGAA GGCATCGGCA CACTCCGCAA CCCGGTTGTC
GCGGTCTGA
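
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is the fraction of G and C among the A/C/G/T bases; how the database rounds the reported 63% is not stated in this record, and `gc_fraction` is a hypothetical helper name. Spaces and line breaks in the pasted sequence are ignored.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among the A, C, G, T characters in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First 10-base block of the gene sequence: 6 of 10 bases are G or C.
print(gc_fraction("ATGCGCTTCG"))  # 0.6
```

Passing the full gene sequence to the same function gives the whole-gene value to compare against the reported figure.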
 
Protein sequence
MRFVTYRTVE TEPRLGLLHD GLVIDVDYFG DAIGQDLPST MLDFIDLGPI GPRFLREAVE 
SATPADLLGT SLPQGNVTLL APIPRPRKNI FGIGLNYTEH VAESARALDT AHELPQQPVI
FSKPPTAVAA WNDPIRHNAK VTQQLDWETE LAVIIGSTAR GVAEADALNH VFGYTVINDV
SARDCRRAGQ WIVSKGQDSF APMGPCIVTA DEIGDPHNLN ILTHVNGVEK QNSNTRFMLF
NVPQLIADIA RVMTLEPGDI IATGTPAGVG AGRDPQEFLW PGDVVECTVE GIGTLRNPVV
AV
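
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is illustrative: it builds the standard codon table, which has the same codon-to-amino-acid assignments as translation table 11 (the two differ in which start codons are permitted), and translates the first three blocks and the final nine bases of the gene sequence. The names `CODON_TABLE` and `translate` are assumptions for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, ignoring whitespace in the input."""
    seq = "".join(dna.split()).upper()
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


print(translate("ATGCGCTTCG TAACATATCG CACGGTCGAA"))  # MRFVTYRTVE
print(translate("GCGGTCTGA"))                         # AV*
```

The first output matches the opening ten residues of the protein sequence, and the second matches its final two residues followed by the stop codon.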