Gene Sala_3165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_3165 
Symbol 
ID4082501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3314648 
End bp3315829 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content64% 
IMG OID638011550 
ProductRieske (2Fe-2S) region 
Protein accessionYP_618201 
Protein GI103488640 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATGC TGCGCGAAAC CTTCGACAAC ATCGACCCGC TCGATGGCTG GTCGCTCCCG 
GCGTGGACCT ACAGCGACCC CGATTTCTAC GCCGTCGAAA TGGCGCGCAT CTTCCGCCCC
AGCTGGCAGG TCGTCTGCCA TGACAGCGAC ATCGCGAACC CCGGCGACTG GCACAGCATC
GACTATTGCG GCGAAAGCAT CATCCTTGTG CGCGGAACCG ACCGCATCGT GCGCGCCTTC
ACCAACGTCT GCCGCCACCG CGGCTCGCGC CTCGTCGATG GTGCCGCGGG CTGTGCCAAA
AAGCTCGTCT GCCCCTATCA CGCCTGGACC TATGAACTCG ACGGCCGACT GACGGGCGTT
CCTGATTCGG CGAGCTATCC GACGCTCGAC AAGGGCAGGG CGGGGCTCGT CGGCGTCGAG
GCCGAACAAT GGCGCGGCTT CTGGTTCGTC CGGCTCGATG ATGACGGCGG GCCGTCGGTC
GCCGACATGA TGGCGCCCTA TGAGACGACG GTTGAGCCGT ATCGTTTCGA GGAACTCGGC
GCGCTCGGCC GCGTCACGCT TCGCCCGCGC GCGGTCAACT GGAAAAATGT CGGCGACAAT
TATTCGGACG GCCTTCACAT CCCCGTCGCG CATCCGGGCC TGACCCGGCT TTTTGGCAAA
AGCTATGGCG TCGAGGCCAG GGAGCGCGTC GATCGCATGT GGGGCGACCT CGTCGACCGG
CCGTCGGCGA ACTGGTCCGA ACGTCTGTAC CAGCGGCTGT TGCCGCCGAT TCCGCACCTG
CCCGCCGACC GCCAGCGCCA CTGGCTCTAT TTCAAGCTGT GGCCCAATGT CGCCTTCGAC
ATCTATCCCG ACCAGGTCGA TTTCATGCAG TGGCTGCCGA CCGGTCCGAC GAGCTGTCTG
ATCCGCGAAA TCTCCTATGT GCTGCCCGAC GCCTATACTG GAGAGTGGCG CCACGAAATG
CGCGCCGCGC GCTACCTCAA CTGGCGCATC AACCGTCAGG TCAATGCCGA GGACACGGCG
CTCATCACGC GCGTCCAGCA GGGCATGCAA TCGCAGAGCT TCTCGATGGG ACCGCTCAGC
GACAAGGAAG TCTGCCTCAA ACATTTCTGC GCGCGGATGC GCGCCATCAT TCCCGAAGCG
CGGCTCGAGC ACGCGCCTGC CGCCGGTTGG AGCAATAAAT GA
 
Protein sequence
MAMLRETFDN IDPLDGWSLP AWTYSDPDFY AVEMARIFRP SWQVVCHDSD IANPGDWHSI 
DYCGESIILV RGTDRIVRAF TNVCRHRGSR LVDGAAGCAK KLVCPYHAWT YELDGRLTGV
PDSASYPTLD KGRAGLVGVE AEQWRGFWFV RLDDDGGPSV ADMMAPYETT VEPYRFEELG
ALGRVTLRPR AVNWKNVGDN YSDGLHIPVA HPGLTRLFGK SYGVEARERV DRMWGDLVDR
PSANWSERLY QRLLPPIPHL PADRQRHWLY FKLWPNVAFD IYPDQVDFMQ WLPTGPTSCL
IREISYVLPD AYTGEWRHEM RAARYLNWRI NRQVNAEDTA LITRVQQGMQ SQSFSMGPLS
DKEVCLKHFC ARMRAIIPEA RLEHAPAAGW SNK