Gene Sala_1387 details


Gene Information

Locus tag: Sala_1387
Symbol:
ID: 4081859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1442455
End bp: 1443594
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 65%
IMG OID: 638009753
Product: cupin 2, barrel
Protein accession: YP_616434
Protein GI: 103486873
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3435] Gentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR02272] gentisate 1,2-dioxygenase
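
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Sala_1387.
length = gene_length(1442455, 1443594)
print(length)                  # 1140
print(protein_length(length))  # 379
```

Under these assumptions, 1443594 - 1442455 + 1 gives the listed 1140 bp, and 1140 / 3 - 1 gives the listed 379 aa.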


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.125216
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGGCC CTTCGACGAT CATCGATCCG CGCGACGACG TGCTCGGCCG GTCGCGCGTC 
ACCGACACGC CCGAGCTGGA GGCGTTTTAC GAAGAGCTCG CGGCGCGCAA CGCCGGCGCC
TTCTGGAAGC GCGCCAATGC GATCGAACCA TGGGAGCCCG CCACGCGCTA TCGCCCGACG
CTCTGGCGTT ATGCCGAGAT GCGCGCCATG TGCCTGCGCG CGCTTGATCT CGTAAGGCCC
GACGAAGCGG GGCGGCGCGT CGTCACCCTG CTCAACGACA GCGATGCGGG GCGCGAGAAT
GTCGCGGTGT GCGGCTGGCT GTTCAGCGGA ATGCAGGCGA TGCGCCCCGG CGAGATCACC
CCCGCGCACA AACACACGGC GTCGGCGCAC CGTTTCATCA TGGAGGGGAA GGGCGCCTAT
ACCGTTGTCG ACGGGCATCA CATCACGCTG GGTGCCAACG ACTATGTGCT GACCCCGAAC
GGCTGCTGGC ACGACCATGG CGTCGCCGCC GACGGCGAAG TGTCGATCTG GCAGGACGGG
CTCGACATCC CGCTGATGAA CAGCCTCGAA ACCAATTTCT ATGCCGTCTA CGACCAGCCC
GCGCAGACGG CAGCCTATCC GGCGGACGAT CTGCCGCTGA CCTATGGCGG CGCGGCGCTC
CGCCCCGAAG GCGTCGCGGC CTGGGAAAAA CCCTATTCGC CGGTGATGGT CTATCGCTGG
GAGGCCGTGC GCGATGCCTT GTTGAACCTT GCGAAAGTGT CGGTCGGGTC GCCCTTCGAC
GGTCATATGA TGCGCTATGC CAACCCGCTG ACCGGCGGCT GGGCGCTCCA GACGATGGGC
GCGCATATGC GGATGCTGCC CGGCGGTTTT CGCGGCAAGG CGCACCGCCA CACGGGCAAT
GTCGTCTATA ATGTCGCGCG CGGCCGCGGC TGTTCGATCA TCGGCGGTCA GCGGTACGAC
TGGCAGACAC ACGATATTTT CTGTGTGCCC GCGTGGACCT GGCACGAGCA TGTCAATCTC
GATGCCGCGG AAGAAGCCTT CCTCTTCTCG TTCAACGACT TCCCCGTGAT GGAGGCGCTC
GGCGTCCGGA TCGAGGAACC TTTCCCGAAA AACGACGGAC ATCAAATATG CGCTTCGTAA
 
Protein sequence
MTGPSTIIDP RDDVLGRSRV TDTPELEAFY EELAARNAGA FWKRANAIEP WEPATRYRPT 
LWRYAEMRAM CLRALDLVRP DEAGRRVVTL LNDSDAGREN VAVCGWLFSG MQAMRPGEIT
PAHKHTASAH RFIMEGKGAY TVVDGHHITL GANDYVLTPN GCWHDHGVAA DGEVSIWQDG
LDIPLMNSLE TNFYAVYDQP AQTAAYPADD LPLTYGGAAL RPEGVAAWEK PYSPVMVYRW
EAVRDALLNL AKVSVGSPFD GHMMRYANPL TGGWALQTMG AHMRMLPGGF RGKAHRHTGN
VVYNVARGRG CSIIGGQRYD WQTHDIFCVP AWTWHEHVNL DAAEEAFLFS FNDFPVMEAL
GVRIEEPFPK NDGHQICAS
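
As a sanity check on the two sequences above, the following sketch strips the 10-character grouping, computes GC content, and translates a nucleotide sequence into protein. It assumes the standard amino acid assignments, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons). The helper names are hypothetical, and only short fragments of the gene are used as a demonstration.

```python
# Standard codon assignments, listed in NCBI order with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence, with the original grouping kept.
print(translate("ATGACAGGCC CTTCGACGAT C"))  # MTGPSTI
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after `clean`) is a quick way to confirm that the two blocks belong together; `gc_content` on the full gene can likewise be compared with the GC content field.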