Gene Sala_1688 details


Gene Information

Locus tag: Sala_1688
Symbol:
ID: 4081091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1776993
End bp: 1778204
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 67%
IMG OID: 638010062
Product: imidazolonepropionase
Protein accession: YP_616734
Protein GI: 103487173
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID: [TIGR01224] imidazolonepropionase
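
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention, so treat both as assumptions. The variable names are illustrative only.

```python
# Values copied from the gene information fields above.
start_bp = 1776993
end_bp = 1778204

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1212
print(protein_length)  # 403
```

Under these assumptions the computed values agree with the listed gene length (1212 bp) and protein length (403 aa).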


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.823771
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCATC GGCGCGACAC TGTCCAGATG GAGCAACGCT GGACGAACGC CCGACTCGCG 
ACGATGGCCG GTGACGATCT CGGCCTGATC GACGATGGCG TGGTCGCGGC AAAGGATGGC
CGCATCATCT ATGCCGGTCC GGCGAAGGAC GCGCCTGCGA CGCAAGGTGC GGTCCATGAC
TGCGAGGGCC GCCTGATCAC GCCCGGCCTG ATCGACTGCC ACACCCATCT GATACATGGC
GGCAACCGTG CGAACGAATG GGCGATGAGG CTCGAAGGTG CGAGCTATGA CGAGATCGCC
CGCGCGGGCG GCGGCATCGT CTCGACGATG CGCGCGACGC GCGACGCGAG CGAAGCCGAA
CTGGTCGCGA GCGCCCTCCC CCGCCTCGAC GCGCTGATCG CCGAGGGTGT GACGACGATC
GAGATCAAGT CAGGCTATGG GCTCTCGACC AACGACGAGT TGAAGATGCT GCGCGCCGCG
CGCGCGCTCG CCAACATCCG CGCGATCCGC GTCGAGCCGA CCTTCCTCGG CGCGCACGCG
CTGCCTCCCG AATATCAAGG CGACAGCGAC GCCTATATCG ACCTCGTCGT GGGCGAGATG
ATCCCGGCCG TGGCCTCGCT AGCCACCGCG GTCGACGCCT TTTGCGAAGG CATCGGCTTT
TCGCCCGAAC AGTGCGCGCG CGTCCTTGCC GCCGCGAAGG CCCATGGGCT CAAGGTCAAG
CTCCACGCCG AACAATTGTC GGCGCTCCAC GGCAGCGCGC TCGCCGCGCG GCACGGCGCA
TTGTCCGCCG ACCATCTCGA ACATGCGACC GACGAGGATG TGCGCGCGAT GGCCGAAGCC
GGCAGCGTCG CGGTGCTGCT CCCCGGCGCC TATTATTTCA TGCGCGAAAC CAGACTGCCG
CCCGTCCAGG CAATGCGCCG CCATGGGACG CGCATCGCGC TGGCGACCGA CAACAACCCC
GGAACCTCGC CGACGAGTTC GCTGCTGCTG ATGCTCAACA TGGGCGCGAC CTTGTTCGGG
CTGACCGTCA TCGAAGCGCT GCGCGGCGTG ACGGTCAACG CCGCCGCCGC GCTCGGCCTG
TCGGCGGAGA TCGGCACGAT TGAGGTCGGC AAGGCCTGCG ACCTCGCCAT CTGGGACGTC
GGCGACCCCG CCGAGCTGGT CTATCGCATC GGCTTCAACC CGCTGCACCA ACGTATCAAG
GACGGACAAT GA
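
The GC content field in the gene information can, in principle, be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper, and it assumes the sequence is pasted in the 10-base blocks shown here, so whitespace is stripped before counting. For brevity it is applied only to the first 30 bases of the gene.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (block separators and line breaks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First 30 bases of the gene sequence, in the 10-base blocks used above.
fragment = "ATGACGCATC GGCGCGACAC TGTCCAGATG"
fragment_gc = gc_content(fragment)
print(f"{fragment_gc:.2f}")  # 0.60
```

Passing the full 1212 bp sequence to the same function gives a value that can be compared with the listed GC content; the rounding convention used by the record is not stated.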
 
Protein sequence
MTHRRDTVQM EQRWTNARLA TMAGDDLGLI DDGVVAAKDG RIIYAGPAKD APATQGAVHD 
CEGRLITPGL IDCHTHLIHG GNRANEWAMR LEGASYDEIA RAGGGIVSTM RATRDASEAE
LVASALPRLD ALIAEGVTTI EIKSGYGLST NDELKMLRAA RALANIRAIR VEPTFLGAHA
LPPEYQGDSD AYIDLVVGEM IPAVASLATA VDAFCEGIGF SPEQCARVLA AAKAHGLKVK
LHAEQLSALH GSALAARHGA LSADHLEHAT DEDVRAMAEA GSVAVLLPGA YYFMRETRLP
PVQAMRRHGT RIALATDNNP GTSPTSSLLL MLNMGATLFG LTVIEALRGV TVNAAAALGL
SAEIGTIEVG KACDLAIWDV GDPAELVYRI GFNPLHQRIK DGQ
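
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. Translation table 11 (bacterial, archaeal and plant plastid) uses the same codon-to-amino-acid assignments as the standard genetic code and differs in the start codons it permits, so a standard codon table is enough for a sketch. The code below is illustrative only: `translate` is a hypothetical helper, it does not handle alternative start codons or ambiguous bases, and it is applied to the first and last blocks of the gene rather than the whole sequence.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block separators
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


first_residues = translate("ATGACGCATC GGCGCGACAC TGTCCAGATG")
last_residues = translate("GACGGACAAT GA")
print(first_residues)  # MTHRRDTVQM
print(last_residues)   # DGQ
```

The two fragments reproduce the first ten residues (MTHRRDTVQM) and the last three residues (DGQ) of the protein sequence above, with the final TGA codon read as a stop.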