Gene Sala_2825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2825 
Symbol 
ID4080406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2979464 
End bp2980765 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content69% 
IMG OID638011209 
Productamidohydrolase 
Protein accessionYP_617863 
Protein GI103488302 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.886429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.061566 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCTT TCCATCGCTT GTCGCTCGCG CTGGTCGGCG CGGCGCTCGC CACGCCGCTG 
GCCGCCCAGA CGCCTGCGCG CACGGTGATC CACGCCGGGC ATCTGCTCGC CGAACCCGGC
AAGCCCGCGC GCGGCGCGGC GACGATCATT GTCGAGGGGG GCAAGATCGT CAGCATCGCC
GACGGCCACC AACCTGCCGA GGCGGGCGCG ACGCTGATCG ACCTCAGCGA CAAATATGTC
CTGCCGGGCC TGATCGACAG CCATGTCCAT CTGACGAGCG ACGCGGGCGG CCTCGCGGGC
CAGCTGGAGG AAGTGACGCT CAGCCCCGCC GCGCAGGCCT TCAACGCCGA GGTGAACGGG
ATGAAGACGC TGCGCGCGGG CTTCACGACG GTGCGCAACC TCGGCGACGG CGACGGTGCG
ACGCTGGCGC TCCGCGACGC GATCCTTGCG GGCAAGGTGC AGGGGCCGCG CATCGTCGAT
GCCGGCGCCA GCATCTCGGG CAGCGCCGGG CATATGGACG GTTCCCTGGG CTATCGCGAC
GAGCTGCGGC CCTTCTTCGC GGGGGCGGGC AATACCTGCA ACGGCGCCGA CGATTGCCGC
CGCGCGGTGC GGCTCCAGAT CGGGCGCGGC GCCGACGTGA TCAAATTCGC GTCCACCGGC
GGGGTCAACA GCCGGATCGG CGCGGGTCTC GGCAAGCAGA TGTTCGACGA CGAGGCGCAG
GCGATCGTCG ATACCGCGCA TCTGTTCGGG AAAAAGGTCG CGGTCCACGC GCATGGCGCC
GACGGCATCC GCCTCGCGCT CGCGGCGGGG GCCGATTCGA TCGAACATGG CACGATCCTC
GACGAGGCGA CGATCGCCCA ATGGGCGAAA TCGAAAACCT ATTATGTTCC GACGCTCTCG
ACCGTGAACG GGTACAAGGA ACGGCTCGCC GCCAATCCGG ACGCGTATGA ACCCGACGTG
CTGGCGAAGA TCCGGTGGCG CATCTCGATC ACCGGCAAGA GCCTCGAGCA GCTCGTCCCG
CGCGGGGTGA AGATCGCCTT CGGCACCGAC GCCGGCGTGT CGAAGCACGG CCGCAACGGC
GACGAGTTCG AACTGATGGT CCAGCACGGC ATGACGCCTC TCGAGGCGCT GAAGGCGGCG
ACGGTGAACG CCGCCGATCT GCTCGGCCTC GCCGACCAGA TCGGCAGCAT CGCGCCGGGC
AAGAGCGCCG ACATTATCGC GGTGGCCAGC GATCCGCTCG CCGACGTCCG CGTGCTCAAA
AAGGTCGATT TCGTCATGGC GCGGGGCAGG GTGGTCGACT GA
 
Protein sequence
MKPFHRLSLA LVGAALATPL AAQTPARTVI HAGHLLAEPG KPARGAATII VEGGKIVSIA 
DGHQPAEAGA TLIDLSDKYV LPGLIDSHVH LTSDAGGLAG QLEEVTLSPA AQAFNAEVNG
MKTLRAGFTT VRNLGDGDGA TLALRDAILA GKVQGPRIVD AGASISGSAG HMDGSLGYRD
ELRPFFAGAG NTCNGADDCR RAVRLQIGRG ADVIKFASTG GVNSRIGAGL GKQMFDDEAQ
AIVDTAHLFG KKVAVHAHGA DGIRLALAAG ADSIEHGTIL DEATIAQWAK SKTYYVPTLS
TVNGYKERLA ANPDAYEPDV LAKIRWRISI TGKSLEQLVP RGVKIAFGTD AGVSKHGRNG
DEFELMVQHG MTPLEALKAA TVNAADLLGL ADQIGSIAPG KSADIIAVAS DPLADVRVLK
KVDFVMARGR VVD