Gene Sala_1098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1098 
Symbol 
ID4082036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1125857 
End bp1129093 
Gene Length3237 bp 
Protein Length1078 aa 
Translation table11 
GC content67% 
IMG OID638009460 
Productamidohydrolase 
Protein accessionYP_616148 
Protein GI103486587 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGCT TCGCTCTAGC CCTTGCGGCG ACACTGGCTT GTTCAACCTC CGTCTGGGCG 
CAGATTGCAC CCGCTCCCGC CGCTGCGACC AAAGAGGACA AATGGGACGT CAACGCCCCG
CCGGGCATGA CGACGCGCAA GGTGCCGATC GCGGTCGACG AGGGCAGCTG GATGAACGTC
GATGTTGCCC CCGACGGCCG CACCATCGCC TTCGACCTTT TGGGCGACAT CTATACGGTG
CCGATCGAGG GCGGCACGCC GACGCGCATC GCCGAGGGGC TCGCCTATGA GCATCAGCCG
CGCTTCGCGC CCGACGGCAA GCGCATTGCC TTCGTCTCCG ACCGCGGCGG CGGCGACAAT
GTCTGGATCA TGAACCGCGA CGGCAGCGAC AAAAGGCAGG TGTCGAAAGA GGATTTCCGC
CTGCTCAACC AGCCGAACTG GTCGCCCGAC GGCCAGTTCC TCGTCGCCAA AAAGCATTTC
ACCACCAGCC GCTCGCTCGG CACGGGTGAG GTGTGGATCT ATCATGTGTC GGGCGGCGCC
GGCGTGCCGC TGGTCAAGAA ACCCGATGAA CGGCATCAGA AGGAACTGGG CGAACCCATT
TTCGCGCCCG ACGGCAAAAG CGTCTATTAC ACGCGCAACG TCACGCCGGG GCCGATCTTC
GAATATGCGC AGGACAGCAA CCGCGACCTG TTCCACATCG AACGCTACAG CCTCGAGGAC
GGCAGCATCA GCACCGTGGC GTCGGGCAAT GGCGGCGCGG TGCGCCCGAC CCCGTCACCC
GACGGCAAGC GCCTCGCCTT CGTCCGCCGC GAAGGCGCGC GCTCGAAGCT CTATGTAAAG
GATCTCGCGT CGGGCGCCGA AACGAAGCTT TACGACGCGC TCGACCAGGA TGTGCAGGAA
ACCTGGGCGG TCACCGGCGT CTATCCGAAC ATGGCGTGGA CCCCCGACAG CCGCGACATC
CTCTTCTGGG CGGGCGGCAA GCTGCGCCGC GTCGGCGCCG GCGGCGGCGA GGCGCGCGTC
ATTCCGTTCA GCATCGACGA CGATCGCGTG ATCGTCGACG CGGCGCATCC GGCAGTCGAG
GTCGCCCCCG ACAGCTTCAC CACCAAAATG CCGCGCTGGG CCGAAGTATC GCCCGACGGG
CGCAGCATCG TCTTTGAAAC GCTCGGCAAG CTGTGGGTCA AGCCCGCGAC CGGCGGCACC
GCGCGGCGGC TGACCAGCGC GAAGGACGCG GCGATGGAGG CGTGGCCGAG CTGGTCGCGC
GACGGCAAGT CGATCGTCTT CGTGCGCTGG ACCGATGCAG GCCTCGGCGA AATCCATGTG
ACCGGCGCCA GCGGCGGAAG CTCGCGCAAG GTGACGGCAA CGCCCGGCCA TTATGCCGAA
CCGCGCTTCT CGCCCGACGA CAGGACGATC GTTTTCGAGT GGCGCCGCGG CGGCGGGCTG
GTTTCGGAAC GCTGGGGTGA AGACCCCGGC GTCTATCGCA TCGCGGCGAC CGGCGGCACC
GCGGAACGGA TCAGCCGCGA CGGGGCCAAA CCGCAATTCG GCGCCGCGAA CGACCGCGTC
TTCATGGTCG CGTCGACCGA CGGCAAGAGC CAGCTCGTCA GTACGGACCT GGACGGCGAG
GACAAGCGCG TCCACGCGAG CGGCGAGCTG GTCAGCGACT ATGAAGTATC GCCCGACGGT
CGCACCCTCG CCTTCCGACA GAATTACGAC GCCTATGTCA CGCCGCTGAT GCCCGGCGGG
CAGGATGTGT CGCTCGGCAT CAAGAGCGGG GCGCTGCCCG TGACGCGCGT GTCGGGCAGC
GGCGCCGACT ATATCCACTG GTCGGACGGC GGCCGCCGCC TGCACTGGAG CCGCGGCGCG
ACCTTGTTCA GCGCCGATCT GGCGAACCTC TTCGCCAATG CTCCCGTCGA CGACAAGGCG
CCGAAGTTCA CGCCGCCGAC CGATGGCGTG TCGCTGGCAA TGACGCAGGC GGCGGCGAAG
CACAAGGGAA CGGTCCTTAT CACCGGCGCC AGGATCGTCA CCATGGCCGA CAAGGACGGC
GGAGTGATGG AGAATGGCGC GATCCTGATC GAGGACGACC GCATCGCCGC GATCGGCCCC
GCGGGCGCGA TCACCATTCC CGCAGGCGCG GTGACGATCG ACGCCACCGG CAAGACGATC
GTTCCCGGCT TCGTCGACGC CCATGCGCAT GGCCCGCACG GCGCCGACGA ACTGGTGCCG
CAGCAGAATT GGTCCGAAAT CGCCAATCTG GCGATGGGCA CGACGACCAG TCACAATCCC
TCGTCGCGCG CGTCGGAAAT CTTCGTTTCG TCCGAAATGC AGCGCGCGGG GCTGATCCTC
GCGCCGCGCA TCTTTTCGAC CGGCGAGATC ATCTATGGCG CGAAGGCGGC GGGCGTCTAT
GCCGAGATCA ACGGCTATGA CGATGCGCTC GCACACGTCC GGCGGCTGAA GGCGCAGGGC
GCTCACAGCG TCAAAAACTA TAATCAGCCG CGGCGCGACC AGCGGCAGAT GGTCGTCAAG
GCGGCACAGG CGGAGGGGCT GACCGTCGTG CCGGAGGGGG GATCGCTCTA CACGATGGAC
GTCTCGCTCA TCCAGGACGG CAATGCGACC GTCGAGCATA ATATCCCGCT GCACGTCTTC
TACCGCGATC TCGTGCAGCT TTGGGGCCAA ACGCAGGTCG ATTACACGCC CACGCTGGTC
GTCACCTATG GCGGCCCCGC GGGCGACCCC TATTGGCGCG CGCACACCAA TGTGTGGGAG
CATCCGATCC TGACGAAGCA TATCCCGCCG ACCGAACTCG CCGCCAACAA CAAGCGCCGC
GTGATCGCGC CCGAAAGCGA CTATGTCGAC GACGATGCCG CGCGCCAGGC CGGCAAGATC
GCCGCGGCGG GGCGCATGGT GTCGATCGGC GCGCACGGCC AGCAGGCGGG CCTTGGCGCG
CATTGGGAAA TCTGGTCGTT CGTGCGCGGC GGCTGGAGCA ACATCGATGC GCTCCGCGCC
GCGACGATCA TGCCGGCCAC CTCGCTCGGC TATGCGAAGG ACGTGGGCTC GCTCGAGGTT
GGAAAGCTCG CCGACCTGCT TATCATCGAC GCCGACCCGA CCGAAAACAT CCGCAATACC
GAGCGAATCC ACCGCGTCAT GCTCGGCGGA CGACTCTATG ATCCCATCAC CATGAACGAG
GCCGAAACCG GGTCGCGCAA GCGCGATGCC TATTGGTGGG AAACGGAAAA GCCCTGA
 
Protein sequence
MARFALALAA TLACSTSVWA QIAPAPAAAT KEDKWDVNAP PGMTTRKVPI AVDEGSWMNV 
DVAPDGRTIA FDLLGDIYTV PIEGGTPTRI AEGLAYEHQP RFAPDGKRIA FVSDRGGGDN
VWIMNRDGSD KRQVSKEDFR LLNQPNWSPD GQFLVAKKHF TTSRSLGTGE VWIYHVSGGA
GVPLVKKPDE RHQKELGEPI FAPDGKSVYY TRNVTPGPIF EYAQDSNRDL FHIERYSLED
GSISTVASGN GGAVRPTPSP DGKRLAFVRR EGARSKLYVK DLASGAETKL YDALDQDVQE
TWAVTGVYPN MAWTPDSRDI LFWAGGKLRR VGAGGGEARV IPFSIDDDRV IVDAAHPAVE
VAPDSFTTKM PRWAEVSPDG RSIVFETLGK LWVKPATGGT ARRLTSAKDA AMEAWPSWSR
DGKSIVFVRW TDAGLGEIHV TGASGGSSRK VTATPGHYAE PRFSPDDRTI VFEWRRGGGL
VSERWGEDPG VYRIAATGGT AERISRDGAK PQFGAANDRV FMVASTDGKS QLVSTDLDGE
DKRVHASGEL VSDYEVSPDG RTLAFRQNYD AYVTPLMPGG QDVSLGIKSG ALPVTRVSGS
GADYIHWSDG GRRLHWSRGA TLFSADLANL FANAPVDDKA PKFTPPTDGV SLAMTQAAAK
HKGTVLITGA RIVTMADKDG GVMENGAILI EDDRIAAIGP AGAITIPAGA VTIDATGKTI
VPGFVDAHAH GPHGADELVP QQNWSEIANL AMGTTTSHNP SSRASEIFVS SEMQRAGLIL
APRIFSTGEI IYGAKAAGVY AEINGYDDAL AHVRRLKAQG AHSVKNYNQP RRDQRQMVVK
AAQAEGLTVV PEGGSLYTMD VSLIQDGNAT VEHNIPLHVF YRDLVQLWGQ TQVDYTPTLV
VTYGGPAGDP YWRAHTNVWE HPILTKHIPP TELAANNKRR VIAPESDYVD DDAARQAGKI
AAAGRMVSIG AHGQQAGLGA HWEIWSFVRG GWSNIDALRA ATIMPATSLG YAKDVGSLEV
GKLADLLIID ADPTENIRNT ERIHRVMLGG RLYDPITMNE AETGSRKRDA YWWETEKP