Gene Sala_0171 details

Gene Information

Locus tag: Sala_0171
Symbol:
ID: 4082929
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 174007
End bp: 175845
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 69%
IMG OID: 638008530
Product: para-aminobenzoate synthase, component I
Protein accession: YP_615228
Protein GI: 103485667
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
        [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
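
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the reported 1839 bp) and that the final codon of the CDS is a stop codon that contributes no residue. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(174007, 175845)
print(length)                     # 1839
print(protein_length_aa(length))  # 612
```

Both values agree with the Gene Length and Protein Length fields listed in the record.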


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.394655
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGACA GGATTCCCGT TCGGGGGCCG AATAGCCCCC CCTTCGTGCT GCTCGACGAT 
GCGCGCACCG ACGGGGCGGC TTGGGCGCGG CTCTTCGTCG ATCCGATCGA GGTACTGACC
GCTGCATCGG CCGCCGACGT GCCCGAGCTG CTCGCGGAGC TGGAGGCCGC TGTGGCGCGT
GGCCTCCATG TCGCGGGCTT CCTTGCCTAT GAAGCCGGGA AGGGACTCGG AACGGCGTGG
CGCGGTGCCG TCGATGGCGT GGTCGGCGCG ATGCCGCTCG GCTGGTTCGG CCTGTTCACC
GGGGTGCAGC GCATCGACTC CGATCATGTC GCAGATCTGC TGTCCGATCC GGCGTCGGCA
TGGATAGGCT GCGTGACGCC ACGGATCGAC CGCGCCGACT ATGTCGCCGC GGTCGAGGCG
GTGCTTGCCT ATATCCGCGC AGGTGACATT TATCAGGCCA ACCTCACCTT CCGCGCCGAT
GTGCCCGTCG CGGGGGATCC GCTCGCCGTT TACGCGCGGC TCCGGCAAAC GGCGCGGGCG
GGATATGGCG GCTTCATCTG GACGGGCGAG CAGGCGATCG CATCGCTGTC CCCCGAACTT
TTCTTTGCGC TGCGCGGGCG CGAGGTCATC GCGCGGCCGA TGAAGGGTAC CGCGACCCGG
CTCGCAGATG GCGCCGCCGA CGCGGCATTG GCGCGCGCGC TGGCCAAAGA CCCCAAGCAG
CGCGCGGAAA ATCTGATGAT CGTCGACCTG ATCCGCAACG ATCTGTCGCG CGTTGCGGCG
CCGGGCTCGG TCGCGGTGCC CGACCTGTTT CGCGTCGAAA GCTTCCCGAC GGTCCACCAG
CTTGTGTCCG ATGTCAGCGC CCGGCTGCCC GAGGGCGTCG GCGCCGTCGA CGTGCTCCGC
GCGGCCTTTC CCTGCGGGTC GATCACCGGG GCGCCCAAGG TGCGCGCGAT GGAAATCATC
GAGGCGCTGG AGAGCGAACC GCGCGGCCTT TATACGGGCT CGATCGGCTT TATCGAGCCG
GGCGGCGATG CCGCATTCAA CGTCGCGATC CGCACGCTCG TCTTTCCCCG CATTGCCACG
CAAGGCGGGT TGCAGGACGC ACCGTCATGC GCCACGCTGG GTCTGGGGTC GGGAATCGTC
GCCGACAGCG TGCCGGACGA AGAATGGCGC GAATGTCTGG CCAAGGGGGA ATTTGTGGGC
GCAGCGGGTG AGAGCTTCGA CCTGATTGAA ACCATGTTTT TCGATCCCGT GGAGGGGATT
CAGCGCCTCG ATGGGCATCT CGCGCGGATG AAGGCGAGCG CGGCGACGCT GGGCTTCGCC
TTCGATCGCC ACGGCGCGCG CAACAGCCTG CAATCTGCGA CCTTCCGCCT GCGCAATGCG
GCGCGCGTGC GAATGCGGCT CGCACCTTCG GGGGCGCTTG CGGTTGAGGT GTCGCCGCTT
CCCCGGCTCG CCGAACTGCC GGTACCCGTC GCGGTGCGTC CCGCGCCGAT GGGGCCGGGC
GATTTTCGCG TGGCGCACAA GACGAGCCTG CGCGCCCCTT ATGATATGGC GCGGCACGAC
AGCGGCGCCG CCGAAGTCGT GTTCGTCGAC GAGCCGGGCT TCGTTACCGA AGGAAGCTGG
AGCAATATCT TCGTCGAGCG CGACGGGGTG CTGTTGACCC CGCCGCTGGC GCTCGGACTG
CTTCCCGGCG TGCTGCGCGG CGAACTCATC GACAAGGGGC GCGCCGTCGA ATCGTACCTG
CGCCTTGCCG ACCTGTCCGC CGGCTTCTTC CTCGGTAACA GCCTGCGCGG CCTGGTGCCG
GCGCGGCTCG CCGAGCCTGC CGACGACGCG ACAGGCTGA
 
Protein sequence
MADRIPVRGP NSPPFVLLDD ARTDGAAWAR LFVDPIEVLT AASAADVPEL LAELEAAVAR 
GLHVAGFLAY EAGKGLGTAW RGAVDGVVGA MPLGWFGLFT GVQRIDSDHV ADLLSDPASA
WIGCVTPRID RADYVAAVEA VLAYIRAGDI YQANLTFRAD VPVAGDPLAV YARLRQTARA
GYGGFIWTGE QAIASLSPEL FFALRGREVI ARPMKGTATR LADGAADAAL ARALAKDPKQ
RAENLMIVDL IRNDLSRVAA PGSVAVPDLF RVESFPTVHQ LVSDVSARLP EGVGAVDVLR
AAFPCGSITG APKVRAMEII EALESEPRGL YTGSIGFIEP GGDAAFNVAI RTLVFPRIAT
QGGLQDAPSC ATLGLGSGIV ADSVPDEEWR ECLAKGEFVG AAGESFDLIE TMFFDPVEGI
QRLDGHLARM KASAATLGFA FDRHGARNSL QSATFRLRNA ARVRMRLAPS GALAVEVSPL
PRLAELPVPV AVRPAPMGPG DFRVAHKTSL RAPYDMARHD SGAAEVVFVD EPGFVTEGSW
SNIFVERDGV LLTPPLALGL LPGVLRGELI DKGRAVESYL RLADLSAGFF LGNSLRGLVP
ARLAEPADDA TG
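
Both sequences are laid out in space-separated blocks of 10 characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that step together with a GC-fraction calculation; the helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than on the full 1839 bp.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two blocks of the gene sequence shown in the record.
first_blocks = clean_sequence("ATGGCGGACA GGATTCCCGT")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.6
```

Applied to the complete gene sequence, the same two helpers give a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.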