Gene Saro_3789 details


Gene Information

Locus tag: Saro_3789
Symbol:
ID: 5077937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 433184
End bp: 434911
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 66%
IMG OID: 640481512
Product: bacteriocin/lantibiotic ABC transporter
Protein accession: YP_001166174
Protein GI: 146276014
COG category: [V] Defense mechanisms
COG ID: [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain
TIGRFAM ID:
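
The start, end, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon. The variable names are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 433184
end_bp = 434911
gene_length_bp = 1728
protein_length_aa = 575

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, so a CDS of n codons
# encodes n - 1 amino acids.
codons = span // 3
residues = codons - 1

print(span == gene_length_bp)         # span matches the reported gene length
print(span % 3 == 0)                  # span is a whole number of codons
print(residues == protein_length_aa)  # codon count minus stop matches protein length
```

Under these assumptions all three checks agree with the record: 1728 bp is 576 codons, and 576 codons less the stop codon gives 575 residues.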


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCAACA TGCAGAATGA TGCCCGGAAT GCCGGAAATG CGGGATCCTC GCCGAGTTTC 
CGCGAACTGG CGCTGTGGTG GTCGCGCGTC CTCGGGCCAG ACGCGGGCTT CATACGTCTC
GCTCTGCTCT ATGGCGCGGC GATCTCGCTG CTTTCGCTGG CAACGCCGAT TTCTGTCCAG
CTCCTGATCA ATTCGGTTGC CAACACTGCC CTGCCCGCCC CGCTGTTCAC GCTCGCGGCG
ATCCTGTTCG TGCTTCTGCT GCTGTCAGGC GTCCTCGCTG CCTTCCGCGT CCACATGCTT
GCCATGTTCG AGCGTCGCTT CTTCGCGCGG CTCGTAGCGG AGATTACCTT GCGCGCCGTC
CATGCGCAGA ACCCGTTCTT CGTCGATGCG CGGCGCGGCG ACCTGTTCAA CCGGTTCTTC
GACATGGGCA TCGTCCAGAA ATCGCTGACG AGCCTGCTCA TCGGCGGCTT CACCATCGTC
CTGCAAAGCC TGGTCGGGCT GGTCGTGACC AGCTTCTATC ACCCCTTCTT CCTTGGCTTT
AATCTGGTTC TGGTGTTCGC CGTTGTCGTC ATCTGGCGGG TCTGGGCCAA TGCCTCGCTC
ACCAGCGCGG TGACCAAGAG TCACGCCAAG CACGCCGCCG CGCATTGGCT CGGCAGCGTC
GGCAGCTCGA ACGGCATATA CAAGTCGAGC CGCCACATGC ACTTCGCCAT CGACCGATCG
GAGGAGCTGA CGGCCCGCTA CGTCTCGTCG CACCGCCGCC ATTTCCGCTA CACCTTCGGG
CAGACCCTCG CGTTCCTGTT CCTCTATGCG CTAGCCAGCG CGGGCCTGCT TGCGCTCGGC
GGATGGCTGA TTCTTCAGGG CGAACTTTCC ATCGGCCAGC TCGTCGCGGC CGAGCTGATC
CTGTCCGGCG TGTTCTACGG CATCGCCCAG CTCGGCTCCT ATCTCGAAAC CTTCTACGAC
CTCGCTGCCA GCCTCGAGGA ATTGCACCTG TTCTGGGAAG TTCCGCAGGA ATCCGCGCCC
GATGCCACCG CCCCCAGCCC TCCCGACGGC GCGATCCGCC TGCGCGGCGT GCGCCACCAG
GGCCATCTGT TCGATTTCGC CGTGGAAAGC GGCGCCCAGG TCGGAGTCCT TGCCGCACCG
GGGGTGGAAC GCAGCCTCGT CGCCCTGCTC AAGCGCCTGG AAGTGCCCGA AAACGGCTTC
GTCTCGGTCG GCGGGGCCGA TCTTGGCGCC CTCGACATGT ACCGCCTCCG CTCGGACGTG
ATCGTGCTCG ACCGGCCTGC CATCGTCGAG ATGACGGTTC GCGAGTATCT CTCACTCAGC
GCTCCGGGTC GGCCGGAAGC CATGATCGAA GCACTTGAAC TCGTCGGCCT TGCCGGCCGC
GTCGGCGGTC TACCCGGTGG CCTCGACGCG CGCCTGTCGA ACTCGGGCTG GCCGCTGTCC
GTAGGGGAAA CCATGGCCCT CAAGCTGGCC GGTGCCGTGC TTGCCCGCCC GCGCGTTCTC
ATGCTCTCGC CGCTTTACGA CATGCTGCCT CCGGGCCGAC TGGATCGGGT CCTGGCCGCG
CTGCGCCCGC ATGGCACCAC CGTCCTCCAG TTCACGGAGA GGCCCGAAGG ACTCACCCGG
GACACATGGT TGTGGATCGG CCAGAAGTCG CAGCATCGCG CGCCCGACCT TGCCGCCGTC
CTCCCCTTCG CCAGCGCGGA AGACGCCGCC ATCGCAATGG AGGGCTGA
 
Protein sequence
MSNMQNDARN AGNAGSSPSF RELALWWSRV LGPDAGFIRL ALLYGAAISL LSLATPISVQ 
LLINSVANTA LPAPLFTLAA ILFVLLLLSG VLAAFRVHML AMFERRFFAR LVAEITLRAV
HAQNPFFVDA RRGDLFNRFF DMGIVQKSLT SLLIGGFTIV LQSLVGLVVT SFYHPFFLGF
NLVLVFAVVV IWRVWANASL TSAVTKSHAK HAAAHWLGSV GSSNGIYKSS RHMHFAIDRS
EELTARYVSS HRRHFRYTFG QTLAFLFLYA LASAGLLALG GWLILQGELS IGQLVAAELI
LSGVFYGIAQ LGSYLETFYD LAASLEELHL FWEVPQESAP DATAPSPPDG AIRLRGVRHQ
GHLFDFAVES GAQVGVLAAP GVERSLVALL KRLEVPENGF VSVGGADLGA LDMYRLRSDV
IVLDRPAIVE MTVREYLSLS APGRPEAMIE ALELVGLAGR VGGLPGGLDA RLSNSGWPLS
VGETMALKLA GAVLARPRVL MLSPLYDMLP PGRLDRVLAA LRPHGTTVLQ FTERPEGLTR
DTWLWIGQKS QHRAPDLAAV LPFASAEDAA IAMEG
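
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demonstration uses only the first line of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # which removes both the block spaces and the line breaks.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGTCCAACA TGCAGAATGA TGCCCGGAAT GCCGGAAATG CGGGATCCTC GCCGAGTTTC"
seq = clean_sequence(first_line)

print(len(seq))
print(round(gc_fraction(seq), 2))
```

Passing the whole gene sequence block through `clean_sequence` and then `gc_fraction` should allow the reported gene length and GC content to be compared against the sequence itself.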