Gene Saro_0040 details

Gene Information

Locus tag: Saro_0040
Symbol:
ID: 3916043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 44243
End bp: 45412
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 70%
IMG OID: 640442765
Product: secretion protein HlyD
Protein accession: YP_495323
Protein GI: 87198066
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
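
The coordinates and lengths in this record are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length), and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(44243, 45412)
print(length, protein_length_aa(length))  # 1170 389
```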


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACTACG AAACGACCAT CGATGCCGAG GGCGCACAGG CGCTGGGATC CCTGGCCGAC 
GGGGAGGACA GCGCAAGCCA ATCGCGGCGC AAGTGGATCA TCGGCGTCGC CGTGGTGCTG
CTGGTCGTGC TTGCCTGGTG GTTCCTGCAT GGACCGAGCG AGCCGGCCGG TCCGGCGAAG
ACCCAGGCCC AGGTGGTCAC GGTCGTCGTG CCGGGCAAGA CCGTCATTGC CGGTACCATA
ACCGCCAGCG GCACGATCGC CGCGCGCCGC GAGATGCCCG TTGGCGTCGC GGGCGAGGGC
GGACAGATCG TGCAGGTCCT TGTCGACGCG GGCGACTGGG TTCGTGCAGG GCAGGTGCTG
GCCGTCATCG ACCGTTCGGT GCAGGGCCAG CAGATCGCCA GCCAGGCCGC CAACGTCGAA
GTTGCGGCGG CTGACGCGCG GCTGGCGCAA GCCAATCTCG ACCGTGCACT CAAGCTGGTC
GAGCGCGGTT TCATTTCCAA GGCCGACGTC GACCGCCTGA CCGCCACCCG CGACGCGGCC
GTGGCGCGCG TTCGCGTGGC CCGGGCAAGC CTGGGAGAAC TGGGCGCGCG TGCGGCGCGG
CTCAACATCG TGGCGCCGGC GGCGGGGCTT GTCCTCACCC GCGCGGCCGA ACCGGGCCAG
ATCGTCAGCT CGGGTTCGGG CGTGCTGTTC TCGCTTGCCC GCGACGGGCA GATGGAAATG
CAGGCGCGCC TTGCCGAAGC CGACCTTGCG CGGTTGACAG TGGGCGCCAC CGCCGAGGTG
ACGCCCGTGG GCACGACCCG CGTCTTCAAC GGGCAGGTCT GGCAGCTTTC GCCGACCATC
GACCAGCAAT CGCGCGAGGG CATCGCCCGC ATCGCGTTGT CCTATGATCC TGCGCTGCGT
CCGGGCGGTT TCGCCAGCGC GACGCTGCGT TCGGGCACGG TCACCGCGCC GCTCCTGCCG
GAATCGGCCA TTCTGAGCGA CGACAAGGGC ACCTTCGTCT ACGTCGTGGG CAGCGACAAC
AAGGCCCAGC GGCGCGACGT GAAGACCGGC GAGGTCGGCG CGCGCGGCAT TTCGGTGGTC
CAGGGTCTGG CGGGCAACGA GCGGGTGGTG CTGCGGGCGG GCGGATTCCT GAATCCGGGC
GACGCGGTCC AGCCAGTCCT CGCCAAGTAG
 
Protein sequence
MNYETTIDAE GAQALGSLAD GEDSASQSRR KWIIGVAVVL LVVLAWWFLH GPSEPAGPAK 
TQAQVVTVVV PGKTVIAGTI TASGTIAARR EMPVGVAGEG GQIVQVLVDA GDWVRAGQVL
AVIDRSVQGQ QIASQAANVE VAAADARLAQ ANLDRALKLV ERGFISKADV DRLTATRDAA
VARVRVARAS LGELGARAAR LNIVAPAAGL VLTRAAEPGQ IVSSGSGVLF SLARDGQMEM
QARLAEADLA RLTVGATAEV TPVGTTRVFN GQVWQLSPTI DQQSREGIAR IALSYDPALR
PGGFASATLR SGTVTAPLLP ESAILSDDKG TFVYVVGSDN KAQRRDVKTG EVGARGISVV
QGLAGNERVV LRAGGFLNPG DAVQPVLAK
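
The link between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch in plain Python. It is not the method the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts; since this gene begins with ATG, no alternative-start handling is included. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown in this record.
first_blocks = "ATGAACTACG AAACGACCAT CGATGCCGAG"
print(translate(first_blocks))  # MNYETTIDAE
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the result to be compared against the protein sequence and the GC content field of the record.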