Gene Saro_3802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3802 
Symbol 
ID5077950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp451544 
End bp457795 
Gene Length6252 bp 
Protein Length2083 aa 
Translation table11 
GC content65% 
IMG OID640481525 
Productouter membrane autotransporter 
Protein accessionYP_001166187 
Protein GI146276027 
COG category 
COG ID 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACCA ACAAGAGCAG GCTTGCCCTG GGCGCAGCCA GTGCGGCCGT TGCCGTGGGC 
CTCGCGGCCC AGGTCCAGGC CGCCACTACG ACCAGCGGAA CCACCGTAAC GGTCGCCGGG
ACCGACAACA CCGCCGACAG CGTGAATACT GCGACCAACG CCGTGTCCGG TTCGACCGTC
ACGGTAAGCA TCAGCACCGG CGCGACCGTC GTCCAGCCCA GCACTGCCCT CTCCCCGACG
CAGCAGGGCG CGGTCGCGAT CACCAATCGC GGGACCGTCG GCACCTCGGT GGCGCCGGTT
GGCATCGTTT ACGCAGGAAC CTCCACCAGC AGCGACAACA CCTTCGACCT TGCCAACCTC
GGTGCGATCA CCGGTGGCGT CTCGGTCAAC GGCGTTGGCG GCACAGTCAA CATCGTCAAC
AGCGGCACGA TCGACAACGG CATCGGCGTC ACGGCCGCAG GCGCGGTCAC GATCCGCACC
GGTGCGGTAA AGAGCGGATC GGGCATTGCC GTATCGGCGG CCAGCCAGGA CAACGTGTCC
ATCGCGGTCG ACGGAACGAT CGGCACCGCC GGCACCGAGA TGGTGTCCTC TGCCCTGCGC
GACGTGCAGG CCATCAGCAT CGGCACGACT TCTACTTCGA CCGGCCCGAC CACCGAAACC
GTCGATGGCA CCACGACCGT CTCGGGCTCT TCCTCCAGCG ACTACACCGG CGGAAGCGCT
GATGTCGCGG TGGAAGCGGG CGGCGCATCG GGCGCCGTTC TGGCTGTCGG CCTCGCTTCG
GCAAGCGTCT CGGTTGACGG GGCAGTAGGC GCCGAGGACC AGAGCGCGCA GGTGAACGCG
ATCGCCAACC TGGGCGAAGT CCACTCGGCA TCGGACTACG AATACGTCAC GACCGCGACC
GGCTCCAGCT ATGCCACGCA CGAGGAATTC GCCACGATGG GTGGCGACGC ATCGGTCGAC
ATCGGCGAAG GCGGCTTCGT TTCGGGCGCA GTGACGGCTG CGGGCCTTAC CTCTGCCGAA
GTGACCGTCG ATGGCACCGT GGGCCGCGAG GGCTTCTCCT CGTCGGTCAA TGCACGCGCG
CTCGGCGTCT CGCACAGCGA GGACGAAACC GGCACCGTGG TCAATGCCAG CGGTGACTAC
ACCAACAACT TCACTGCCAC CAACGCCCGC ACCGGCGGAA CGGCTTCTGT AACGGTCGGG
GAAAACGGCA TCGTTCGCGG GGCAATCAAT GCTCGTGCGG ACGGCGACGC CACGATCGAC
AACAGCGGTC AGGTGGACGG AGCGCTCAAC GCCCGCTCGA ACGAATATGT CTCGAACCAG
GAAGTGACTT CCTACAATTC GGGCACCTTC ACCTCGGCGG ACGGCGTCTC GGTAGATGCC
TATACTTCGG GCTATGCCTT CACCAGCGGC AATGTCGGCA ACGCAGCTTC GGTGACGAAC
GCCCAGGGCG CGTCGGCCGG ATACGCCAAC CTCAATGCCA TCGGCGATGC CACGCTCACC
AATGGCGGTG AACTGGTCAA CGGCGCACTG GTGAATTCGT CGGGCACGAC CTCGACGGGC
AGCGGCGCCA CGAGCGAAAC CTACACCTTC ACCGACACCG GCGAAGGCAC CTATCGTTCC
GCCTATTCGT ATAGCGATGA AACCACCTCG GGCTCTTCGA CCATCGGCGG CACGGCCAGC
GTGGCCAACG CCAGCGGCGG CACGATCGGC AACGGTCCGG GCTCGTCGCT CTCGGTCAAC
GGCTTCGCCG GTGCCCAGGT GCAGAACGCG GGGATCATCA ACGCTAACGT CAACCTCGGC
TCGACCGGGT CGGACTCTAC CAGCGCGTCG CTTTCGACCT ACGACGAGAC GGCAGATACC
GTTGGTGTCA CCGCTTCCGA GTCCACCTAC GAATATGCCA ACGCGAGCAC GGCCACGGGC
GGCTCCGCCA GCCTGACCAA TGCGGCCGGT GGCCTCGTCG GACTGAACGA CGAAAGCCCC
GTCTCGGTCT CGCTCCATGC CAATAGCGAT GCGTCGGTCA CCAACGCCGG CCGCATCAAC
GGCCCGGTCT CGGTGGTTGC GGACGCGACC GATACGGCCA ACTCCGGCGG TTCGGCCAAT
ACCTTCACGG TCGACAGGAC GACCGGCGTG GTGAACACGT CCGAAGCGGA GCATTACGAA
AGCTCGTCGG CTTCTGCCGG AGGCAGCGCA TCGTTTGCCA ACGCCGCTGG CGGACTGGTT
GTCGGGTCCG TGAACGTGAG CGGCGATGCC GGCGTCACAG TCTCTAACGC CGGTGTGGTC
ACCGGAACGA CCTACGCGTC GAGCAACTCC TCCGCTTCGA CTTTCGCGTA CGAATCGCTC
GAGACGGGCG TCTTCACGCC GGGTGCTGAA GGCGGGTTTG TCTCCGACTA CACGCAGAGC
GTCAGCTCGT CCTCGAACGA CGATGGCGGT GACGTCACCG GGACCTACGA TGGCTTCAAC
GGTGCGGTGC AATTCACCAA TTCCGGTGCG TCCGACGGCT CGGTCACGCA GTTCGCCAAT
GGGAACAGCA CGGCGACGGT AAGCGGAACG ATCTTCGGCA GCTTCAACGA AACGGCTTCG
GGCAGCGCCT CGACGTCGAC CTATACCGAG ACCACCCACT ACGCCAACGA CGCCGACGGC
AACACCTTCG AAGGCACCTA CGACGAGGCC TCGTCGGGCA CCTACGATCG CACGGGCGGC
AACAGCTCGC TTGCGGTTGC GGGCGGCACG ATCACCGGCA ACGCGACAGT CAACGCCGAT
GGCGATGCCT CCGCGCAGCT CGGCAACGGT GCCGAAATTG GCGGCAATCT CGACGTGCAT
GCCGCGACCT TCGGCAGCAA CAGCACGTTC GCCGAAACCA GCTCGACAGT CGGGACTTAC
GTCGACGGCG ACCTGACGGG CTATACCCTC GAGGAAGCGT CGGAATCGTC GAGCACCAAC
CTGGATAGCG CAGGTTCGGT CAGCATCGGC AATGCCGAAG TCGGTGGCTA TGTCAGCGTC
AGCGGCGCCA AGGGCGGTGC CACGCTGGAC CTCGCCTCGG CCGGTTCGAT CGGTGGCTCG
GCCTACGTCT ATGCCGGCGG GTCGGACTCG GAATCGGCGG GTACCTCGAC CACGGTGGTT
TCCGATGGTG AAACAACCGT CGATACTGCC TCTGAATCGA GCAGCACCGC AAACGGTGGC
AACGTTTCGG CGACTGTCGC CGGTACGATC GGCGGCGACC TGTTCGTCGA CACCAATGCC
GGAAACGCGA CTGTTGCCCT GACCGGGCAG GTTGGCTCCG ACATCGTCGT GGACGCGGTT
GGCTTCTCGG GAACGAGTGC CGGCACGACC CACACCGACG CGGACGGTTA TGTCACCACG
ACGGAATCGA CCTCCACCCC GGTGGGCGGA ACGGCAAGCC TTGCGGTCAA TGCCGCATCG
CTCGACGTTC CTGCGTCCTA TGGCGACATC GACGTCAGCG GCCTGGGCGG TTCGACCGTC
ACGATCGGCG CGAAGAGCGC GGTTCTGGCG GGCGCATACG ACACCGAACT GAACGTTGGC
GGCACCTATG CGGCCACCAC CTCCAGCTCG GAGTTCACCG ATCCCGCCAT CGGCCTTGCC
ACCTACCACG AGGAAGGTAC TTCGACCGCG GTCGGTGGTC CCGCTTCGCT GACCAATGCC
GGCACGATCG GCTACGACAA CGGCGATGAC AGCGCGACGC TGGCCTCGGT CACTGTCGCG
AGCGTCGGAG GTGCCACGGC GGTCAATACG GGCAAGATCT TCGGCAGCCT TTCGGCCAAT
GCGCTGGGGA CCGATACCGT AACGACTGTC GACCAGATCA ACCTCTATGA CGTGACCCGC
GTCGATACCA CGGTCGTCGA ATACACGGCG GTGGGCGGCA ATGCGGCGAT CACCAACAGC
GGCCTGGTCA CCGGATCGGT CTCGCTGGCG GGTGCAACCG GCACGGTCAC CAACAGCGGC
ACGATCGGCG GCGACATCGC GGTCGGGCAG TCGGTGGACA ACTACACCAC GACCAGCGTC
GACACTCTGA CCCAGATTGG CGAAGAGCTG GTTACGGCCC AGCCGGAAGC GCCCTTCACC
CAGACCTATA CCGTCAATCA GAACGGTACG GTCGAAGGCG ACATCCGGAT CGGCGGCGCG
TTCGGCAACT ATGCCCTCGC TCCGTCGGAT GGCGGCGAAG TCACCACCGC AGCGGTCGAC
ACCACGGCGG TCGATGCCGA CGGCCATCCG CTGACCAGCG TGATCAACGC CACGGTCAAC
CTCGGCAACA ACTCCGTCAC CAACGGCGGT GTCTATGCCG AATACGATCT TGATACCGGC
GAGCGCTTCA CCAACACGGT GGTGAACGTG GCATCGACGG CCAAGCTCGG CGGCGGCGTC
CATGGCGTCG AAAAGCTGAA CAAGCTTGGT ACCGGCGTGT TCACGCTGAC CGGCCCTTCC
TATGTCGCGG CGACCGATGT CGATCCGGCT GAATGGACCC TCGATCTCGG CCAGTTCGAG
ATCCTGACCG GCGAAGTGCA GCTGGCCACC GACGACGGCG GCGTCTTCGG CATCCGCGGG
GACGTGAAGA ACGCGGGCAG CCTCGTTCTG GGCACCCGCC AGACGCTCGT CCCGACGCCG
TTCGGTTCCA ACCTGACCAG CACCGCGACC CAGACCATCG CCGGCGTGGA CGTCTACCAG
CAGGGCGATT TCGTCCAGAC CAGCAGCGGT TCGCTGACGG TGGCGATGAT GCCGTCGCTG
GTCCGCGTGG TCGATCCGTC GATCAACGGC TCGGCTTCGT CAAACGAGCC GCTGGGCGTG
CAGCAGATCC TGTTCTCGCA GGGCCTGTTC ACCACCCCGG ACAAGGCGTT CGGTTCCCAG
TATGCCGCGC TCTACGCACC CAGCTCGTGG ACCATCGATG GCGATCTCGA CCTCGCCGGA
ACGGTCAACG TGCTGATGCC GAAGGGCGGG CTGTTCCTGG ACGGCCAGTC GCTGGACCTG
TTCAGCGTCT CGGGCGATGT CACTGAAAAC GCCACCGTTG CCACCGGTAC GGCCAACAAC
TTCGTTGCCT TCGATCTTGT CAGCCGGACC TCGGATGGCA GGACGATCGT GGCGGTCGTG
GCCGATCGCA AGGGCTACGA GACGGCTGCG GCCAACAGCA ATGCCGCTGC GGCCGGTGCT
GCGCTCTCGG CGGCGCTGCC GGGCGTGGTG GCCGACCTGA CCGCCGATGC GGCCGGGACT
GCGACGTTCG CTTCGGTGCA GGAGTTCGCA CTGACGCAGG ACCTCGCAAC GGTCATGGCC
GGGCTCGACA GCCAGCTTAC GCTGGCCCAG GCGACCCAGG CACTGACCGA GCTGGGCGGC
GGTTCGTACT ACGGTTCGAT CGCCACCATC CGCACGACCG CACCGTTCAT CGACGTGCTC
AGCAACCGTC GTCTTCCGGA AGGCGCCACC GGGTTCAACC TGTGGATCCA GCCCACCGGC
GACTTCGTGC GCACTTCGGG TGACGCGGCA ACCGGCGCTT CGAAGATCCG TTCGGACAAC
TACGGCGGTT CGGCGGGCTT CGGTGTGGCG ACCGGTTCGG GCGAGTTCGG CATTGGCTTC
GGCTATGGCC GGACCAACAG CCACTCGGAC GATGGCCTTG CCAAGGCCAA TGCCGATACC
TGGATGGTCG GCGGTTATGC CCGCCAGTCG TTCGGCGCGC TGACCGTTGC TGCGGACCTC
GTGTTCGGCT GGAGCAACTG GGACGCCTAC CGCTCGCTTC CGACCCTGTC GCGCAAGGCC
ACGGCCGACT TCGACAGCAA GGAAACCCGC GGCGACCTGC GCATCGAGTA CGCGCTGCAG
ACCGGTGGAC TGACGGTTTC GCCGTTCGGT CAGCTCGAAC TGCGCCACTA CAGCTTCGAC
GGCTTCGCCG AAGAAGGTGC AGGCTCGGTC GGGCTCTCGG TTGCCGAAGC CAGCAAGACC
GTGTTCACGC CGACGGTCGG CGTGAAGCTT GGCGGAGAGT TCGAGACCGG TCTCGCAACG
ATCCGTCCGG AAGCCTCGGT CAGCTACAGC TTCCAGGGTG ACAACGAGGC GGATCGCACC
GTCGCGTTCC TGGGTGCTCC GGCGCAGAAC TTCCGTCTGC AGGGCGTCGA TCCCGACGGC
TTCGTCACGG TCCAGGCGGG TCTGTTCGCC GACATCGGGA CGCGCTCGGG CGTCTTCGTC
CGCGGCAGCT ACTCGACTGG CGGCGGCAAC AACGTTGCAG CCATCCGCAC GGGAGTCGTG
ATCGGCTTCT GA
 
Protein sequence
MKTNKSRLAL GAASAAVAVG LAAQVQAATT TSGTTVTVAG TDNTADSVNT ATNAVSGSTV 
TVSISTGATV VQPSTALSPT QQGAVAITNR GTVGTSVAPV GIVYAGTSTS SDNTFDLANL
GAITGGVSVN GVGGTVNIVN SGTIDNGIGV TAAGAVTIRT GAVKSGSGIA VSAASQDNVS
IAVDGTIGTA GTEMVSSALR DVQAISIGTT STSTGPTTET VDGTTTVSGS SSSDYTGGSA
DVAVEAGGAS GAVLAVGLAS ASVSVDGAVG AEDQSAQVNA IANLGEVHSA SDYEYVTTAT
GSSYATHEEF ATMGGDASVD IGEGGFVSGA VTAAGLTSAE VTVDGTVGRE GFSSSVNARA
LGVSHSEDET GTVVNASGDY TNNFTATNAR TGGTASVTVG ENGIVRGAIN ARADGDATID
NSGQVDGALN ARSNEYVSNQ EVTSYNSGTF TSADGVSVDA YTSGYAFTSG NVGNAASVTN
AQGASAGYAN LNAIGDATLT NGGELVNGAL VNSSGTTSTG SGATSETYTF TDTGEGTYRS
AYSYSDETTS GSSTIGGTAS VANASGGTIG NGPGSSLSVN GFAGAQVQNA GIINANVNLG
STGSDSTSAS LSTYDETADT VGVTASESTY EYANASTATG GSASLTNAAG GLVGLNDESP
VSVSLHANSD ASVTNAGRIN GPVSVVADAT DTANSGGSAN TFTVDRTTGV VNTSEAEHYE
SSSASAGGSA SFANAAGGLV VGSVNVSGDA GVTVSNAGVV TGTTYASSNS SASTFAYESL
ETGVFTPGAE GGFVSDYTQS VSSSSNDDGG DVTGTYDGFN GAVQFTNSGA SDGSVTQFAN
GNSTATVSGT IFGSFNETAS GSASTSTYTE TTHYANDADG NTFEGTYDEA SSGTYDRTGG
NSSLAVAGGT ITGNATVNAD GDASAQLGNG AEIGGNLDVH AATFGSNSTF AETSSTVGTY
VDGDLTGYTL EEASESSSTN LDSAGSVSIG NAEVGGYVSV SGAKGGATLD LASAGSIGGS
AYVYAGGSDS ESAGTSTTVV SDGETTVDTA SESSSTANGG NVSATVAGTI GGDLFVDTNA
GNATVALTGQ VGSDIVVDAV GFSGTSAGTT HTDADGYVTT TESTSTPVGG TASLAVNAAS
LDVPASYGDI DVSGLGGSTV TIGAKSAVLA GAYDTELNVG GTYAATTSSS EFTDPAIGLA
TYHEEGTSTA VGGPASLTNA GTIGYDNGDD SATLASVTVA SVGGATAVNT GKIFGSLSAN
ALGTDTVTTV DQINLYDVTR VDTTVVEYTA VGGNAAITNS GLVTGSVSLA GATGTVTNSG
TIGGDIAVGQ SVDNYTTTSV DTLTQIGEEL VTAQPEAPFT QTYTVNQNGT VEGDIRIGGA
FGNYALAPSD GGEVTTAAVD TTAVDADGHP LTSVINATVN LGNNSVTNGG VYAEYDLDTG
ERFTNTVVNV ASTAKLGGGV HGVEKLNKLG TGVFTLTGPS YVAATDVDPA EWTLDLGQFE
ILTGEVQLAT DDGGVFGIRG DVKNAGSLVL GTRQTLVPTP FGSNLTSTAT QTIAGVDVYQ
QGDFVQTSSG SLTVAMMPSL VRVVDPSING SASSNEPLGV QQILFSQGLF TTPDKAFGSQ
YAALYAPSSW TIDGDLDLAG TVNVLMPKGG LFLDGQSLDL FSVSGDVTEN ATVATGTANN
FVAFDLVSRT SDGRTIVAVV ADRKGYETAA ANSNAAAAGA ALSAALPGVV ADLTADAAGT
ATFASVQEFA LTQDLATVMA GLDSQLTLAQ ATQALTELGG GSYYGSIATI RTTAPFIDVL
SNRRLPEGAT GFNLWIQPTG DFVRTSGDAA TGASKIRSDN YGGSAGFGVA TGSGEFGIGF
GYGRTNSHSD DGLAKANADT WMVGGYARQS FGALTVAADL VFGWSNWDAY RSLPTLSRKA
TADFDSKETR GDLRIEYALQ TGGLTVSPFG QLELRHYSFD GFAEEGAGSV GLSVAEASKT
VFTPTVGVKL GGEFETGLAT IRPEASVSYS FQGDNEADRT VAFLGAPAQN FRLQGVDPDG
FVTVQAGLFA DIGTRSGVFV RGSYSTGGGN NVAAIRTGVV IGF