Gene Saro_0545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0545 
Symbol 
ID3918675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp594815 
End bp597991 
Gene Length3177 bp 
Protein Length1058 aa 
Translation table11 
GC content64% 
IMG OID640443275 
ProductOuter membrane autotransporter barrel protein 
Protein accessionYP_495826 
Protein GI87198569 
COG category 
COG ID 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCGTC TGCGCACCAC CACCATTCTA TCGACCCTGG CAGGGACGCC CGTGGCGTTG 
GCGCTGCTGG TTCCGCAGGC AGCCAATGCC GCGACCAGCA TCACCACCAG CCAGACCACG
CCGGTCAAGA CTTCCACCGC AGGCGACCTG ACCATCGGCG ACGACGGCAA GATCACGCTG
GAAACGGGCG AGCCTGCGGT CACGATCGAT TCCAACAACA CCGTGACCAT CGATTCGGGC
GGCGCGATCA AGAACGTGGA AGGCGAGGAT GGCGCCATCG GAATCGCCGT CGACGCGGGC
AAGACCACGA CGATCACCAA TGACGGCACG ATCACGATCA CCGAGACCTT CACCGTCTCG
GACGACGATT CAAACGGCAT CGCCGACGGC CCGATTTCCT CGGCCAGCGA CCGCTACGGC
ATTCTAGTGC GGTCGGGAAG CACGACGAAG GCGTCGATCG AGAACACCGG GACGATCACG
GTCGAAGGCC TCAACTCGGG CGGCATCGTC GTGAAGTCGG ACCTTGATGG CAGCATCGAG
AATACCGGCA CGATCAAGGT CCTGGGTGAC AACGGCGTGG GCATCTCGAC CCAGGGCGTG
ACCGGCGACG TGACCATCGA AGGCACCGTT GCGGTCGTCG GCAAGGGCGC GCAAGGCGTG
GTACTGGGCG GCGACGTGGG TGGCACATTC CGCATCCAGG GCGCGATCGC GCAGTCATCG
TCCTATACCA CCGATGACGG CACCTCGCAG ACCCTGTCGC GCACGGACTT GCGCACCGGA
AAGGCCGCGG TCGAAGTCAC CGGCAATGTC GCGGGCGGTA TCCTGCTCGA CGCGGCTCCC
TACAACCGCG ACAGCTCCAA CACCGACGAG GACGGCGACG GCGTCGCCGA CGCATCGGAG
GAAACGGGTT CGATCGCCTC GGTCGGCAAC AGCCCCGCCC TGCTGATCGG CGGCACCAGC
GACATCACGA TCGGCAAGGT CACCGGCAGG GACGGTGACT TTTCGCTCGC CATCGACGGC
AACATCACCG CCAGTTCGGT CTACAGCAAT ACCAATGCCT ATGCCGTGGT GATCGGTGGC
CAGGGCGGTT CGGTGACGAT GGCGAACGGC ATCGGCGTGT CGGGCTCGGT CATCGCGACC
ACCGTCGACG AGACTGCGAT CGCCGTGCTC ATCAACGAGG GATCGACCGT CCCCACCCTG
TCGAACAGCG GGACGATCAA GGCCAACATC AGCTCGCCGG GCGAAGGCGC GGCCTATGCC
ATCCAGGACA AGTCCGGCAC CCTTACCACC ATCGAGAACA CCGGTTTCAT CACCGTGACC
GGATCGAGTA CCGACGACAT GCGGGCGATC GACGTCAGCG CGAACACGAC CGGCGTGACG
ATCAAGCAGT ACCTCAACGA CCTCGACGAG CTGGCGCAGG AGAACGAGCA GGAGGAAGAC
GGCTACGACG CCAGCAATCC CACCATCTAT GCCGCGATCA CCGGCAACAT CTACACCGGC
AGCGGCAACG ACGTACTCGA TATCGCGACG GGGCGGATCT ACGGCAACAG CTACCTCAAT
GCCGGTAACG ACCAGGTCCT GCTGTCGGGC GACAGCGGCT ACGAAGGCAA GATCTACTTC
GGCAGCGGCA CGGCGACGAT GACCATGTCG GACACGGCAT ACTTCGTGGG CAACCTCGAC
CTCGCGGGCA ACGCGGGCAC GCTGACGATG TCAGATTCCT CGAGCTTTTC GGGCACGATC
AGCAACGGTG CGAATCTCGA CGTGACCGTG AACGGCGGCA CGTTCGGCGC AAGCAGCGCG
ACGACGCTTT CGTTCGATAC CCTGACGGTG AAATCCGGCG GCGCGCTCAA CGTCTACATC
GATGGCAGCG AAGGCACCGC CTCGCTGATC GACGTGAACA CCGCGACATT CGCCAGCGGC
TCAAAGGTCT CGGCGACGAT CTCCTCGCTG GAGAATGCGG AAGGGTCCTA CACCATCCTC
AAGGCGGACT CGCTCGAGGG AACGCCGTCG TTCGATTCGA CGACGACCGA ATTGCCGGTG
CTGTTCAACG GCGACGTCAG CGTGGTGGGC GAGACGCTGG TGCTCGACGT GACTCGCAAG
ACCGCAAGCG AACTCGGACT GACCGCGCCG CAATCGGCCG CCTACGAAGC GATCTATTCC
CAGGCGGTCG CGATCGACGA TCTCGGAACC AGCCTGTTGC AGGTGGAAGA CGTTGCCGCG
CTCCAGGAAC AGTTCAACCA ACTCTTGCCC GACTATGCCG GCGGCGTGTT CGACTTCGTC
ACCCGCAGCG GCCGGCTCGC CTCGCGGCAC CTGATGGACG ACAGTTCGCT GTTCGACATC
AGCAACGCGG GCGGTTGGCT GGAGCCGATC TGGTTCCGGG GCAGCAAGGA CGACACTGGC
ACGGCGGGCT TCAAGGTCAA GGGCTGGGGC ATTTCCTCGG GCATCGAGCG GATCACTGGG
ATCGGCAACG TCGGCCTCTC GTTCGCCTAT ACCAAGGGCA GCATCTCCAC CGGCAGCTAC
CAGAAGACCG ACGCCAGCAA CTACGAGCTT GGCGCATTCT GGCGCACCGG CACCGGACCG
TTCTATGCCT ATGCCAAGAT CTCGGTAGGC CGCGTGTCAC TGAATTCGAC CCGCACCTTC
ACCGGCGAAG TGGACAGCGA CAGCCTGTCC TACAGCGCCA ATGGCCAGTG GAAGGGCTGG
ACCTTCGGTG GCCAGGGCGG CGCGTCCTAC AAGCTGGCGC TGGGCGGCGG GCTCGCGCTC
AAGCCGATGG CGCGCTTCGA CTGGTACCGC CTGAACGAGA AGGGCTATAC CGAAAGCGGC
GACGACGAGA TCTACCTCAC CGTCGCCAAG CGCAACTCCA GCCTGCTCAG CGGCACCGGC
AGCCTTACCG CTTCATGGAG CGCGGGCGAA TCGACGCGCG AAAGCCGGCC GCTGACGGTC
GAACTGGAAG GCGGCTATCG CTCGCGCCTG GCGGGCAAGC TGGGCACCAC GGTCGCCAAC
TTCGAGGACG GCGACCAGTT CCGCCTCACG CCGGACGCGA TGAAATCGGG CTGGACCACC
GAAGCCCGCA TCCTGGCCGG CGGTCTCGAC TACACCTGGC AACTTGCCGG CGGCGCCGAG
CAGATCCAGG GCAGCGTCGA CTATTCGGTG CGCGGCTCGC TCAGCATCGC GTTCTGA
 
Protein sequence
MDRLRTTTIL STLAGTPVAL ALLVPQAANA ATSITTSQTT PVKTSTAGDL TIGDDGKITL 
ETGEPAVTID SNNTVTIDSG GAIKNVEGED GAIGIAVDAG KTTTITNDGT ITITETFTVS
DDDSNGIADG PISSASDRYG ILVRSGSTTK ASIENTGTIT VEGLNSGGIV VKSDLDGSIE
NTGTIKVLGD NGVGISTQGV TGDVTIEGTV AVVGKGAQGV VLGGDVGGTF RIQGAIAQSS
SYTTDDGTSQ TLSRTDLRTG KAAVEVTGNV AGGILLDAAP YNRDSSNTDE DGDGVADASE
ETGSIASVGN SPALLIGGTS DITIGKVTGR DGDFSLAIDG NITASSVYSN TNAYAVVIGG
QGGSVTMANG IGVSGSVIAT TVDETAIAVL INEGSTVPTL SNSGTIKANI SSPGEGAAYA
IQDKSGTLTT IENTGFITVT GSSTDDMRAI DVSANTTGVT IKQYLNDLDE LAQENEQEED
GYDASNPTIY AAITGNIYTG SGNDVLDIAT GRIYGNSYLN AGNDQVLLSG DSGYEGKIYF
GSGTATMTMS DTAYFVGNLD LAGNAGTLTM SDSSSFSGTI SNGANLDVTV NGGTFGASSA
TTLSFDTLTV KSGGALNVYI DGSEGTASLI DVNTATFASG SKVSATISSL ENAEGSYTIL
KADSLEGTPS FDSTTTELPV LFNGDVSVVG ETLVLDVTRK TASELGLTAP QSAAYEAIYS
QAVAIDDLGT SLLQVEDVAA LQEQFNQLLP DYAGGVFDFV TRSGRLASRH LMDDSSLFDI
SNAGGWLEPI WFRGSKDDTG TAGFKVKGWG ISSGIERITG IGNVGLSFAY TKGSISTGSY
QKTDASNYEL GAFWRTGTGP FYAYAKISVG RVSLNSTRTF TGEVDSDSLS YSANGQWKGW
TFGGQGGASY KLALGGGLAL KPMARFDWYR LNEKGYTESG DDEIYLTVAK RNSSLLSGTG
SLTASWSAGE STRESRPLTV ELEGGYRSRL AGKLGTTVAN FEDGDQFRLT PDAMKSGWTT
EARILAGGLD YTWQLAGGAE QIQGSVDYSV RGSLSIAF