Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0545 |
Symbol | |
ID | 3918675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 594815 |
End bp | 597991 |
Gene Length | 3177 bp |
Protein Length | 1058 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640443275 |
Product | Outer membrane autotransporter barrel protein |
Protein accession | YP_495826 |
Protein GI | 87198569 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCGTC TGCGCACCAC CACCATTCTA TCGACCCTGG CAGGGACGCC CGTGGCGTTG GCGCTGCTGG TTCCGCAGGC AGCCAATGCC GCGACCAGCA TCACCACCAG CCAGACCACG CCGGTCAAGA CTTCCACCGC AGGCGACCTG ACCATCGGCG ACGACGGCAA GATCACGCTG GAAACGGGCG AGCCTGCGGT CACGATCGAT TCCAACAACA CCGTGACCAT CGATTCGGGC GGCGCGATCA AGAACGTGGA AGGCGAGGAT GGCGCCATCG GAATCGCCGT CGACGCGGGC AAGACCACGA CGATCACCAA TGACGGCACG ATCACGATCA CCGAGACCTT CACCGTCTCG GACGACGATT CAAACGGCAT CGCCGACGGC CCGATTTCCT CGGCCAGCGA CCGCTACGGC ATTCTAGTGC GGTCGGGAAG CACGACGAAG GCGTCGATCG AGAACACCGG GACGATCACG GTCGAAGGCC TCAACTCGGG CGGCATCGTC GTGAAGTCGG ACCTTGATGG CAGCATCGAG AATACCGGCA CGATCAAGGT CCTGGGTGAC AACGGCGTGG GCATCTCGAC CCAGGGCGTG ACCGGCGACG TGACCATCGA AGGCACCGTT GCGGTCGTCG GCAAGGGCGC GCAAGGCGTG GTACTGGGCG GCGACGTGGG TGGCACATTC CGCATCCAGG GCGCGATCGC GCAGTCATCG TCCTATACCA CCGATGACGG CACCTCGCAG ACCCTGTCGC GCACGGACTT GCGCACCGGA AAGGCCGCGG TCGAAGTCAC CGGCAATGTC GCGGGCGGTA TCCTGCTCGA CGCGGCTCCC TACAACCGCG ACAGCTCCAA CACCGACGAG GACGGCGACG GCGTCGCCGA CGCATCGGAG GAAACGGGTT CGATCGCCTC GGTCGGCAAC AGCCCCGCCC TGCTGATCGG CGGCACCAGC GACATCACGA TCGGCAAGGT CACCGGCAGG GACGGTGACT TTTCGCTCGC CATCGACGGC AACATCACCG CCAGTTCGGT CTACAGCAAT ACCAATGCCT ATGCCGTGGT GATCGGTGGC CAGGGCGGTT CGGTGACGAT GGCGAACGGC ATCGGCGTGT CGGGCTCGGT CATCGCGACC ACCGTCGACG AGACTGCGAT CGCCGTGCTC ATCAACGAGG GATCGACCGT CCCCACCCTG TCGAACAGCG GGACGATCAA GGCCAACATC AGCTCGCCGG GCGAAGGCGC GGCCTATGCC ATCCAGGACA AGTCCGGCAC CCTTACCACC ATCGAGAACA CCGGTTTCAT CACCGTGACC GGATCGAGTA CCGACGACAT GCGGGCGATC GACGTCAGCG CGAACACGAC CGGCGTGACG ATCAAGCAGT ACCTCAACGA CCTCGACGAG CTGGCGCAGG AGAACGAGCA GGAGGAAGAC GGCTACGACG CCAGCAATCC CACCATCTAT GCCGCGATCA CCGGCAACAT CTACACCGGC AGCGGCAACG ACGTACTCGA TATCGCGACG GGGCGGATCT ACGGCAACAG CTACCTCAAT GCCGGTAACG ACCAGGTCCT GCTGTCGGGC GACAGCGGCT ACGAAGGCAA GATCTACTTC GGCAGCGGCA CGGCGACGAT GACCATGTCG GACACGGCAT ACTTCGTGGG CAACCTCGAC CTCGCGGGCA ACGCGGGCAC GCTGACGATG TCAGATTCCT CGAGCTTTTC GGGCACGATC AGCAACGGTG CGAATCTCGA CGTGACCGTG AACGGCGGCA CGTTCGGCGC AAGCAGCGCG ACGACGCTTT CGTTCGATAC CCTGACGGTG AAATCCGGCG GCGCGCTCAA CGTCTACATC GATGGCAGCG AAGGCACCGC CTCGCTGATC GACGTGAACA CCGCGACATT CGCCAGCGGC TCAAAGGTCT CGGCGACGAT CTCCTCGCTG GAGAATGCGG AAGGGTCCTA CACCATCCTC AAGGCGGACT CGCTCGAGGG AACGCCGTCG TTCGATTCGA CGACGACCGA ATTGCCGGTG CTGTTCAACG GCGACGTCAG CGTGGTGGGC GAGACGCTGG TGCTCGACGT GACTCGCAAG ACCGCAAGCG AACTCGGACT GACCGCGCCG CAATCGGCCG CCTACGAAGC GATCTATTCC CAGGCGGTCG CGATCGACGA TCTCGGAACC AGCCTGTTGC AGGTGGAAGA CGTTGCCGCG CTCCAGGAAC AGTTCAACCA ACTCTTGCCC GACTATGCCG GCGGCGTGTT CGACTTCGTC ACCCGCAGCG GCCGGCTCGC CTCGCGGCAC CTGATGGACG ACAGTTCGCT GTTCGACATC AGCAACGCGG GCGGTTGGCT GGAGCCGATC TGGTTCCGGG GCAGCAAGGA CGACACTGGC ACGGCGGGCT TCAAGGTCAA GGGCTGGGGC ATTTCCTCGG GCATCGAGCG GATCACTGGG ATCGGCAACG TCGGCCTCTC GTTCGCCTAT ACCAAGGGCA GCATCTCCAC CGGCAGCTAC CAGAAGACCG ACGCCAGCAA CTACGAGCTT GGCGCATTCT GGCGCACCGG CACCGGACCG TTCTATGCCT ATGCCAAGAT CTCGGTAGGC CGCGTGTCAC TGAATTCGAC CCGCACCTTC ACCGGCGAAG TGGACAGCGA CAGCCTGTCC TACAGCGCCA ATGGCCAGTG GAAGGGCTGG ACCTTCGGTG GCCAGGGCGG CGCGTCCTAC AAGCTGGCGC TGGGCGGCGG GCTCGCGCTC AAGCCGATGG CGCGCTTCGA CTGGTACCGC CTGAACGAGA AGGGCTATAC CGAAAGCGGC GACGACGAGA TCTACCTCAC CGTCGCCAAG CGCAACTCCA GCCTGCTCAG CGGCACCGGC AGCCTTACCG CTTCATGGAG CGCGGGCGAA TCGACGCGCG AAAGCCGGCC GCTGACGGTC GAACTGGAAG GCGGCTATCG CTCGCGCCTG GCGGGCAAGC TGGGCACCAC GGTCGCCAAC TTCGAGGACG GCGACCAGTT CCGCCTCACG CCGGACGCGA TGAAATCGGG CTGGACCACC GAAGCCCGCA TCCTGGCCGG CGGTCTCGAC TACACCTGGC AACTTGCCGG CGGCGCCGAG CAGATCCAGG GCAGCGTCGA CTATTCGGTG CGCGGCTCGC TCAGCATCGC GTTCTGA
|
Protein sequence | MDRLRTTTIL STLAGTPVAL ALLVPQAANA ATSITTSQTT PVKTSTAGDL TIGDDGKITL ETGEPAVTID SNNTVTIDSG GAIKNVEGED GAIGIAVDAG KTTTITNDGT ITITETFTVS DDDSNGIADG PISSASDRYG ILVRSGSTTK ASIENTGTIT VEGLNSGGIV VKSDLDGSIE NTGTIKVLGD NGVGISTQGV TGDVTIEGTV AVVGKGAQGV VLGGDVGGTF RIQGAIAQSS SYTTDDGTSQ TLSRTDLRTG KAAVEVTGNV AGGILLDAAP YNRDSSNTDE DGDGVADASE ETGSIASVGN SPALLIGGTS DITIGKVTGR DGDFSLAIDG NITASSVYSN TNAYAVVIGG QGGSVTMANG IGVSGSVIAT TVDETAIAVL INEGSTVPTL SNSGTIKANI SSPGEGAAYA IQDKSGTLTT IENTGFITVT GSSTDDMRAI DVSANTTGVT IKQYLNDLDE LAQENEQEED GYDASNPTIY AAITGNIYTG SGNDVLDIAT GRIYGNSYLN AGNDQVLLSG DSGYEGKIYF GSGTATMTMS DTAYFVGNLD LAGNAGTLTM SDSSSFSGTI SNGANLDVTV NGGTFGASSA TTLSFDTLTV KSGGALNVYI DGSEGTASLI DVNTATFASG SKVSATISSL ENAEGSYTIL KADSLEGTPS FDSTTTELPV LFNGDVSVVG ETLVLDVTRK TASELGLTAP QSAAYEAIYS QAVAIDDLGT SLLQVEDVAA LQEQFNQLLP DYAGGVFDFV TRSGRLASRH LMDDSSLFDI SNAGGWLEPI WFRGSKDDTG TAGFKVKGWG ISSGIERITG IGNVGLSFAY TKGSISTGSY QKTDASNYEL GAFWRTGTGP FYAYAKISVG RVSLNSTRTF TGEVDSDSLS YSANGQWKGW TFGGQGGASY KLALGGGLAL KPMARFDWYR LNEKGYTESG DDEIYLTVAK RNSSLLSGTG SLTASWSAGE STRESRPLTV ELEGGYRSRL AGKLGTTVAN FEDGDQFRLT PDAMKSGWTT EARILAGGLD YTWQLAGGAE QIQGSVDYSV RGSLSIAF
|
| |