Gene Information |
Locus tag | Saro_0168 |
Symbol | |
ID | 3918304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 166719 |
End bp | 169919 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640442894 |
Product | TonB-dependent receptor |
Protein accession | YP_495451 |
Protein GI | 87198194 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | |
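
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(166719, 169919)
print(length)                     # 3201
print(protein_length_aa(length))  # 1066
```

Under these assumptions the computed values agree with the Gene Length (3201 bp) and Protein Length (1066 aa) fields of the record.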
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAATCA AAACATTCTC TGCGCTCCGT GCGAGCGCCG CGCCGCTGGC GCTGGTCGTC GCGGGCTTTA CCGCATCGAC CGCCTTCGTC GCGCCCGCAA TGGCGCAGGA CTACACCCGC GGTAACCTCG TGGGCGAGGT GCTCGACGGC AACGGCGCTC CCGTTTCCGG CGCGCAGGTG ACGATCCGTT CGAACGAACA GGGCTTCACC AACACCACCA CCACCGATTC CAGCGGCCAG TTCCGCGTCA CCGCTCTCCC GACCGGCACC TACTCGGTGA CCGTCACGGT CGACGGCGCG GTCGTTGTCC AGGACAACTC GGCCAGCGTC GTTGCCGGCT CGAACAACTC GTACCGTTAT TCGACGGGCG AAGCTGCTGC CGGCGGCGCG ATCGTCGTCA CCGGTTCGCG CATCAAGACC AACGACTTCG CTCAGAACAC CTCGGGCCTC ACCCTGAACG TGCAGGAAGT TGCCGAGAGC GTTCCGATCG CGCGTTCGCA GTCGGCCCTC ATCCTGCTTG CACCCGGCAC CAACGCCGGT GACACCGGCT TCGGCGACTG CCCCGACTGC GTGAGCTTCG GCGGCGCGTC GATCGCCGAG AACAGCTACT ACGTGAACGG CCTCAACACG ACGAACTTCC GTACCTTCGT CGGCAACAAC GTCGTTCCGT TCGAGTTCTA TCGCACGTTC GACGTCAAGA CCGGCGGCTG GTCGGCCGAA TACGGTCGTG CACTGGGCGG CGTGACCTCG GCGGTCACCA AGTCGGGTTC GAACAACTTC GAATACGGTG CGGTCGTCGC CTACACGCCC GACTTCCTGA GCGAAGATTC GCCCAACACC TACCTTGACG ATACCGGCTC GCTCAAGTCT CTCAACAGCC GCGACTACCG CGAGCGCCTC GAGGCAAACT TCTACCTCTC GGGCCCGATC ATCAAGGATC GCCTGTTCTT CTACGGTCTG GTCACGCCGC GCTACAGCGT GTCGGAAGAC ACCTCCCCGT CGTCGGGCTA TCGCGTCCGC GCCAAGTCGA ACACCCCGTT CTACGGCGGC AAGCTCGACT TCATTCCGTT CGACGGTCAC CGCATCGAAG GCACCTTCTG GTCCGACGAA CGCACGATCA ACTACGACTA CTACAACGTC GACGCGCTCG GTAACGAGAA GACCGGCATC ATCACTGGGA TCAATCGCGA AGGTCGTGAA ATCAACAAGA TCGGCGGCAA GAACTGGATC GTCCAGTACA CTGGCCAGTT CACCGACTTC TTCACGCTTT CGGGTGCCTA CGGCGAGAAC CGCTACAAGC GCTATGACGT GATCAGTGGT GGCGACAGCG CCGTTCCGAC CATCCAGACG CAGCTTGCCT ACGACGGCAA CGGCGAACCT GTCACCTCGC TCAAGACCAT CGCCGGCGTG CCCGTTTCGC CGACGGACGG CCAGGACCTG CGCAAGGTGA TGCGCATCGA CGCGGACCTC TATGTGAACC TGCTCGGTTC GCATCACTTC CGCTTCGGCT TCGATCGTGA AGACCTGTCG GTTACCGAAG ACACCTTCTA CACCGGTGAC CGCACCTATC GCTTCACGGC GAACTACATC CGCACCCGTA CCTACCTGAA CGAAGGCTCG TTCAAGACCA AGCAGACGGC CTTCTACATC CAGGATAGCT GGGACCTCCT GAACGACCGC CTCAACCTGC AGCTGGGCGT GCGCAACGAC CAGTTCCAGA ACTACGGCAT CACGGGCGGC AAGTACCTCG ACCTCAAGAA CCAGTGGGCT CCGCGTCTTG GCGCGTCGTT CGACGTGTTC 
GGCGACAAGC TGACCAAGAT CCAGGCGTTC TGGGGCCGCT ACTATCTGCC GGTCGCCACC AACACCAACA TCCGCCTGGC CGGCGCCGAA ACCTACTACG AGCAGCGCTT CGGCTACGCT CCGGGTGTCG TCGGTTCGAA CTATGACACG AACGGCGTTC CGATCGGCCA GCAGTTCGAC AGCTCGGGCG CTCCGATCCT CGGCTCGCTC ACCGGCGCAA ACTCGCTGAA CTGCCCCGAC TTCGGTCCGG GTGCCGGCCA GAAGTGCCGC ACCGTGTTCT CTGACGGCCT TCCCGGCCCG ACGGACACCC TGGTCTCCTC GACGCTCAAG CCGATGTACC AGGACGAACT GATCTTCGGC ATCACGCACC GTATGGAAGA CTGGACCTTC GGTCTTCGTT ACATCAACCG TCGTCTCAAG CAGACGCTGG AAGACATCGC GATTGACGAA GCGGTCAACC GTTACTGCGA ACAGCAGAAC CTCGATTGCG CAACCTCGTC GGGCAGCCCG ATCTGGTCGG GCTTCCACCA GTACGTTCTG GCCAATCCTG GTGAAGCCGT CACCGTGCGC CTCGATGGCG ATCCGACGAA GCCGGGCACG ACTGACGTCG TCACCCTGTC ACCGGAGCTG CTTGGCTATC CGAAGGCCGT CCGCAAGTAC GACTCGATCG AGTTCACCGC GTCCAAGGCC TTCAACGGCA CCTGGGGCTT CGACTTCAGC TACACCTGGC AGAAGCTTCG CGGTAACTAC GAAGGTTCGG TCAAGTCGGA CAACAACCAG GACGACGCCG GCCTTACGCA GGACTTCGAC GTTCCGGGGC TGACCACTGG ATCGTACGGT ACGCTTGCCA ACAATCGCGA GCATACCTTC AAGCTGTTCG GTTCGTGGCA GCCGGTTGAC TGGCTCCGCA TCGGTGCAAA CCTGACCGTC CAGTCGCCGC GCAGCTTCAG CTGCATCGGC GTCGCCATCC CGGACTACAT CAAGCTGCTC CAGGCTGGCG AAAGTGCGGT TCTGAACGGC GGTGCGGCTT CGCAGTACGG CGCCGCGTCG TTCTACTGCC GCAACCCGAA GGGCAACCAG AACGGTACGA CGGTCACGAA CGACATCACC GGCGAAACCA GCGTGCTGGT CAACCGTGGT ACGGCGTTCA AGAGCGACTG GTCGAAGAAC CTCGACCTCG GCTTCCAGTT CAAGCTGGGC GAGGCTCTGG GCAATTCGAA CTTCCGCATC GACGTGTTCA ACGTCTTCAA CTGGAAGTCG AAGACCGACT TCGTCGAATT CGGCGAAACG GACTCGGGTG CCACCCGCGC GGACTATCGT CTGCCGACCG GCTACCAGGC TCCGCGCCAG GTGCGCTTCA CCTGGACGAT GCGCTTCGGT GCAAACAACG GCGCCGACTG A
|
Protein sequence | MKIKTFSALR ASAAPLALVV AGFTASTAFV APAMAQDYTR GNLVGEVLDG NGAPVSGAQV TIRSNEQGFT NTTTTDSSGQ FRVTALPTGT YSVTVTVDGA VVVQDNSASV VAGSNNSYRY STGEAAAGGA IVVTGSRIKT NDFAQNTSGL TLNVQEVAES VPIARSQSAL ILLAPGTNAG DTGFGDCPDC VSFGGASIAE NSYYVNGLNT TNFRTFVGNN VVPFEFYRTF DVKTGGWSAE YGRALGGVTS AVTKSGSNNF EYGAVVAYTP DFLSEDSPNT YLDDTGSLKS LNSRDYRERL EANFYLSGPI IKDRLFFYGL VTPRYSVSED TSPSSGYRVR AKSNTPFYGG KLDFIPFDGH RIEGTFWSDE RTINYDYYNV DALGNEKTGI ITGINREGRE INKIGGKNWI VQYTGQFTDF FTLSGAYGEN RYKRYDVISG GDSAVPTIQT QLAYDGNGEP VTSLKTIAGV PVSPTDGQDL RKVMRIDADL YVNLLGSHHF RFGFDREDLS VTEDTFYTGD RTYRFTANYI RTRTYLNEGS FKTKQTAFYI QDSWDLLNDR LNLQLGVRND QFQNYGITGG KYLDLKNQWA PRLGASFDVF GDKLTKIQAF WGRYYLPVAT NTNIRLAGAE TYYEQRFGYA PGVVGSNYDT NGVPIGQQFD SSGAPILGSL TGANSLNCPD FGPGAGQKCR TVFSDGLPGP TDTLVSSTLK PMYQDELIFG ITHRMEDWTF GLRYINRRLK QTLEDIAIDE AVNRYCEQQN LDCATSSGSP IWSGFHQYVL ANPGEAVTVR LDGDPTKPGT TDVVTLSPEL LGYPKAVRKY DSIEFTASKA FNGTWGFDFS YTWQKLRGNY EGSVKSDNNQ DDAGLTQDFD VPGLTTGSYG TLANNREHTF KLFGSWQPVD WLRIGANLTV QSPRSFSCIG VAIPDYIKLL QAGESAVLNG GAASQYGAAS FYCRNPKGNQ NGTTVTNDIT GETSVLVNRG TAFKSDWSKN LDLGFQFKLG EALGNSNFRI DVFNVFNWKS KTDFVEFGET DSGATRADYR LPTGYQAPRQ VRFTWTMRFG ANNGAD
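
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11 (the bacterial code), in which GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 nucleotides only. The codon table is deliberately partial, covering just the codons in that prefix, and the start-codon set is a commonly used subset. The names `PARTIAL_CODON_TABLE`, `START_CODONS`, and `translate_prefix` are hypothetical and not part of any library.

```python
# Only the codons that occur in the 30-nt prefix below; not a full genetic code.
PARTIAL_CODON_TABLE = {
    "GTG": "V", "AAA": "K", "ATC": "I", "ACA": "T", "TTC": "F",
    "TCT": "S", "GCG": "A", "CTC": "L", "CGT": "R",
}

# Subset of the initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a short CDS prefix, reading an initiation codon as M."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
        else:
            residues.append(PARTIAL_CODON_TABLE[codon])
    return "".join(residues)


prefix = "GTGAAAATCA AAACATTCTC TGCGCTCCGT"  # first three blocks of the gene sequence
print(translate_prefix(prefix))  # MKIKTFSALR
```

The result matches the first ten residues of the protein sequence in the record. For full-length translation, an established library with a complete table 11 implementation would be the safer choice.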