Gene Information |
Locus tag | Saro_1603 |
Symbol | |
ID | 3918711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1667939 |
End bp | 1670911 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640444343 |
Product | TonB-dependent receptor |
Protein accession | YP_496877 |
Protein GI | 87199620 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
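The Start bp, End bp, Gene Length, and Protein Length fields above are consistent with one another, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1667939, 1670911)
print(length, protein_length(length))  # 2973 990
```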
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAGAGTT CAGCAACGCG ATCCATTGCG TTCCGCGCAT CCAGCATCGG CGCGATTGCG CTGGCGCTCG CCGGGCAATC CGCGCTGGCA CAGGACGTCG CTGCCGGCGA TGCCGAGCAG TCGGGTGAAA TCGTCGTCAC CGGCATCCGC GCCTCGCTCA GCAAGGCTCT CGACATCAAG CGCACGGCGC AGGGCGTGGT CGATGCGATC TCTGCCGAGG ACATCGGCAA GTTTCCCGAT ACGAACCTTG CTGAATCGCT CCAGCGCATC ACCGGCGTGT CGATTGACCG TTCGAACGGT GAAGGCTCGA CCGTCACGGT CCGCGGCTTC GGTCCGGAAT TCAACCTCGT CCTCCTGAAC GGCCGCCAGA TGCCGACCTC GTCGCTGGGC GACGGGGCCA GCGCGCCGTC CTCTCGCTCG TTCGACTTCG CCAACCTTGC CTCGGAAGGC ATCTCGGGCG TGGAAGTCTA CAAGTCGGGC CGCGCCACGC TGCCCACCGG CGGCATCGGC TCCACCATCA ACATCAAGAC CCCGCGTCCG CTAGACCGTC CGGGCCTGTC GGGCAGCCTC GCGGTTCGTG GCGTTTATGA CAGCTCGCGC AACGAAGGCA ATCCGATCAC GCCGGAGGTT TCCGGCATCG TCTCGGATAC GTTCGGCGAT GGCGTGTTCG GCATTCTCGT CACCGGCACC TGGCAGAAGC GCAAGGCCAG CGTGAACCAG GCGAACGTCG GCTGGCGCGA CGGCTATCTC GGTTCGGAAA ACAACTGGGG TTCGCTGCCG CAGGAAGGCG ATCCGCGTTA CGGCAGCATC ACCAACCGTC CCGGCCCGAA CGACGTCTAT CAGGTCCAGC AGAACGCCAG CTACGATCTC AACGACATCG ACCGCGAGCG GCTGAACGGT CAGGTCGTGC TGCAAGCTCG TCCGACCGAC AGCCTGACCG CGACGATCGA CTACACTTAT TCGCGCAACA CCGTGCAGGT GCGCAATTCG AACGTCGGCG TGTGGTTCAA CTTCAACGAC GTTTCCAGCG CCTGGACGGA CGGTCCGGTT GCTGGCCCGA TCTTCTATTC GGAAAAGTTC GGGGCGGGCG AAGGCAAGGA CTTGTCCTAT TCGGGCTCGC TCACCGAGAA CCGCTCGGAA AACAAGTCCA TCGGCGGTAA TCTTCAGTGG AAGGGGCCGG GCGGCCTGCG CCTCGAACTC GATGGTCATC ATTCGACGGC CGAATCCGGT GCCAACAATC CTTATGGCAC CAGCACTTCG GTCGGTACGG CGGTCTTCGG AATCAAGCAG CAGACGGTCA ACTACGAGAA CGACCTCCCG GTCATTTCGG TCGTCATGCA TGACGGCATC GACCCGCTCA ACGCCGCGAA CATCCAGGCC ACCGGCAATG CGTTCCGCAA CGCCTATTTC AAGGACACGA TCAACGAGGT CCAGTTCCGC GGCGGTTACG ACTTCGACAA CTCGATCCTC GACAGCCTCG ATTTCGGCGT GACCTATGTC GAGAACAAGG TGCGTTCGGC CTATGGCTTC ATCCAGAACG ATACCTGGGG CGGGTCGACC ACGAAGGAGC AGCTCCCGGA CGACCTGTTC ACGCTCGAAT CCCTGCCTGA CAAGTTCAAG GGTGTCTCGG GTGCCAGCGA CCCGGCGATG ATCCAGAGCT TCTACCGCTT CAACTTCGAG AAGATGGTCG GCTTCCTCGA CGACCTGAAC GGCATCTGCG GCGGCGATGG CGATTGTCGC GCGCCATTCA CGGTCGACCG GCGCATCCGC GAACGGACGC TGGCGCCCTA TGCTCAGGCG 
AACCTGACCT TCGACCTGCT CGAAAACCCT GCGCATTTCC GCGCCGGCAT TCGCTACGAA AAGACGAAGA TCACCTCTTC GGCGCTCGTG CCGATCCCGA CCGGCACGCA GTGGGTGGCC GCGAACGAAT TCAACCTCAC CTACGGCAGC GGATCGGACT TCACCACGTT CAAGGGTGAA TACGAGAACT GGCTGCCCGC GTTCGACTTC GACTTCGAGC CGATCGAGAA CGTCAAGGTC CGCGCGAGCT ACAGCCACAC CATCACCCGG CCCGACTATG CCTCGATGCA GGGCGGCCGT ACGGTGGACC AGCTCTTCCG CATCGGTGGC GGCTTCGGCA GCCAGGGCAA CCCGGGCCTG CTTCCCTTCA AGTCGAAGAA CATCGACGTG TCGGCGGAGT GGTACTACGC CCCGGCCAGC TACCTGTCGG TCGGTTTCTT CGACAAGCGG GTGAGGAACT TCATCTCGAG TACGCGGGTT GACACCGAGG CGTTCGGGCT GACCAATCCG GCCGATGGCC CGCGTTACCA GGCCGCCGTG GCCGCACTTG GCCCCAACGC CAGCACGACC GCGATCCGCA ACTACATCTT CGCCAACTAC CCGTCTTCGG TGGTCGTCGA CAGCTACGAC CCGGTCACCG GAAACTACAC CGGCAAGATC CTCGGTCTGC CCGAAGACAA CAAGGTGAAC TTCCAGATCA CCACGCCGAT CAACTCGGAC CAGGCGGCAC ACCTCTATGG TTTCGAGTTC GCCGTGCAGC ACAGCTTCTG GGATACCGGC TTCGGCGCGA TCCTGAACTA CACCGTGGTC AAGGGCGATG CGAAGTACGA CAATTCCCAG CCGTCCAGCG TGCCGCAGTT CGCGCTGACC GGCCTTTCGG ACAGTGCCAA CGCCGTCCTG TTCTATGACA AGAACGGGTT GCAGGCGCGC GTCGCCTACA ACTGGCGCGA CAAGTTCCTC GCCGGCACGG GCCCCAACCC GTACTATGTC GAGGCCTATG GCCAGGTCGA CGCAAGCGCG AGCTATGAGT TCCGCAAGGG ATACACCGTG TTCGTCGAGG CGATCAACCT TACCGGCTCC AGCCGACGGG GGCACCTGCG CAGCACCAAC AACGTGTTCT TTTCGTCGCC GGGCTATGCC CGCTACCAGG CCGGTTTCCG CTTCAATTTC TGA
Protein sequence | MKSSATRSIA FRASSIGAIA LALAGQSALA QDVAAGDAEQ SGEIVVTGIR ASLSKALDIK RTAQGVVDAI SAEDIGKFPD TNLAESLQRI TGVSIDRSNG EGSTVTVRGF GPEFNLVLLN GRQMPTSSLG DGASAPSSRS FDFANLASEG ISGVEVYKSG RATLPTGGIG STINIKTPRP LDRPGLSGSL AVRGVYDSSR NEGNPITPEV SGIVSDTFGD GVFGILVTGT WQKRKASVNQ ANVGWRDGYL GSENNWGSLP QEGDPRYGSI TNRPGPNDVY QVQQNASYDL NDIDRERLNG QVVLQARPTD SLTATIDYTY SRNTVQVRNS NVGVWFNFND VSSAWTDGPV AGPIFYSEKF GAGEGKDLSY SGSLTENRSE NKSIGGNLQW KGPGGLRLEL DGHHSTAESG ANNPYGTSTS VGTAVFGIKQ QTVNYENDLP VISVVMHDGI DPLNAANIQA TGNAFRNAYF KDTINEVQFR GGYDFDNSIL DSLDFGVTYV ENKVRSAYGF IQNDTWGGST TKEQLPDDLF TLESLPDKFK GVSGASDPAM IQSFYRFNFE KMVGFLDDLN GICGGDGDCR APFTVDRRIR ERTLAPYAQA NLTFDLLENP AHFRAGIRYE KTKITSSALV PIPTGTQWVA ANEFNLTYGS GSDFTTFKGE YENWLPAFDF DFEPIENVKV RASYSHTITR PDYASMQGGR TVDQLFRIGG GFGSQGNPGL LPFKSKNIDV SAEWYYAPAS YLSVGFFDKR VRNFISSTRV DTEAFGLTNP ADGPRYQAAV AALGPNASTT AIRNYIFANY PSSVVVDSYD PVTGNYTGKI LGLPEDNKVN FQITTPINSD QAAHLYGFEF AVQHSFWDTG FGAILNYTVV KGDAKYDNSQ PSSVPQFALT GLSDSANAVL FYDKNGLQAR VAYNWRDKFL AGTGPNPYYV EAYGQVDASA SYEFRKGYTV FVEAINLTGS SRRGHLRSTN NVFFSSPGYA RYQAGFRFNF
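The sequences above are displayed in space-separated blocks of ten letters, so they need to be normalized before any computation. The following is a minimal sketch, using only the Python standard library, of how one might strip the display formatting, compute a GC percentage, and split a nucleotide sequence into codons. The helper names are hypothetical, and only the first two displayed blocks of the gene sequence are used as input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str) -> list:
    """Split a sequence into complete 3-base codons, dropping any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two displayed blocks of the gene sequence in this record.
head = clean_sequence("ATGAAGAGTT CAGCAACGCG")
print(codons(head)[:3])  # ['ATG', 'AAG', 'AGT']
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` would give a value to compare against the GC content field; the first codons shown here correspond to the M, K, S residues that open the protein sequence.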