Gene Information |
Locus tag | Saro_2509 |
Symbol | |
ID | 3916830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2712217 |
End bp | 2714607 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640445266 |
Product | TonB-dependent receptor |
Protein accession | YP_497779 |
Protein GI | 87200522 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.538781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGCGA TGCTCGCCGG CCTTGCCACT CCGGCTCTCG CCGAGGAACA GCAGGTTGCC CAATCCGACA CCGGCCTGGC CGAGATCATC GTCACCGCCC AGCGACGCAC CGAGAACCTT CAGGACGTGC CGATCGCGAT CACTGCGGCA AACTCCGAAA CGCTCGCCCA GGCACGCGTC GAAAACGTTG CCAACATCCA GGCGATCAGC CCCTCGATCA GCTTCCGCGT GACCAACATC GCCACGTCGA GCGCCAACCT CATCATCCGC GGCCTCGGCA CGACCGGCAA CAGTCGTTCG TTCGAAGGCT CGGTCGGCGT GTTCATCGAC GGCGTCTACC GCACCCGCGC GGCAGCGGCG CTCCAGAACT TCCTCGACAT CGACAATCTC CAGGTCCTGC GCGGCCCGCA AGGCACCCTG TTCGGCAAGA ACACCACCGC CGGCGCGCTC CTGCTCAGCT CCGCCGCGCC CTCGCTCAAC GACGTCAACG GCTCGGTCGA GGCGACCTAC GGCAACTATG ACGGCCTGAT CGTACGCGGA GCCATCAACG CGCCGCTGTC CGATACGGTC GCCTTCCGCA TCGCGGGCCT CGCGTCCAGC CAGAACGGCT TCTACACCGA CAGCACCACC GGCGACGATC TCAACGGCAA CAAGACCCGC GCCGCAAAGG CGCAGCTCCT GTTCGAGCCG AGCGAGAACC TTACGGTCCG CGTGATCGGC GACTACTCCT ACAGCAACGG CAATTGCTGC TACGCCACTT CGGCCTTCAT CGATGGCCCG ACCCAGCCGC TGATCGACCT GCTCACGCTC TACCAGCCGT CCAGCAGCGC CCAGCTTCTC GGCGTACTGA CCGGCGCGCT GCCCGCTTCG TCGATGACGC CGACCGGCCG CACCCTGCCC TCGCGCGATG CCTCGAAATG GGAGCAGACG CTGAACGGAA ACGGCAAGCA GACCATCGAG GACTACGGCG GCACGCTGCT CGTCGATGCC TCCATCGGCG AAGGCACGCT GAAGTCGGTC ACCGCCGTGC GCAAGTTCAA GGTCGATCAG GTCGACCTCG ACCCCGATTT CTCGGGCGCG GACATCTTCC GCTACAACGA AAGCTTCGAA AGCCGCTTCA TCTCCCAGGA ACTGACCTAC AACACCAAGA TTACGGCGCT CAATGCCGAG GCGGTCTTCG GCCTGTTCTT CTCGGATGAA AAGCTCAAGA TGGGCCGCAG CCTGCCCTGG GCCGACCAGG CCCAGTACTA CTGGGACGTG ATCTTCGCGC AGCTCGGCGT CGCGCCCGGC ACGGCCAACG CCGCCCCCGG CACCTGGACG AGCGAACGCA TGGGCGGTTC GGCGAAGTCC TACGCCGGCT TCGCGCATCT CGATTTCGCG GTGAACGACA AGTTCAACGT GATCGCCGGC CTGCGCTATT CGGTCGAGAA GAAGCGCGGC TTCTTCAACA ACTCGTTCTA TCGCTCCTCG CCGTTCGACG TGTTCACCCT GCTCGGCATC GCACCGGCGC CGGCCTATGA CGCGACTTCG ACCGACAAGG CGCTGTCCGG AACCTTCGGC CTCCAGTACC GCCCGACCGA CGACATCATG CTCTATGCAA CGTACAACCG CGGCTTCAAG GCGGGCGGCG TGAACATGGA CGTGAACGCA GCCGGTACGC TGATCAACAA TGCAGAGGCA TACAACGCCC TGCCCGCCCC GATCCGCGCC GCCTTCTTCG GCAATGCCGA GGCCAAGGAC CCGCTGAATC CCCGCTACAA GCCCGAGAAG GTCAACGCCT TCGAGGTCGG CGGCAAGTTC 
CAGTACCTTG ACGGCCGCGC GCGCACCAAC ATCGCGTTCT TCTACTACGA CCTGTCCGAT CTCCAGATCG CTCAGTTCAT CGGCCTGCGC TTCACCGTGC TCAACGCCAA GTCCGCCAAG GACTACGGCG TCGAGATCGA GAACATGTTC CAGCTCACCG ATGGCCTGAC GCTCGGCCTC GATGGCACCT GGATCCCGCA TGCGCAGTAC GCGAAGGACG CGAACATCGA CCCGGTCCTG TCCGGCTCGC GCTTCCGCTT CAGCCCCAAG TTCTCGGGCA ACGCGACGCT GAACCTCGAC CAGCCGATCA ACGACAACCT CAGCCTGCTC GCCCGCGCAC AGGTCCAGTA CCAGAGCCGC CAGCTCATAA GCACGGCGAC CACGGCGGAA CAGGGCGCGG TGACGCTGGT CAACGCCAAC CTCGGCTTCA AGCTGCCGCA GACGGGGCTG CTGATCGAAG GCTGGGTGCA GAACCTGTTT GACAAGACGT GGTTCACCCA GTCCTTCCCA ACGCCGCTCC AGACCGGCGA CCAGAACGCC TACCCGGGTG CGCCGCGCAC CTACGGCATC CGCGTCCGCG CGACGTTCTG A
|
Protein sequence | MGAMLAGLAT PALAEEQQVA QSDTGLAEII VTAQRRTENL QDVPIAITAA NSETLAQARV ENVANIQAIS PSISFRVTNI ATSSANLIIR GLGTTGNSRS FEGSVGVFID GVYRTRAAAA LQNFLDIDNL QVLRGPQGTL FGKNTTAGAL LLSSAAPSLN DVNGSVEATY GNYDGLIVRG AINAPLSDTV AFRIAGLASS QNGFYTDSTT GDDLNGNKTR AAKAQLLFEP SENLTVRVIG DYSYSNGNCC YATSAFIDGP TQPLIDLLTL YQPSSSAQLL GVLTGALPAS SMTPTGRTLP SRDASKWEQT LNGNGKQTIE DYGGTLLVDA SIGEGTLKSV TAVRKFKVDQ VDLDPDFSGA DIFRYNESFE SRFISQELTY NTKITALNAE AVFGLFFSDE KLKMGRSLPW ADQAQYYWDV IFAQLGVAPG TANAAPGTWT SERMGGSAKS YAGFAHLDFA VNDKFNVIAG LRYSVEKKRG FFNNSFYRSS PFDVFTLLGI APAPAYDATS TDKALSGTFG LQYRPTDDIM LYATYNRGFK AGGVNMDVNA AGTLINNAEA YNALPAPIRA AFFGNAEAKD PLNPRYKPEK VNAFEVGGKF QYLDGRARTN IAFFYYDLSD LQIAQFIGLR FTVLNAKSAK DYGVEIENMF QLTDGLTLGL DGTWIPHAQY AKDANIDPVL SGSRFRFSPK FSGNATLNLD QPINDNLSLL ARAQVQYQSR QLISTATTAE QGAVTLVNAN LGFKLPQTGL LIEGWVQNLF DKTWFTQSFP TPLQTGDQNA YPGAPRTYGI RVRATF
|
| |
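The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene sequence is a stop codon (the sequence above ends in TGA). The function names and the partial codon table are hypothetical helpers introduced only for illustration; the table covers just the codons found in the first 30 bases of the gene sequence.

```python
# Values copied from the record above.
START_BP, END_BP = 2712217, 2714607
GENE_LENGTH_BP = 2391
PROTEIN_LENGTH_AA = 796
GENE_PREFIX = "ATGGGCGCGATGCTCGCCGGCCTTGCCACT"  # first 30 bases, spaces removed
PROTEIN_PREFIX = "MGAMLAGLAT"                   # first 10 residues

# Partial codon table (assumption: only the codons needed for GENE_PREFIX).
# These assignments are the same in the standard code and in table 11.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "GGC": "G", "GCG": "A", "GCC": "A",
    "CTC": "L", "CTT": "L", "ACT": "T",
}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS whose final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def translate_prefix(dna: str) -> str:
    """Translate complete codons using the partial table above."""
    codons = [dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3)]
    return "".join(PARTIAL_CODON_TABLE[codon] for codon in codons)


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)
print(computed_gene == GENE_LENGTH_BP)          # coordinates vs. Gene Length
print(computed_protein == PROTEIN_LENGTH_AA)    # Gene Length vs. Protein Length
print(translate_prefix(GENE_PREFIX) == PROTEIN_PREFIX)
```

Under these assumptions all three comparisons hold for Saro_2509: 2714607 - 2712217 + 1 = 2391 bp, and 2391 / 3 - 1 = 796 aa.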