Gene Information |
Locus tag | Saro_3668 |
Symbol | |
ID | 5077816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 297891 |
End bp | 300164 |
Gene Length | 2274 bp |
Protein Length | 757 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640481391 |
Product | TonB-dependent receptor |
Protein accession | YP_001166053 |
Protein GI | 146275893 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.859497 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGTGT CGAGGCTACT TTTGTCCGCA GCGGTCTGCG CGTTGGTTCC ATCTTTCGCA TTCGCGCAAA ATTCTGGTCA GGCCGATTCC GGCTTGGAAG AAATCATCGT CACCGCGCAA AAGCGCGAGC AGAACATGCA GGACGTGCCG GTCGCGGTCA CCGCGCTGTC CGCCGAGACC CTCACCAACC GCAACGTCGC CTCGGTTGCA GACCTGCCCC GCCTCGCTCC CAGCCTCACG CTGACCCAGG GCAACGTGCC CACCAACAAC TCGCTCAACT TGCGCGGCAT CGGCACGATC GCCTTCAGCA CCGCCATCGA ACCTTCGGTC GCGGTGGTCG TCGACGACGT TGCCTTGCTC CAGCAGGCCC AGGCCTTCTC GGGCCTCAGC GACATCTCCC GCATCGAAGT GCTGCGCGGC CCCCAGGGAA CGCTGTTCGG CAAGAACGCA TCGGCGGGCG CGGTCAACAT CGTCTCGCAG GGCGCGTCCG ACGTGTTCAC CGGCGCGGTC ACCGGCACTG CCACCACCGA CGATGAATAT CGCGTCGACG CGTCGCTGGC CGGCCCGCTC GGCGAAAACG CCGGTTTCCG TGTCAACGCC TTCTACGGCG ACCGCAAGGG CTACATCCGC AATCTTGAGG ATGGCTCGCG CCTCAACAAC GACAAGAGCT ACGGCTTCCG CGGCCGCCTC GAACTAAAGC CCACCGAAAC GATCAAGGTA GACCTGATCG CCAGCCACTC GATCAGCGAA AGCGACGGCT TCGCCCGCAC CTTCCGGGCC GCGCCGACCG GCGCCGCCGT GTTCGGCACC CCGCTGACCG ACAGCCTCGT CGGCATCACG CCGGGAGAGG ACAACTACTC CGTCCGGCTC GACAAGCCGC TGTTCAACAA GAGCAAGCAG ACCACCGTCT CGGGCCGCGC CACGCTCGAT CTCGGATTTG CCGACCTGAT TTCGGTCACC AGCTACCAGG ACTGGCGCTT CCAGTTCGAG GAAGACTTCG ACTACACCGT GTCGGACGTG CTCGGCATCC CCGGCGGAAT CGTGGCCGAC AGCACCTATC ACGCCACCCA GTTCGCGCAG GAACTGCGCC TCGTCTCGCC CAGCAAGGGC CGCTTCAGCT ACGTGCTCGG CCTGTTCTAC GCCGACGGCA AGACCGACCG CGAATTCGAA CGCGGCCCCT CGGGCCCGGT CGTCGCGAGC TGGGCCTCGC AGAGCCGCAC CGAAAGCTAC GCCGCCTTCG GACAGGCCAC CTTCAACCTG ACCGACACCA CGCACATCGA TGCCGGGGTG CGTTTCAACC ACGAAAAGGT CGGCGCCAGC TTCCTCAATC GCGTGCCCAA CGCCTCGCCC CCGGCCGATA ACGCCACCTG CCTCACCACC TGCGTCGGCA ATGCCAAGGA CAGTGTCGTG ACCTGGAAGA CCGCCCTGCG CCAGGATATT GGCGATGCGG TCATGGTCTA TGCCTCGTTC GCGCGCGGAT ACAAGGGCCA GGGCTTCGAC ATCAGCACCG GATTTAACCC GCGACGGGCA GCCTTCCCGG TGCGTCCGGA AACGTCCAAT GCCTATGAAG TGGGCATCAA GTCACGCTTC CTCGACAACA AGGTCCAGCT CAACATCGCA GGCTTCTGGA GCGATTTCCG CGACTTCCAG GCCCAGTCCG GCATTCTGCT GCCCGACAAC ACGGTCCTGC TCACGCTGAA CAACGTCGGC AAGGTCCGCA CCCGCGGCAT CGAGGCAGAA CTTACCGCCA AGCCCACGGC GGCCCTGACG CTCGACAGCG CGGTCAGCTT TACCGACACC 
CGCATCATGG AATTCCCGGG CGCCCAGTGC TACACCGGCC AGACCACTGG CTGCGTCGAT CTCGACGGCG ATGGCCCGGC GACCGTCAAG GGACAGGACC TTGCCGGAAA GCGCCTTCCC AACGCGCCGC GCCTCAAGTT CAACGCGGGC TTCAACTACG ACGTGTTCCT GCCTTCGGCA CCGTTCGATG CCTTTGTCCA GGCCGACGTT TCCTACCAGA GCAAGGTCAA CTTCGACCTC CTCGGCAATC CGCTGACGGT CCAGGATGGC TATGCGGTGG TCAACGGCAG TATCGGCATC GACCAGAACG AGCGCGGCGG AATGCGCGTG GCCCTGTTCG TCAACAACCT GTTCGACAAG CACTACGCCT CGAACGTCAG CATCGCCTCG GGGGGCTCGG CCGGCCTGCT CAGCCAGGCT CTCGACCGCA AGTCCCGCCG TTACTTCGGC ATCCGGGCCC GCTACCAGTT CTGA
|
Protein sequence | MRVSRLLLSA AVCALVPSFA FAQNSGQADS GLEEIIVTAQ KREQNMQDVP VAVTALSAET LTNRNVASVA DLPRLAPSLT LTQGNVPTNN SLNLRGIGTI AFSTAIEPSV AVVVDDVALL QQAQAFSGLS DISRIEVLRG PQGTLFGKNA SAGAVNIVSQ GASDVFTGAV TGTATTDDEY RVDASLAGPL GENAGFRVNA FYGDRKGYIR NLEDGSRLNN DKSYGFRGRL ELKPTETIKV DLIASHSISE SDGFARTFRA APTGAAVFGT PLTDSLVGIT PGEDNYSVRL DKPLFNKSKQ TTVSGRATLD LGFADLISVT SYQDWRFQFE EDFDYTVSDV LGIPGGIVAD STYHATQFAQ ELRLVSPSKG RFSYVLGLFY ADGKTDREFE RGPSGPVVAS WASQSRTESY AAFGQATFNL TDTTHIDAGV RFNHEKVGAS FLNRVPNASP PADNATCLTT CVGNAKDSVV TWKTALRQDI GDAVMVYASF ARGYKGQGFD ISTGFNPRRA AFPVRPETSN AYEVGIKSRF LDNKVQLNIA GFWSDFRDFQ AQSGILLPDN TVLLTLNNVG KVRTRGIEAE LTAKPTAALT LDSAVSFTDT RIMEFPGAQC YTGQTTGCVD LDGDGPATVK GQDLAGKRLP NAPRLKFNAG FNYDVFLPSA PFDAFVQADV SYQSKVNFDL LGNPLTVQDG YAVVNGSIGI DQNERGGMRV ALFVNNLFDK HYASNVSIAS GGSAGLLSQA LDRKSRRYFG IRARYQF
|
| |
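
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a stop codon (the gene sequence above ends in TGA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon codes for one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string.

    Whitespace is removed first, so the space-separated 10-base blocks
    used in the record can be pasted in directly.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the Gene Information table for Saro_3668.
length = gene_length(297891, 300164)
print(length)                  # 2274
print(protein_length(length))  # 757
```

Both results agree with the Gene Length (2274 bp) and Protein Length (757 aa) fields. Passing the full gene sequence to `gc_percent` should give a figure comparable to the GC content field, although the rounding applied by the source database is not stated.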