Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3473 |
Symbol | |
ID | 5077622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 73888 |
End bp | 76248 |
Gene Length | 2361 bp |
Protein Length | 786 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640481197 |
Product | TonB-dependent receptor |
Protein accession | YP_001165859 |
Protein GI | 146275699 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACCC AAGCCACGCG GCGGCTGATC GTCGTTGCCC TTACCACCAC TGCAATCGGT TTCTCGGTGC CGGCATTCGC CGCCGAAGCC GATGAGCAGC AGGCGGCGCA GACCGCCGAG GCGACCGATT CAGGCGACAC CGGCGCGATC ATCGTCACTG CCCGTCGCCG TTCGGAAACC CTGCAGTCGA CCCCGGTTGC CATTACCGCC GTCAACACCG CCATGCTTGA AAGCAAGGCC GCAGTGAACA TCGGCGATCT TCAGGGCGCG GCTCCTGGCC TTCTCATCAC GCAGCAGAAT TCGGGCGCGC AGGCGGCCAA CATCTCGATC CGCGGCCTGA CCTATGCGGA TATCGAGAAG TCGCAGACGC CGACCGTCGG CGTCGTTGTC GATGGCGTGA CCATCGGCAC CAACACCGGC CAGCTCCAGG ACGCCTTCGA TGTCGCCCAG ATCGAAGTCC TGCGCGGTCC GCAAGGCACG CTGTTCGGCG CGAACACCAT CGGCGGCGTC ATCAACATCA CGCGTTCGAA GCCCACGATG GAACCCGGCG CCAAGGCCGA GTTCTCCTAT GGCCGCTGGA ACACGATGTC GCTCAAGGCC ATCGCCAACT ACGGCGACGG TGATACCTGG GGCGTCAAGG CGTTCTACTT CCACAACGAG ACCGACGGTT TCTACCGCAA TGTCACGCGC AACACGAATG CGGGCTGGAG CGTTGGCAAC ACCGTCGGCG GCAGCCTGCT GTTCAAGCCT GCGGGCTCGG GCTTCGACGC GCAGTTGACG GTCGAGCACG TCAGCCAGAA GTTCGATCCG GTCGTCAGCA ACCTGACCAA CAGCACCGAG GTGTTCTGCG GCTTCATTCC TGAGCGTGAG TGCAACCGCA ACAACACGAC CGATCTTTAC ACCACCTTCG GTGACTATGC CGAGAGCACC TACAATGCTC CCGACGCAAC GCTGGAAATG AACTACGATC TGGGTGGAGT GAAGCTGACC TCGATTACCG GTTGGCGCCA TTCCAAGGAG GCGCAGACTC AGGACTTCGA CGGTTCATCG ACCGACCTGT ATTACGTCGA TCGTCGCCAG CACTACACGC AGTGGAGCCA GGAGCTGCGC GCTGCAGGGA ATCTCTTCGA CGGCTTCGAC TATGTCGTTG GCGGTTACTT CTTCAGCTCG AAGTATGACC TGACGCAGTG GAGCCGAGTA TTCGGCTTCG ATTCTTCAAC CCCTCCGACC AAGTTCGACA CGGCCGCGCA GCACGTCGAA GGGAAGACCA AGAGCTATGC GTTCTTCGGC GACTTCAACT GGGCTTTCGC GCCGGGCTTC CGCCTCTCGT TCGGCGGCCG TTTCAGCCAC GACAACAAGA AGCTCAGCAA CGGCTTTGCC GATGGCGTCC TGCTCGATCC CGACAACCTC GATCTCAGCA AGATCGCGCT GGTCGGCAAG GGCGATGCCA GCTTCAACAA GTTCACTCCC AAAGTCGGCA TCGACTGGCG CCCGACGCCG GACCTGATGG TCTATGCCTC GTGGTCGCGT GGCTATCGTT CGGGCGGTTT CAGCCCCCGC GCCGCTACCG CTGCAACGGC CAGCACGCCG TTCCAGCCCG AAACGGTCGA CGCCTACGAA GTCGGCGTGA AGCTGGCAGC TTTCGATCGC AAGCTTGAGC TGAACGTCGC CGGCTTCGTG TCCGACTACA AGGACATGCA GCAGAACCTG ACCGTGCCTG GCGGCCCCAC CGGCAACCAG ACGATCACCG GCAACGTTCC GGGTGGCGCG CTGATCAAGG GCATCGAAGT CGACGGCACT GTCCGCGTGA CCGAAAACTT CAAGCTCACC GGCTCGATCG CGGTGATGGA CTCGCACTTC CGCAACTTCG TCACCTGTGG CGCCTATGCC GGCGGTGCGG TGGCGACCAA CGATTGCGGC ACCGGTCTCG TTCCATTCGA CTATTCGAAG AACCGTCTGA TCTACGCGCC TGATTTCACC GCTTCGCTCA GCGCGGAATA CACCCTGCCG ACGAGCTTCG GCGACGTTTC GGCCAACGTC GGCTGGCGCC ACATCTCGCC CTATGACGAA CAGCTCTCCG CTGCTTCGCT TACCCCTACG CTCAACGGCG ATGGCGAAGC GACGCGGATC ACCGTCGAAG GCAACGATCC GCGCGTCCGC ACCACCACGC AGGATCTGGT CGATGCGGCG CTGACCTTCA ACTTCGATTT CGACAATACC AAGGCCTATG TCCGCGTCTT CGGCCGCAAC CTGCTGAACG AGAAGACCAC GACCCACGCA TTTACCGTCG CGGGACTGTG GTCGTTCGGC ATGGCGCTCG AACCGCGCAC CTATGGCGCG ACGCTGGGGG TCAAGTTCTG A
|
Protein sequence | MKTQATRRLI VVALTTTAIG FSVPAFAAEA DEQQAAQTAE ATDSGDTGAI IVTARRRSET LQSTPVAITA VNTAMLESKA AVNIGDLQGA APGLLITQQN SGAQAANISI RGLTYADIEK SQTPTVGVVV DGVTIGTNTG QLQDAFDVAQ IEVLRGPQGT LFGANTIGGV INITRSKPTM EPGAKAEFSY GRWNTMSLKA IANYGDGDTW GVKAFYFHNE TDGFYRNVTR NTNAGWSVGN TVGGSLLFKP AGSGFDAQLT VEHVSQKFDP VVSNLTNSTE VFCGFIPERE CNRNNTTDLY TTFGDYAEST YNAPDATLEM NYDLGGVKLT SITGWRHSKE AQTQDFDGSS TDLYYVDRRQ HYTQWSQELR AAGNLFDGFD YVVGGYFFSS KYDLTQWSRV FGFDSSTPPT KFDTAAQHVE GKTKSYAFFG DFNWAFAPGF RLSFGGRFSH DNKKLSNGFA DGVLLDPDNL DLSKIALVGK GDASFNKFTP KVGIDWRPTP DLMVYASWSR GYRSGGFSPR AATAATASTP FQPETVDAYE VGVKLAAFDR KLELNVAGFV SDYKDMQQNL TVPGGPTGNQ TITGNVPGGA LIKGIEVDGT VRVTENFKLT GSIAVMDSHF RNFVTCGAYA GGAVATNDCG TGLVPFDYSK NRLIYAPDFT ASLSAEYTLP TSFGDVSANV GWRHISPYDE QLSAASLTPT LNGDGEATRI TVEGNDPRVR TTTQDLVDAA LTFNFDFDNT KAYVRVFGRN LLNEKTTTHA FTVAGLWSFG MALEPRTYGA TLGVKF
|
| |