Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1589 |
Symbol | |
ID | 3918697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1647915 |
End bp | 1650911 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444329 |
Product | TonB-dependent receptor |
Protein accession | YP_496863 |
Protein GI | 87199606 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.36828 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACCTC GTGAAGGGCT ACGACTCAAG TGGCCCGTGG AGGAGGATGA CATGCATATG AGGGGCACTC GCCTTGCAAT TTCCGCGTCC CTGCTGACCG TGAGCTTCGC GCTCGCAGCG CCGGCGTTCG CACAGGAAGC AGCGCAGGAC AAGGCGGCCG AAGCGCCGGT CGACGCCAGG GAAATCGTCG TTACCGGGTC GCTCATCCAG CGTCCCAACA ACACCGCTGT CAGCCCGATC GTGAGCGTCG GCGAGGCCGC GCTCAAGGAG ACCGGCCAGG TCAACCTGCA GGACGCGCTG AACCAGTTCC CGAGCTTCAC CACCGCCGGC AACGCCAACA CCGGCGGGCA AGGCACCGGC GGTCGCGCGT CGATCAACCT TCATGGCCTC GGGACCAACC GCAACCTCGT GCTGCTCGAT GGCAAGCGCC TGCCGCTGTC GGACATCAAC GGCAACGTTG ACATCAACAT CCTGCCGGAA TCGATCATGG GCGGCGTTGA CGTGATCACA GGCGGCGCCT CGGCGATCTA TGGTTCGGAC GCGATCTCGG GCGTGGTCAA CTTCAAGACC CTGCGCGCTT TCGACGGTGT GCGCCTCGAC GCGCAGAACT CGATCAGCGA GCGCGGCGAC GGTTACAAGT TCAACGGCTC GCTCGCTTTC GGTACCGGTT TCGCGGAAGA TCGCGGTCAC GTGATCGCGG CGTTCAGCTA TGCCCAACAG GATGCAGTCA ACGGCAGCAG CCGCGACTTC TTCTTCGACA AGACGCCGTC TTCGTTCATC GGCACGGGCA CGTTCGTGCC CAGCGCCACC AATGCCCCGA CGGTCGATGC CGTCAGCGGC GTGTTCAGCG GCTATGGCGT CGGGGGCCTG CCGAACAACG TGAACCTGCT GAACCTCGGG TTCAACAACG ACGGCACGCT GTTCACCCAG ACCGGTGCGC GCAACTACAA GGGCGCGAAC GGCACCAACG GCTATCTCGT TGTCGGCAAC AACGTGCGCA TGCCGGTCGG GCAGCAGATC GACTTTGCGA ACGCACTCAA GCGCAAGACC GCATTCCTCA AGGCCGACTA CGACCTGACG CCTTCGCTGA CCGCCTATGG CCAGTTCATG TACGTCGATC TTTCGGTCCA CACCGCTTCC GGCGGCAGCC TTACCCAGTT CGGCACGCTC ACCACGGTTC CGGTGACCAA CCCGTTCATC CCCGATGATC TCAAGACGAT CCTCGCCTCG CGCCCCAATC CCAACGCCCC GTTCACCTGG AACGGCCGTT ATGTCGGCGT GCCCTACAAG AACTGGGACG AGAACTACGT CGTCGAGCAG TACATGGCTG GCCTGAAGGG CGATATCGCC AGCGGCTGGT CGTTCGACCT CTTCGCCTCG TACGACCAGT CGGTCCACGA CCAGCAGCTC AACGATGCGG TGATCAAGGG CCGGGTGCAG ACGCTGTTGA ATGCGGCAGA CGGCGGCGCG TCGATCTGCG CGGGCGGGTT CAACCCGTTC GGCGATGCCA ATGCCCGCTC GCTGTCGGAT GCCTGCGTCA ACTACATCAC CAAGACAGCC TTCTCGAAGG AGAAGCTCGC CCAGACCCAG GCCCAGCTCC AGGTCAACGG CAAGCTCTTC GACCTCGGCG CGGGTCCGGC GCAGATCGCG CTGGTCGCCG GCTATCGCAA GAACACCTAC TCCTATGTGC CTGACAGCGA TCTTGCCGCG CAGAACATCG AGGCGGTGAT CGCCTCGCAG CCTGCCTCGG GCCGTATTTC GGTCAAGGAA TTCGCGGCGC AGGTCGATAT TCCGCTGCTC GCCGACAAGC CGTTCGTGCG TGAACTGGGC ATCGGCGGCG CGATCCGTCA TTCGGACTAC TCGGTCACCG GCGGGGTCAC CAGCTACGAA GTCGACGCCC GCTGGCGTCC GGTCGATGCG CTGCTGTTCC GCGGCAGCTA CCAGCGCGCG GTGCGTGCGC CGAACATTGG CGAACTGTTC TCGCCGCAGA CCGGCACGCA GCTCGTCATC GGCACGCCTC CGGGCGCGCT GGGCGATCCT TGCGACGTTC GCTCCAACGC CCGCACCGGC GCCGATGGCG CGAAGGTGGC TGAGCTTTGC GTCGCGCAGG GCATCCCGCA GGCGGCCATC TCGAGCTACA CCTTCCCGAC CACGGCAACG GGCCAGCTCG TGTCGGGCAA CACCAACCTG ACGCCTGAGC GCGCCGATAC GTTCAACGTC GGCTTCGTAC TAAACTCGCC GTTCGAAAGC GGCCCGCTTG CCGACTTCAC GCTCTCGGTC GATTACTACA ATATAGCCGT GAAGAACGTG ATCTCGACGG TGCCCGGCCT GACCGTGCTG TCGAAGTGCT TCAACCTCGA CGGTTCGAAC CCGACCTACG ACAAGAACAA CCTCTATTGC GGCCTGCTGG TGCGCGACAA CAGCGGACAG CTCACGACGG TTGCGACGCC GTACCTCAAT CTCGGCGCGG TGAAGACCGA TGGTGTGGAA GCGCAGGTCA ACTGGACGGT GCCGGCGCGC TTCCTCGGCG ACGAGAGCGG GCGGTTCTTC GTCAACTCGG CAGTGGGCTG GCTGCACAAG TACAAGTTGC AGCTGCTGCC TGGCGCGGCC TATCTCGACT ATACCGGCAT CAGCAACGGC GCCGCCGGCG TCAGCAGCCT GCCGCCGCGC GCGACGCCCA AGTGGAAGGC CGTGACGAGC TTCGGCTACC GTTCAGATGC GGCAACGCTC GGCCTGCGCT GGCGCTACCA GAGCAAGATG GAGGACACGA CCTCGGTCCT GACGCCGGCG ACAGCGCAGG TGGGCGTGCC GGCATATGCG CTGTGGGACA TGTTCGGCTC TGTCCGGATC GCCAAGACCT TCGAGATCCG CGCGGGCGTG AACAACCTGT TCGACAAGGG ACTGCCGTTC GTCGCCAGCT CGCAGAACGG CACCGACGTG GCGCTTTATG ACCCGATCGG CCGCTCGTTC TACCTTGGCG CCAAGGTCAG CTTCTGA
|
Protein sequence | MRPREGLRLK WPVEEDDMHM RGTRLAISAS LLTVSFALAA PAFAQEAAQD KAAEAPVDAR EIVVTGSLIQ RPNNTAVSPI VSVGEAALKE TGQVNLQDAL NQFPSFTTAG NANTGGQGTG GRASINLHGL GTNRNLVLLD GKRLPLSDIN GNVDINILPE SIMGGVDVIT GGASAIYGSD AISGVVNFKT LRAFDGVRLD AQNSISERGD GYKFNGSLAF GTGFAEDRGH VIAAFSYAQQ DAVNGSSRDF FFDKTPSSFI GTGTFVPSAT NAPTVDAVSG VFSGYGVGGL PNNVNLLNLG FNNDGTLFTQ TGARNYKGAN GTNGYLVVGN NVRMPVGQQI DFANALKRKT AFLKADYDLT PSLTAYGQFM YVDLSVHTAS GGSLTQFGTL TTVPVTNPFI PDDLKTILAS RPNPNAPFTW NGRYVGVPYK NWDENYVVEQ YMAGLKGDIA SGWSFDLFAS YDQSVHDQQL NDAVIKGRVQ TLLNAADGGA SICAGGFNPF GDANARSLSD ACVNYITKTA FSKEKLAQTQ AQLQVNGKLF DLGAGPAQIA LVAGYRKNTY SYVPDSDLAA QNIEAVIASQ PASGRISVKE FAAQVDIPLL ADKPFVRELG IGGAIRHSDY SVTGGVTSYE VDARWRPVDA LLFRGSYQRA VRAPNIGELF SPQTGTQLVI GTPPGALGDP CDVRSNARTG ADGAKVAELC VAQGIPQAAI SSYTFPTTAT GQLVSGNTNL TPERADTFNV GFVLNSPFES GPLADFTLSV DYYNIAVKNV ISTVPGLTVL SKCFNLDGSN PTYDKNNLYC GLLVRDNSGQ LTTVATPYLN LGAVKTDGVE AQVNWTVPAR FLGDESGRFF VNSAVGWLHK YKLQLLPGAA YLDYTGISNG AAGVSSLPPR ATPKWKAVTS FGYRSDAATL GLRWRYQSKM EDTTSVLTPA TAQVGVPAYA LWDMFGSVRI AKTFEIRAGV NNLFDKGLPF VASSQNGTDV ALYDPIGRSF YLGAKVSF
|
| |