Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3525 |
Symbol | |
ID | 5077674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 139955 |
End bp | 142255 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640481249 |
Product | TonB-dependent receptor |
Protein accession | YP_001165911 |
Protein GI | 146275751 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTATCG TCGCCGTGCT GCTCGCCAGC ACTTCGCTTG TCGCCGCCCC GGCCTTCGCC GCCGAAGAAG CCGCACCTGC ACCGGTCGCC CCTGAAGCCA TTTCCGATGC GGCCGCCGCC GAAATCGTCG TCATGGGCCA GGGACAGACC CGGCAGGTGC AGGAACTTTC CACGCAGGAG CTGACCATTC TCGCCTCGGG CACCAGCCCC TTGAAGGCGA TCGAGAAGCT GCCCAGCGTC AATTTCCAGT CCGCCGACGC CTTCGGCACC TATGAATGGT CGACCCGCGT CACCATTCGC GGCTTCAGCC AGAACCAGCT TGGCTTCAAC ATCGACGGCA TCCCGCTGGG CGACATGTCC TATGGCAACG CCAACGGCCT GCACATCAGC CGCGCGATCA GCCCCGAGAA CATCGGCGTC ACCCGCGTCA GCCAGGGCTC CGGCTCGATC ACCGCGCAGT CGACCAACAA TCTCGGCGGC ACGCTCGAGT TCTTCTCGAT CGATCCCAAG GACGCCCTCG GCGTTACCGC CAGCGCATCC TACGGTTCGG AAAACACCTG GCGCGGCTTT GCGCGCATTG GCCTGGGCAC CACCGATGGC GCCCGCGCCT TCGCTTCGGT CCAGTACCAG GACGGCGAAA AGTGGAAGGG CGACGGCAAG CAGCGCACGC TGATGGTCAA CGCCAAGGGC GTGCTCCCGC TGGGCGGCGG TACCGAACTC GACGGCTACG TCAGCTATTC CGACCGCGCC GAGCAGGACT ACCAGGATCT CAGCCTCGCG ATGATCCAGC GTCTCGGCTA TGACTGGGAC AACTTCGGCC CCTCGCGCTA TGCCGAAGCG GTCCGCGTGG CCGACATCGC CGCCAACCGC GGCGACACCG GCACCGCCCC GCTCAACGCC GCGGCGGGCA CCACCTACCC CAGCCCGATC GCTTCGGCGG ACGATGCCTA TTACGATGCC TCGGGCCTGC GCAAGGACAC CCTCGCCTCG CTCGGCCTCA CCACGCCGCT GGGCGATGCG CTGACCTTCA AGGTCAAGGG CTACTACCAC GAGAACGACG GCCAGGGCAC GTGGGGCAGC CCATACGTCA ACAGCCCCAC CGGCGTGCCG ATGGCGCTGC GCACCACCGA ATACGACATC AAGCGCAAGG GCGTGTTCGC CGCGCTTTCG GGCACGTTCG GCATCAACGA ACTCACCGTC GGCGGCTGGT ACGAGAAGAA CGACTTCATC CAGTCGCGCA AGTTCTATGC CTATGAGAGC CGGACCAACC CCGGCCGCGA CCACCTGAAG TTTCAGCACA ACCCGTTCTA CACGCAGTGG TCCATCGCCT TCGAGACCGA CACGCTGCAA TACTACGTCT CCGACGACGT CGACCTCGGC GATCTCAAGG TGAACCTTGG CTGGAAGGGC TATTCGGTCG ACACCAACGC GTTCGCGCTG GTCAACGTCA GCGGCCTCGC CACCGGCGAC ATCAAGGTCG AGGACTGGTT CCAGCCGCAC GTCGGCCTGA ACTACAAGCT TGGTGACGGG CTCGAGGCTT TCGCGGGCTT CACCCAGGTG ACGCGCGCCT ACCAGGCATC GGCCACCAGC GGCCCGTTCT CGACCACGCA GGCCGGCTTC AATGCGATCA AGGACAAACT CAAGCCCGAA AGCTCGGACA CCTGGGAGGC GGGCCTGCGC TACAACACCG GCGTCATCAA CGCCTCGCTC GCAGGCTACT ACGTCAACTT CCGCGACAGG CTGCTGGTGA TCCCGACTTC GGTCGGCGTC GTCGGCTCGG CCAACGTGCT GCAGAACGTC GGCTCGGTCC GCGCACTCGG CATCGAGGCG GCGGTGGACG TGAAGCTCCC CGGCGGCTTC GGCGCGTTCG CTTCGTACAG CTACAACGAC ACGACCTACC GCGATGACGT GACCATCACC GCGGGCGGCA CCACGGTGGT CCGCGCGACC GCTGGCAAGA CCGTCGTCGA CACGCCAAAG CATCTCCTGC GCGGCGAACT GTCGTATGAC AGCCAGACCG TGTTCGGCCG CGTCGGGGTC AACTACATGT CCAAGCGCTA CTTCACCTAC CTCAACGACC AGTCGGTCCC CGGCCGCGCG CTGGTGGACG CGACCATCGG CTACCGCCTC GACATCGGCC AGCGCCAGCC GGTCGAACTG CAGCTCAATG CCGTGAACCT GTTCGACAAG CGCTACGTCG CCACGATCGG GTCCAACGGC TTCGGCTTCA GCGGCGACAA CCAGACCCTG CTCGCGGGCG CACCGCGCCA GGTCTTCGTC ACGCTCAAGG CGGGGTTCTG A
|
Protein sequence | MRIVAVLLAS TSLVAAPAFA AEEAAPAPVA PEAISDAAAA EIVVMGQGQT RQVQELSTQE LTILASGTSP LKAIEKLPSV NFQSADAFGT YEWSTRVTIR GFSQNQLGFN IDGIPLGDMS YGNANGLHIS RAISPENIGV TRVSQGSGSI TAQSTNNLGG TLEFFSIDPK DALGVTASAS YGSENTWRGF ARIGLGTTDG ARAFASVQYQ DGEKWKGDGK QRTLMVNAKG VLPLGGGTEL DGYVSYSDRA EQDYQDLSLA MIQRLGYDWD NFGPSRYAEA VRVADIAANR GDTGTAPLNA AAGTTYPSPI ASADDAYYDA SGLRKDTLAS LGLTTPLGDA LTFKVKGYYH ENDGQGTWGS PYVNSPTGVP MALRTTEYDI KRKGVFAALS GTFGINELTV GGWYEKNDFI QSRKFYAYES RTNPGRDHLK FQHNPFYTQW SIAFETDTLQ YYVSDDVDLG DLKVNLGWKG YSVDTNAFAL VNVSGLATGD IKVEDWFQPH VGLNYKLGDG LEAFAGFTQV TRAYQASATS GPFSTTQAGF NAIKDKLKPE SSDTWEAGLR YNTGVINASL AGYYVNFRDR LLVIPTSVGV VGSANVLQNV GSVRALGIEA AVDVKLPGGF GAFASYSYND TTYRDDVTIT AGGTTVVRAT AGKTVVDTPK HLLRGELSYD SQTVFGRVGV NYMSKRYFTY LNDQSVPGRA LVDATIGYRL DIGQRQPVEL QLNAVNLFDK RYVATIGSNG FGFSGDNQTL LAGAPRQVFV TLKAGF
|
| |