Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1559 |
Symbol | |
ID | 3917234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1615327 |
End bp | 1617519 |
Gene Length | 2193 bp |
Protein Length | 730 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640444299 |
Product | TonB-dependent receptor |
Protein accession | YP_496833 |
Protein GI | 87199576 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.224366 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGATGA AGAGGGTAGC ACACCTGCGG CTCGTGGCCG TACTGGGAAC GAGCGCGCTG GCACTTCTCG CCGGTGGACA GGCTTACGCG CAGGAAGCAG TGGCACCGCA GGAGCAGGCC ACCGAGGCAT CGGTGTTCGG CGACATCGTC GTCACCGCCA CCAAGAAGGC GAACGCGCAG AACGTGCAGG ACGTGCCGAT TGCCGTCACC GCTTTCGGCT CCGAACAACT CGAAAGCCAG CACGTGCGCA CGCTCGACAA CCTGGGCTAT AGCGCACCCA ACGTGCAGCT CGACGACGTC GGCACCGCAC CAGGCTTTGC CAATTTCTCC ATCCGCGGCC TTGGCATCAA CAGCTCGATC CCCTCTATCG ATCCAACCGT CGGCGTATTC GTCGACGGCG TCTACATGGG CATCAGCGCC GGCATCCTGT TCGACACCTT CGACCTCGAA GGCGTCGAAG TGCTGCGCGG CCCGCAGGGC CTGCTGTTCG GCCGCAACGT GACGGGCGGC GCGGTCGTGG TGCGCACATC CACCCCCGGC AACGACCTCA AGATCGAAGG GCGCCTAGCT GCGGAAACCG GCCTCAACAA GATCGCCAGC GCAGTGGTCT CCGGGCCGCT GATCAAGGAC AAGCTGGCCG CCAAGGTCGC GGTCTACTAC AATGACGACG ACGGCTGGTT CACCAACAAG TTCAACGGCA ACAAGAACTT CGGCGCTTCC AAGACGCTGA TCGTGCGCTC CGCCCTGCGC TACACGCCGA CTTCCGAGGT TGAGGCGGTG GCCCGTTACG AACATGGACG CGTGCGCGGC GACGGCGCGG TAGTGAGCAA CTTCGGCCTC TTCCGCCGCG AGAGCTTCGG TATCAGCGTC GACGAGGAAG GCGTTACCCG AAACGACTGG AACCAGGCGT CGCTGGAACT GAACATCGAC ACCGATTTCG GCAACGGCAA GATCACCAAC ATTGCCGCAT ACCGTGACTT CAAGGGCTTC GTGACGAGCG ATATCGACTC CTCGCCCAGC TACACCTTCC ACGCAGACAC GCTCACCCGG CAAGACCAGT GGAGCAATGA ACTTCGCTAT GCCGGCACGT TCGGTGCGCT GGAGCTGACC ACCGGGCTCT ACTACTTCCA GCAGGATATC GACTACATCG AACTGCGCCG TCTTGCGGCG GGAGCGCTCA AGATCTCCGG CGGCGGCAAG CAGCACCAGA AGACGTTCGG CGCCTTCGTT TCGACCGACT GGCACGTCAC CGATACAGTG ACGCTGAACG GCGGCGTCCG CTATTCGTGG GAGCGCAAGA GCGCCAAGGT TGCAAATCTC GCCGGCAACC TGTGCGACCC GATCGTCACG AAGACCTGCA GCACCTATGG TTTCTCCAAC AGCAAGAGCT GGAGCGATCC GACCTTCCGG GTGGGCGCGC AATGGCAGCC GACGAACGAG ACCCAGGCCT ATGCATTCTT CGCCCGCGGT TTCCGCAGCG GCGGCTACAA CTTCCGCAAT GGCAATGCCG CAGAAGCCCC GGGGCCGTTC GACGCCGAGA AGCAGAACTC GTTCGAGGCA GGCATCAAGC AGGACTTCGG CCGCACGCTG CGCCTGAACC TTGCGGCATT CCACAACACC GTGCTTGGCC TGCAGCGCGA GATCATCCGC CCGGTCCTGC CGATCGGCAC GACGCAGGTC ATTCGCAATT CCGCGAACGT CCGCATCCAG GGCATCGAGG CGGAAGCCGT GCTGCGCGTT GGCGACCACC TCACCTTCAA CGGCCAGTTC GGCTATACCA AGGCCAAATA CACGAAGATC CTCTACGACC TGACGGGCGA CGGCGCGATC AATGCCAAGG ACTTTGCCCT CAAGCCGCCG CGCCTGGCAC CCTGGACCTA TGGGGTGAGC GCCAATTTCG CGCATGAAGT GACAGGTGGC GGCGAAGTGA CGGCGCGCCT CGGCTATGCC CATCGCGATG CGGCCTGGTC GAACGATGCC AACACCGGCC TGCTGAGCAA GGCAGACATG GTCGACGCGA ACCTCTCCGT AGAGACGGCC GGTCGCAGGT GGAAGTTCTC GGTCTACGGC ACCAACCTGC TGAACGACCA GACCGAGGGC AACGTCTCGA GCCTCCCGTT CTTTGCCGGA TCGACCTTCG CGTCGATCAA CAAGGGACGT GTCGTCGGGG CCGAAGTTCT GTTCCGCTAC TGA
|
Protein sequence | MGMKRVAHLR LVAVLGTSAL ALLAGGQAYA QEAVAPQEQA TEASVFGDIV VTATKKANAQ NVQDVPIAVT AFGSEQLESQ HVRTLDNLGY SAPNVQLDDV GTAPGFANFS IRGLGINSSI PSIDPTVGVF VDGVYMGISA GILFDTFDLE GVEVLRGPQG LLFGRNVTGG AVVVRTSTPG NDLKIEGRLA AETGLNKIAS AVVSGPLIKD KLAAKVAVYY NDDDGWFTNK FNGNKNFGAS KTLIVRSALR YTPTSEVEAV ARYEHGRVRG DGAVVSNFGL FRRESFGISV DEEGVTRNDW NQASLELNID TDFGNGKITN IAAYRDFKGF VTSDIDSSPS YTFHADTLTR QDQWSNELRY AGTFGALELT TGLYYFQQDI DYIELRRLAA GALKISGGGK QHQKTFGAFV STDWHVTDTV TLNGGVRYSW ERKSAKVANL AGNLCDPIVT KTCSTYGFSN SKSWSDPTFR VGAQWQPTNE TQAYAFFARG FRSGGYNFRN GNAAEAPGPF DAEKQNSFEA GIKQDFGRTL RLNLAAFHNT VLGLQREIIR PVLPIGTTQV IRNSANVRIQ GIEAEAVLRV GDHLTFNGQF GYTKAKYTKI LYDLTGDGAI NAKDFALKPP RLAPWTYGVS ANFAHEVTGG GEVTARLGYA HRDAAWSNDA NTGLLSKADM VDANLSVETA GRRWKFSVYG TNLLNDQTEG NVSSLPFFAG STFASINKGR VVGAEVLFRY
|
| |