Gene Information |
Locus tag | Saro_3520 |
Symbol | |
ID | 5077669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 131472 |
End bp | 134345 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640481244 |
Product | TonB-dependent receptor |
Protein accession | YP_001165906 |
Protein GI | 146275746 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01782] TonB-dependent receptor |
|
|
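The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the listed gene length includes a 3 bp stop codon; the record does not state either convention, but both are consistent with the values shown. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 131472
end_bp = 134345
stop_codon_len = 3  # assumption: the gene length includes the stop codon

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Every remaining codon encodes one amino acid.
protein_length = (gene_length - stop_codon_len) // 3

print(gene_length)     # 2874
print(protein_length)  # 957
```

Both results match the listed Gene Length (2874 bp) and Protein Length (957 aa).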
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAGTC ACGTTTCGCG CGTTTCGCTT TTGAAGATGG CCCTGCTGGC CGGTGGCGCA TTTGCCGCGC CGAGCCTTGC TTTCGCCCAG GACGCGACCG CCGCTCCCGA ACCCCAGGCC GTAGCCGAAG ATGCAGCGCC CCCGACCGAC GCCATCGTCG TCACCGGCTT CCGCCAGTCG CTCCAGGCTG CGATCAACGT CAAGAAGAAC GCCGTCGGCG CGGTCGACGC CATCGTTGCC GAAGACATTG CAAAGTTCCC CGACCAGAAC CTTGCCGAAT CGCTACAGCG CATTCCCGGC ATCTCGATCT CGCGCGATGC CGGCGAAGGC CGCGCGATCA CCGTGCGCGG TCTGTCCAGC CAGTTCACCC GCGTCCGCGT CAACGGCATG GAAACGGTCG CCACCTCGAC CGACGGCGCC TCGGCGAACC GCGACCGCGC GTTCGACTTC AACGTCTTCG CTTCGGAACT GTTCAGCTCG ATCGTCGTGC ACAAGACCGC CGAAGCCAGC CTCGACGAAG GCTCGCTCGG CGCGGTGGTG GATCTCAACA CCGGCAATCC GCTGGGTGGC AAGGCCGGCT TCACCGGCGT CGCATCGGTC GTCGGCACTT ACAACGACCT GTCGGACTAC GTCGGCCCGC GCCTTGCCGG GCTGCTGAGC TGGCGCAACG ACGCCGGCAC TTTCGGCATC TCGCTGTCCG CCGCGTACCA GAAGACCCGC GTGCTGGAGC TTGGCAACAA CTCGGTCCGC TGGGCCCAGG CGCGCTTCGA CTCGGTCGAT GGGACCCCGT GCTTCACCCG TCCCAACAGC GGCGGCACTT ACGTCGAGAG CGATGCCTGC GACGAGGCCG CCCTCGCCTT CCACCCGCGC ATCCCGCGCT ACGGCGAAGT GAAGCATGAC CGCGAGCGCC TCGGCATCAC CGGCTCGGTC CAGTTCGCGC CAACCGATGC GACGAAGTTC TCGATCGACG GCCTGTTCTC CCGCTTCGAC GCGAAGCGCG AGGAGCAGTG GGGCGAAGTC CTGCTGCGCT CCAACGAGCG TTCGATCGAC GTGGTCGACC CGGTCTACGA CGACAACGGC AACATGGTCT CGGCCACGCT GAACGATGCC TGGGTGCGTA CCGAGCACTA CCTGCGCAAG AGCCGCACCG AATTCTATCA GGTGGGCGGC ACCTGGGACC AGGACATCGG CGACAGCCTG CGCTTCACCC TGCTCGGCGG CTTCTCCAAG TCGAATGCCG AAATCCCGGT CGAGACCACG ATGATCTTCG ACGACCGCGA TGCCCAGGGC TATTCGTTCG ACTACACCGA CATGAAGCAC CCGAAGCTGA CCTTCGGCAC CAGCGTCACC GATCCGGCGA ACTTCCAGCT CGCCGAAATC CGCGACCGCC CGTCGAGCGT CGTGAACCGC TTCAAGACCG TGCAGCTTCG TACCGAATGG GACGTGGCCG AAGGCTTCAC GATCAAGGCG GGCGGCATGT GGCGCCGGTT CAACTTCGAT ACCGAAGCCT TTACCCGCGA CACCGCGGTC TGCGGCAACG GCGGCGTCGA CCGCATCTTC GGCACGATCA ATTGCTCCGC CTCCTCGGTG TTCGGCCCCA CCGCTGTCTA TGGCATCCCG GTGACGGCGG CGCTTGGCCA GGTGTTCAAC CTCGGCAACG CGGGTCAGCC CGCGGGCAAC ACCAATTCGT GGCTGGTGCC CAATCTCGAC GCCACGACCG CGCTGACCAA GCTCTACGAA CGCCCGCTGG CTCTCGATGC CGGCAACAAC CGGGGCGTGC AGGAGACCAC CAAGGGCGGC 
TATCTCCAGT TCGACGCCAA GGGTGAGCTG CTGGGCCTGC GCTATGCGCT CAATGCAGGC ATGCGGTACG TCAAGACCGA CCAGTCCTCC TACGGTCTCG TCGGCTCGGT GCAGACCACG GTCAAGCGCA GCTACGAAGA CTGGCTGCCC TCGGCCAACC TCGCGCTCTA CCCCACCGAG AACGTGATCG TGCGCGCCGC CATCGCCGAC GTGATGACCC GCCCGACGCT CGGTTCGCTG ACGCCCGGCG GCTCGGCCGA CGGCTTCAAC TACCGCGTCA GCACCGGCAA CCCCTATCTG GAACCCTATC GCGCCACCAA CTACGACCTC GGCGTGGAAT GGTACTTCGC TCCCCAGGCC GTGCTTTCGG CGGCATGGTT CAAGAAGGCG GTGCACACCT TCACCCGCAG CGCCTCGATC ACCGGTCTGA CCTACGCCCA GACCGGCGCG CCGATTTCCT CGCTCAGCCC GAACTCGCCG GCTGCGCTGA ACCCGTCGCA GTTGCTCGAG GATCGCTGGA CGCTGGCAAC CACGGTCAAC GGCGAGGGTG CCACGCTCAA GGGCTGGGAA TTCGCCGCGC AGGTGCCCTT CAAGATCTTT GCCGAAGGCT TCCTCGGGAA CTTCGGCGTG ATCGCCAACG CCACCTTCAT CTCGTCGAGC GCGGAATACG AACTCCAGGG CCCGATCACC GTCGCCCGCC TGCCCAACGG CAACCTCGGC CCGCTCAACA ACGTGACGCT CACCAGCACG CTCGCGAACG TGTCGAAGCG CGCCTACAAC GGCACGCTCT ACTACGATGA CGGCCGCTTC TCCGCGCGCG TCATGGGCAC CTACCGCAGC GCCTACCACG AAGGCGCCAG CGGCACCGGC AACCTGCTCG AAGGCTACGG CTCGATGTGG AACGTCGACG CTTCGGTTCG CTACAAGGTG AACGACTGGC TGGAAGTCTC GGTCGAGGGC AACAACCTCC TCGACACCTA TCGCTACCGC TACACCGACA TAGAAACGCA GCGGAATTAC GAGAACAATC ACTTCGGGCG CAACATCCTG GTGGGCGCGC GCCTCAAGTA CTGA
|
Protein sequence | MPSHVSRVSL LKMALLAGGA FAAPSLAFAQ DATAAPEPQA VAEDAAPPTD AIVVTGFRQS LQAAINVKKN AVGAVDAIVA EDIAKFPDQN LAESLQRIPG ISISRDAGEG RAITVRGLSS QFTRVRVNGM ETVATSTDGA SANRDRAFDF NVFASELFSS IVVHKTAEAS LDEGSLGAVV DLNTGNPLGG KAGFTGVASV VGTYNDLSDY VGPRLAGLLS WRNDAGTFGI SLSAAYQKTR VLELGNNSVR WAQARFDSVD GTPCFTRPNS GGTYVESDAC DEAALAFHPR IPRYGEVKHD RERLGITGSV QFAPTDATKF SIDGLFSRFD AKREEQWGEV LLRSNERSID VVDPVYDDNG NMVSATLNDA WVRTEHYLRK SRTEFYQVGG TWDQDIGDSL RFTLLGGFSK SNAEIPVETT MIFDDRDAQG YSFDYTDMKH PKLTFGTSVT DPANFQLAEI RDRPSSVVNR FKTVQLRTEW DVAEGFTIKA GGMWRRFNFD TEAFTRDTAV CGNGGVDRIF GTINCSASSV FGPTAVYGIP VTAALGQVFN LGNAGQPAGN TNSWLVPNLD ATTALTKLYE RPLALDAGNN RGVQETTKGG YLQFDAKGEL LGLRYALNAG MRYVKTDQSS YGLVGSVQTT VKRSYEDWLP SANLALYPTE NVIVRAAIAD VMTRPTLGSL TPGGSADGFN YRVSTGNPYL EPYRATNYDL GVEWYFAPQA VLSAAWFKKA VHTFTRSASI TGLTYAQTGA PISSLSPNSP AALNPSQLLE DRWTLATTVN GEGATLKGWE FAAQVPFKIF AEGFLGNFGV IANATFISSS AEYELQGPIT VARLPNGNLG PLNNVTLTST LANVSKRAYN GTLYYDDGRF SARVMGTYRS AYHEGASGTG NLLEGYGSMW NVDASVRYKV NDWLEVSVEG NNLLDTYRYR YTDIETQRNY ENNHFGRNIL VGARLKY
|
| |
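The gene and protein sequences above can be related with a short script. This is a minimal sketch, not the database's own pipeline: it assumes plain Python with no external libraries, and the helper names (`clean`, `gc_content`, `translate`) are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so the standard assignments are used here. The sequences are printed in blocks of 10 characters, so whitespace is stripped first.

```python
# Codon assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases of the gene sequence, as printed in the record.
print(translate("ATGCCTAGTC ACGTTTCG"))  # MPSHVS
# Last 12 bases of the gene sequence, ending in the TGA stop codon.
print(translate("CTCAAGTA CTGA"))        # LKY
```

The two fragments reproduce the first six and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the listed GC content of 65%.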