Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3042 |
Symbol | |
ID | 3916654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3255880 |
End bp | 3258405 |
Gene Length | 2526 bp |
Protein Length | 841 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640445822 |
Product | TonB-dependent receptor |
Protein accession | YP_498311 |
Protein GI | 87201054 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTCC TGGTCGCCGC GTCGACCATC GCGATCGGCA GCGTCGCGCT GGCCAGCGCC GCCCACGCCC AGTCGACCGG TTCGGTCGAC GTCGAAGAGG CGATCGTCGT GACCGGCACG CGCGCGGATG CCGCCGTCAA CGGCTTCAAG GCCCCCGAAA CCCCCAAGGC CAAGGCCGTC CTGACCCAGG AACTCGTCGC TCGGCAGAAC CCCGGCAAGG CGATCTTCGA CACGATCAAC ATCGTGCCGG GCGTCAATTT CACCAGCACC GACCCCTATG GCGCCGCAGG CGGCAACTTG CGCATTCGCG GCTTCGACGG CGCGCGCATC TCGGCCACGT TCGACGGCGT CCAGGTCAAC GATTCGGGCA ACTATTCGCT CTACACCAAC CAGCAGCTCG ACTCCGAACT GATCGAGCAG GTCAACATCA ACTTCGGCGC GACCGACGTC GACAGCCCGA CCGCGAGTGC TGCGGGCGGC ACCGTCAACT ACCGCACCCG CCTGCCCAAG GAAGAGCTTG GCGCCGCGAT CAACTATTCG CACGGCACCT TCAACTACAA CCGCGTGTTC GGCGTGATCG ACACCGGCGT GTTCACGCCC TTCGGCACCC GCGCGTTCTT CTCGGCCAGC GACACCAAGT ACGACCAGTT CCGCGGCCCA GGCGGCATCC ACAAGCAGCA GTACAACGTC CGCGTGTACC AGCCGATCGG CGAAAACGGT GATTTCGTGA GCCTCGCAGG CCACTACAAC GAGAACCGCA ACAACTTCTA TCGCCGCGTC GGCATCAACG ACATGCGCAC CCTGCTGGGC TCCGCCACGA TCCCGGCGTC GGCCAGCATC ACTCCCGCTT CCCCGCTCGA TCTCGGCAAC CTGACCGACG CGCAGCAGGA GACCATCTTC AACTTCAACA ACGATGCCAC CTGCACGCTG CCCAGCAGCA GCGGTGGTGC GGGCCAACAG TCGGACGCGT CGTCCTGCGC CAACTACTAC AATACCTCGA TCAACCCTTC GAACACCGGC AACATCCGCT TCAACTCGCG CTTCGCGCTG AGCGAGAAGC TGATCGCGAC GCTTGATGCC AGCTATCAGT ATGTCCTCGC CAATGGCGGC GGCACCTCGG TCTTCGCGGA ATCCGACGCC AACCGCAGCG TCTCGGGCGT CTACAGCCGC CAGGGAACGC GCATCGGCGG TGGCGTCGCG AGCGGCGTCG ACATCAATGG CGATGGCGAT ACGGATGATC TCGTGCGTCT GCTGTTCCCG TCGAACACCC GCACGCACCG TCTCGGCGCC ACGCTTTCGC TGCGCTACGA AGCGTCGCCG GAAAATACCT TCCGCGTTGC CTATACCTGG GACCGCGCCA AGCACCGCCA GACCGGCGAA GCGGGTCGGC TCGACCAGCT CGGCAACCCG CTGAACGTCT TCGGCGGCAT CGGCGACGAT GATTCCGCGG TGAAGGATGC GGCCGGCAAC GTCCTCCAGA AGCGCGACCG CCTGTCCTAC GCGATCCTGC ACCAGGTCGC GGGCGAATAC ATCGGCAAGT ACTTCGACAA TACGCTGACC GTGCAGGCCG GCGTGCGCGC GCCGTTCTTC CGCCGCAACC TGACCAACAA CTGCTGGACT ATCGCCGGTA GCTCGAACGA CGCCTACTGC ACCTCGGAAA GCGATGCCGT GGTCGAAGCC AAGTACCCGG CCTATGCCGC GCCCTACGCC GCCCGCAAGG TGGCCTACAG CGCAGTCCTG CCCAATGCCG GCTTCGTCTA CAAGATCACG CCGCAGGTCA ACGTGTTCGG CAACTTCAGC CAGGGCTTCT CGGCCCCGCG TACCGACAAC CTCTACGGGT TCGATGGCGT GAAGATCCAG CCGACCTCGC TGGTGAAGCC GGAACGCACC AACAGCTTCG ACCTCGGCGC GCGCTATACC AGCCGCGTCG TCCAGGCCCA GGCCAGCGCA TGGTACATCG GGTACAAGAA CCGCATCATC TCGTCGCAGG TGCTGCTCGA GGACGGCAGC TCGCTCAACC TCGACCGCAA CGTCGGTCGC GTGCGCAGCT ACGGCTTCGA CGCCAGCGTC GCGGTGCGCC CGGTCGACAT GTTCTCGCTC TACACCTTCG CGTCCTACAC CAACGCGAAG CTGAGGGACG ACGTGGTCTC GCCCGCCGGC GCGATCCTCT CGCCGACCAA GGGCAAGTTC GTGGCCGAAA CGCCGAAGTG GCAGGTCGGC GGCCGCGCCC AGTTCGACTA CGAGCCCGTC TCGATTGGCG CGCAGGTCAA GTACGTGGGC GACCGCTTCC TGACCGACAT CAACGACGTG ATCGCACCGT CCTACACCAC GGTCGATCTC GATGCGCGCG TCAACCTCGG CAAGGTCAAC GACAAGGGTT CGATCTACCT GCAGCTCAAC GTGATCAACC TGTTCGACAA GTTCTACATC GGCAACCTGT CGACGCAGGC TGCCGCCTCG AACAACCCGC AGGTCGAGTT CGGCTCGCCG CGCACTTTCG TCGGCTCGAT CCACTTCGAG TTCTGA
|
Protein sequence | MKFLVAASTI AIGSVALASA AHAQSTGSVD VEEAIVVTGT RADAAVNGFK APETPKAKAV LTQELVARQN PGKAIFDTIN IVPGVNFTST DPYGAAGGNL RIRGFDGARI SATFDGVQVN DSGNYSLYTN QQLDSELIEQ VNINFGATDV DSPTASAAGG TVNYRTRLPK EELGAAINYS HGTFNYNRVF GVIDTGVFTP FGTRAFFSAS DTKYDQFRGP GGIHKQQYNV RVYQPIGENG DFVSLAGHYN ENRNNFYRRV GINDMRTLLG SATIPASASI TPASPLDLGN LTDAQQETIF NFNNDATCTL PSSSGGAGQQ SDASSCANYY NTSINPSNTG NIRFNSRFAL SEKLIATLDA SYQYVLANGG GTSVFAESDA NRSVSGVYSR QGTRIGGGVA SGVDINGDGD TDDLVRLLFP SNTRTHRLGA TLSLRYEASP ENTFRVAYTW DRAKHRQTGE AGRLDQLGNP LNVFGGIGDD DSAVKDAAGN VLQKRDRLSY AILHQVAGEY IGKYFDNTLT VQAGVRAPFF RRNLTNNCWT IAGSSNDAYC TSESDAVVEA KYPAYAAPYA ARKVAYSAVL PNAGFVYKIT PQVNVFGNFS QGFSAPRTDN LYGFDGVKIQ PTSLVKPERT NSFDLGARYT SRVVQAQASA WYIGYKNRII SSQVLLEDGS SLNLDRNVGR VRSYGFDASV AVRPVDMFSL YTFASYTNAK LRDDVVSPAG AILSPTKGKF VAETPKWQVG GRAQFDYEPV SIGAQVKYVG DRFLTDINDV IAPSYTTVDL DARVNLGKVN DKGSIYLQLN VINLFDKFYI GNLSTQAAAS NNPQVEFGSP RTFVGSIHFE F
|
| |