Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2601 |
Symbol | |
ID | 3917016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2809891 |
End bp | 2812260 |
Gene Length | 2370 bp |
Protein Length | 789 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640445360 |
Product | TonB-dependent receptor |
Protein accession | YP_497871 |
Protein GI | 87200614 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.889408 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATT GCGTGAAGTT CGCGTGCTCC GCTTCGTTGC TGGTCCTGGC GATGCAGTCT TCCGCCGTCG CGGCGCAGGA CGTGCAAGGC AATGCCGGCG TTGCTGAAGA GACGACCCCG GTCTTCGGCG ACATCGTGGT GACGGCCAAC AAGCGCCAGG AAAACGCCCA GAAGGTTCCG ATCGCGATCA CCGCCTATTC GGGCGATCAG CTCAAGGCGC TGGGCGTTAC CGATGCGACG CAGATCACCC AGCAGGTGCC AGGGCTCCAG CTCAACGCCT GGTCGCCGAA CGTCACGATC TTCAACTTGC GCGGCATTTC GCAGAACAAC TTCACCGACT ACCTCGAGGC GCCGATCGCG GTCTATGTCG ACGATGCCTA TATGGGGTCG ATCAACGGGC TTTCGGGGCA ATTGTTCGAC GTGCAGCGCG TCGAGGTGCT GCGCGGGCCG CAGGGGACGC TTTTCGGTCG CAATGCAACC GGCGGCCTGA TCCATTACCT GTCGACCGAC GCCAGCAAGG CGGAGTTCAA CGGCTACCTC ACGGCGAGCT ACGAGCGCTT CGACCGGCGC GCGCTCGAAG GCGCGGTCGG TGGGGCGCTG GCGGACGGCA TTCGCGCGCG CGTTGCCGGG CGCGTGGTCA AGGCCGACGG CTACATCAAG TCGGCCGCGG CATTGCCGGG CGTGTTCGAG GCCAACGGGC AGGATCTGGG CAGCGAGAAC GGCTGGGCCC TGCGCGGCAC GATCCAGGCC GATCTCGGCC CCGACGGCAA GCTTGACCTG TGGGTCAAGC ACAGCGAGGA CAACGACGTC GCGACCGGCG GTTATGTCTT CGACAACTGC AACCTGCAGG ACAACGGTTA CTGCGGCACC GACGCGGCGG GGCTGGGCAA TGGCAGCGGC GGGGTCATCA ACGGCATCAC CGGCGAACCC GCCAGCCCCT TCCAGAACTT CAGCGACACG CCCGGTGTGT TCAACCGCAA CACCAACATC TACCAGGGCA AGCTGACCTA CGACCTGGGC GGGGTGAACC TGACCGCGAT CACCAACTAC ACCGACCTCA GGAAGGATTA CCAGGAAGAC GGTGACGCCC TGCCGGTGGA AGTCATCGTG TTCCGCACGA ATGCCCGCTA CCGGCAGTTC AGCCAGGAAC TGCGCCTGGC GGGCGAAAGC GAGCGCTTCC GTTGGCAGGC GGGCGCCTAC TACCTCGACA TGAAGATCAA GGGCGGCATG GACACGGTGG GCGCGCCGGC CATCGGCGCC GCGCTTGCGG CGGGCCTGCC CGGCGTCGCG CCGACGATTG CCGAGACCTA CAATCTGCAT TCGAAGAACT GGTCGGTGTT CGGCCAGGCG GAATACGACC TGTCCGACAA GCTCACCGTG ATCGGCGGCT TGCGCTATTC CAAGGATACC AAGACGGTCG ACTACCGGTC GGCAGTGGTG GAGGGCGCCG CCTCGTCGCT GATCGCCACG GACGAGACGT TTTCGGCCAC GCTGCCCGGC GCGGACCGCA TTTCGGACGG CGACATCGCC GCGCGCGTCA CGCTGAACTA CAAGCCCGCG GACGATACGC TGGTGTTTGC TTCGTGGAAC CGCGGCATCA AGGGCGGCAA CTTCACGCTG AACGGTTATG TCACCGCGCA AACCTTCCAG CATCGTCCGG AAACGCTCAA TTCGTTCGAG GCGGGCGTGA AGTGGTCGAA CCCCTCGCGT ACGCTGCGCG TCAATGCCAC GGCCTATCAC TACATCTACA ACGACTATCA GGCTTTTGCG CTGATCGGCG GCGTGCCGCA GGTCGGCAAT AGCGACGCCA ACGCGACGGG CTTCGAGCTG GAAACCTTCT TCCAGCCGAC CGACCACCTG AATATCAACC TGGGCGCGAC GTGGGAGCGT ACCCATGTCG ATACCGTCCA GACCGCCGGA TCGCAGTTCC TGTCGGTTCT GGTGCCGGGA GCGTCGGTGC CCCAGTATTG CACCGACCAG AACGACGGCA CCTATTTCTG CGACTACCCG ACCAAGTCGG TCAGCGGCGC GCAGTTCCCC AACGCGCCGA AGTTGAGCCT GAACTATGTC CTGCGCTACA ACGTCGATGC CTTTGGCGGC AATGTCGTCG CGCAGGTCGA CGGCGTCTGG TACGACAAGC AGTTCCTCGA AGTCACCAAT GGCCGGTCCT CGATCCAGCC GGCCTACAAC GTGACCAACG CCTCGCTGAG CTGGACGTCC GACGACGATC GCCTTTCGGT GCAGGTGTTC GGCCGGAACG TCTTCGACAA GGCCTATCGC GCCTATGCGC TCAACCTCGG ACCGCTCGGC ACGACCTCGG TCTACGCCAA GCCCGCCACC TATGGCGTCA GCGCCACGGT CAAGTGGTAG
|
Protein sequence | MKNCVKFACS ASLLVLAMQS SAVAAQDVQG NAGVAEETTP VFGDIVVTAN KRQENAQKVP IAITAYSGDQ LKALGVTDAT QITQQVPGLQ LNAWSPNVTI FNLRGISQNN FTDYLEAPIA VYVDDAYMGS INGLSGQLFD VQRVEVLRGP QGTLFGRNAT GGLIHYLSTD ASKAEFNGYL TASYERFDRR ALEGAVGGAL ADGIRARVAG RVVKADGYIK SAAALPGVFE ANGQDLGSEN GWALRGTIQA DLGPDGKLDL WVKHSEDNDV ATGGYVFDNC NLQDNGYCGT DAAGLGNGSG GVINGITGEP ASPFQNFSDT PGVFNRNTNI YQGKLTYDLG GVNLTAITNY TDLRKDYQED GDALPVEVIV FRTNARYRQF SQELRLAGES ERFRWQAGAY YLDMKIKGGM DTVGAPAIGA ALAAGLPGVA PTIAETYNLH SKNWSVFGQA EYDLSDKLTV IGGLRYSKDT KTVDYRSAVV EGAASSLIAT DETFSATLPG ADRISDGDIA ARVTLNYKPA DDTLVFASWN RGIKGGNFTL NGYVTAQTFQ HRPETLNSFE AGVKWSNPSR TLRVNATAYH YIYNDYQAFA LIGGVPQVGN SDANATGFEL ETFFQPTDHL NINLGATWER THVDTVQTAG SQFLSVLVPG ASVPQYCTDQ NDGTYFCDYP TKSVSGAQFP NAPKLSLNYV LRYNVDAFGG NVVAQVDGVW YDKQFLEVTN GRSSIQPAYN VTNASLSWTS DDDRLSVQVF GRNVFDKAYR AYALNLGPLG TTSVYAKPAT YGVSATVKW
|
| |