Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3816 |
Symbol | |
ID | 5077964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 471857 |
End bp | 474226 |
Gene Length | 2370 bp |
Protein Length | 789 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640481539 |
Product | TonB-dependent receptor, plug |
Protein accession | YP_001166201 |
Protein GI | 146276041 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.540841 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCCA GGTGGAAGCT GACCACTGCG GTTGTGCTGG CGGCGACGGC TGCGACTTTC GCAAACGATG CTGCGGCACA GGACGCAAAG CAGGAGCGGG CATCTGCCGA TAGCCCGGTG GTGTTCGGCG ACATCGTCGT GACAGCGACC AAGCGTTCGG AGAACGTCGG CCGCGTGCCG ATCGCGATTT CGGCCTTTTC GGGCGATCAG CTCAAGGCTC TCGGCATCAC CGAGACGACG CAGATCACCC AGCATGTGCC GGGTCTGCAG CTCAACGCCT GGTCACCCAA CGTGACCATC TTCAACCTGC GCGGCGTTTC GCAGAATAAC TTCACCGACT ATCTGGAAAG CCCCGTCGCG GTCTATATCG ACGACGCCTA CATGGGTTCG ATCAATGGCG TGTCGGGCCA GTTGTTCGAT GTCCAGCGCG TGGAAGTCCT GCGTGGACCG CAAGGAACGC TGTTCGGGCG GAACGCGACC GGCGGGTTGA TCCACTACCT CTCCACCGAC GCCTCGAACG CAACGTTCAA TGGCTATGCC ACCGCCGGCT ACGAGCGGTT CAACCGGCGC ATGCTGGAAG GCGCGGCGGG CGGTTCCATC GCCGAGGGGC TGCGCTTCCG CGTCGCCGGA CGGATCGCCA GGGCCGATGG CTATGTGAAG CCCCAGGCCG CATTGCCCGG AGTGTTCGAA AGCAACGGAC AGGCGCTGGG CGGAGAGAAC GGCTGGGCAT TGCGCGGCAC GATCCAGGCC GACCTTGGCG AGCGCGGCAC GCTCGACCTG TGGTACAAGC ATTCCGAGGA CAACGGGGTC GCCACCGGTG GCTACGTGTT CGACAACTGC GACCTCACGG CGAGCGGCTA TTGCGCCACG GACGCGGCGG GGCTTTCCAA CGGCACCGGG GGCGTGATCA ATGCGATCAC CGGTGAAAAG GCCAGTCCCT ACGCCAACTT CTCCAACCTT CGCGGTTCGC TGGATCGGGA CATCGACGTC GCGCAGGCGA AGCTGCTCTA CGATCTGGGC GGCGCAAAGC TGACCGCGAT CACCAACTAC ACCTGGCTCG ACAAGGTCTA TGCCGAGGAT GGCGACGCCA CTCCGCTCGA CCTGATCGGT TTTGGCACGA CCGCAAAGTA CCGCCAGTTC AGCCAGGAGC TGCGGCTGTC CGGCGACGCT CCGCGCTTTC GCTGGCAGGT GGGCGCCTAC TACCTGGACA TGAAGATCCG CGGATCGAGC CTGACCAGCG GCGTCCCCGC GCTGAGCGCG GCGGTGGCGA CCGGGCTGGA CGGGACCAAC CCGGCCATCG ACGACGACTA CAACCTGCGT TCGAAGAACT GGTCGCTGTT CGGGCAGGTC GAATACGACC TCGCCGACAA GGTCACGCTG GTTGCCGGCG GTCGTTATTC GAAGGACACC AAGCGTGTCG ATTATCGCTC GGCCGTGACG TCCGGAGGGG CGACGGTCGA ACTGGGCTCT GACGAGAGTT TTGCCGCGCT GCGCCCAGGC GTGGACCGGA TTTCCGATGG TGACTGGGCA GCACGTGTCA GCCTGAACTT CAAGCCGAAC GCGGACACCC TCCTGTTCGC TTCGTGGAAC CGGGGGATCA AGGGCGGCAA CTTCACGTTG AGCCCGGTCG TCGACGTCGC CAACTTCCAG CACGGCGGCG AAACGCTCAA TTCGTTCGAG CTGGGTGCGA AGTGGGCCAA TGCGGACAAG ACCGTGCGCC TGGCCGCGAC CGCCTATCAC TACATCTACG ATAACTATCA GGCCTTCGCC ATCATCAACT TCGTGCCGCA GGTTCGCAAC AGCGACGCCC GCGCGACCGG TGTCGAGATC GAGGCATTCC TGCGCCCCGC GCCACACTTC AACGTCAACC TTGGCGCGAC GTGGGAAACC AGCAAGGTCG ACTTCGTCTC GACCGCCGGG ACGGCCGTGC TTGGCGTGCT GGTCCCCGGC GCACCCGCGC CGCAATACTG CACCGACCAG GGCGACGGGA CTTATGGCTG CGCCTATCCC AGCGCCGGCG TGACCGACGC CGAACTCCCC AACTCGCCGC GCTTCAGCGT CAACTACGTG CTGCGCTACG ATGTGGACAC CAGCTTCGGC AACGTTGCCG CGCAGTTTGA CGGTGCGTGG TACGACGACC AGTTCCTCGA AGTGACCAAT GGCGCCTCAT CGCGGCAGAG CGCCTACAAC GTGTCGAACG CCTCGCTGAC GTGGACATCG CCGGACGACC GCTTCTCGGT CGAGGTCTAT GGCCGCAACG TGTTCGACAC GGTCTATCGG CAGTACACGC TGAACCTCGG CCCGCTCGGG ACCACCAGCA TGTACGCCAA GCCCGCGACC TATGGCGTCA GCGCAACGGT GAAGTGGTGA
|
Protein sequence | MESRWKLTTA VVLAATAATF ANDAAAQDAK QERASADSPV VFGDIVVTAT KRSENVGRVP IAISAFSGDQ LKALGITETT QITQHVPGLQ LNAWSPNVTI FNLRGVSQNN FTDYLESPVA VYIDDAYMGS INGVSGQLFD VQRVEVLRGP QGTLFGRNAT GGLIHYLSTD ASNATFNGYA TAGYERFNRR MLEGAAGGSI AEGLRFRVAG RIARADGYVK PQAALPGVFE SNGQALGGEN GWALRGTIQA DLGERGTLDL WYKHSEDNGV ATGGYVFDNC DLTASGYCAT DAAGLSNGTG GVINAITGEK ASPYANFSNL RGSLDRDIDV AQAKLLYDLG GAKLTAITNY TWLDKVYAED GDATPLDLIG FGTTAKYRQF SQELRLSGDA PRFRWQVGAY YLDMKIRGSS LTSGVPALSA AVATGLDGTN PAIDDDYNLR SKNWSLFGQV EYDLADKVTL VAGGRYSKDT KRVDYRSAVT SGGATVELGS DESFAALRPG VDRISDGDWA ARVSLNFKPN ADTLLFASWN RGIKGGNFTL SPVVDVANFQ HGGETLNSFE LGAKWANADK TVRLAATAYH YIYDNYQAFA IINFVPQVRN SDARATGVEI EAFLRPAPHF NVNLGATWET SKVDFVSTAG TAVLGVLVPG APAPQYCTDQ GDGTYGCAYP SAGVTDAELP NSPRFSVNYV LRYDVDTSFG NVAAQFDGAW YDDQFLEVTN GASSRQSAYN VSNASLTWTS PDDRFSVEVY GRNVFDTVYR QYTLNLGPLG TTSMYAKPAT YGVSATVKW
|
| |