Gene Information |
Locus tag | Saro_1691 |
Symbol | |
ID | 3916266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1773187 |
End bp | 1775418 |
Gene Length | 2232 bp |
Protein Length | 743 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444432 |
Product | TonB-dependent receptor |
Protein accession | YP_496965 |
Protein GI | 87199708 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
| |
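The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1773187
end_bp = 1775418

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2232 743
```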
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0224052 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGAAC GTCTTTACCG TGGAATCGCG CTGGCAGGGG TAGCCGCCGC ATCCATTTTC GCCGCAGCGC CCGTATTGGC GCAGGAAGTG CAATCCGACA GGGCCGTGCC CGACGGCGAG ATCGTCGTGA CCGCAACCAA GCGTGCGGAA AGCCTGCAAT CGGTCCCCAT CTCGGTCTCG GCCATCGGCG GCGATACGCT TGCCAAGGCG CGGGTCGGCA GCGTGGACAG CCTGGTGACC AAGGTCGCCA ATCTTCAGCT CACGTCGATC GTGGGCGACA ACACGCCGAT CTTCGCGCTG CGCGGCGTAT CGATGTCCGA CTACAGCCTC AACCAGTCGA GCCCCGTCGC AACCTATTAC GACGAAGTCT ACAAGGGCAA CTTCGCCTTC CTCGGCGTCA CCATGTTCGA CCTTGAGCGC GTCGAGGTAC TGCGCGGGCC GCAGGGCACG CTCTATGGCA AGAACACCAC CGGCGGCGCG GTCAACATCA TCGCCAACAG TGCGAAGCTG GGCGAGACCA GCGGCTATTT CAGCGCCGGC TATGGCAACT ATGACCGCTT CGACCTCAAC GGCGCGGTCA ACGTGCCGCT GGGCGAAAAG GCGGCCCTGC GCATCGCGGG CACTTATGCC CGGGCCGATG GCTGGTTCAA GAACGTGGTC CCGGGCAAGC CCGATCTCGC CTCGACCGAC GAGTACGCCA TCCGAGGCAC GCTGAACTTC GAGGCGAGCG ATACCGTCCG CTTCGTCCTG CGCGCGTCGA CCAGCTACCA GAACCCGCAG AACTATGGCA TCTATGCCCA GCCCGAAGAC GTCAATCGCC CCGGTCTCGA CCGCTGGGAG ATCGCCTCGA ACATCGCCAC CAAGCGCAAG GCGCGCACCT ATTCGGTCGC GCTTACCAGC AACTTCGACG TGTCGGATAC GCTGACCGTC ACCAGCATCA CCTCGTGGGA CAAGGGCAAC CTGTTCTTCT ACGAGGACAC CGACGGCACC GCCTCGCAAC TGCTCGAAAT CCCCTATACC GACCGCGCCA CCCAGTTCGC GCAGGACCTG CGCCTGACCA GCGACACCGG TGGCCCGTTC GATTTCATCC TCGGCGCCTA CTTCAACCGC GAGAAGGTCT ACAACGAGAC TGCCTTCGAG ATCGGCAAGG ACATTGACCT TACCGGCGAC AACATCGTCA CGGCAGACGA CTGCGTCGAA GGCCTGACCA ACGAAGATGG CAGCGACGAT GGCATCGCCT GCCTCTTCCG CAACCGCTTC GACCAGGTGA AGAAGAGCTA CGCGATCTAT TCGGACCTCA AGTACCAGGT CACCGATGCG GTGACCCTGC GTGGCGGCCT GCGCTATACC CACGATACGG GCCGGCAGAG CGGCTTCCGC TCCGATGCGC TGGGCGTCGA CGGGTCCGAG GTGGCCAACC TGATCCCCTT GTCCTCGCTC AGCTATTCGC AGGACAACCT CTCCGGGAAG ATCGGCCTCG ACTACAAGCT GGCCGATGGC AACCTGCTCT ACGCCAGCGT CAGCCGGGGC TACCGCGCGC CCAGCTTCAA CGCGCAGGCC TTCTTCGATC CGTCGGAGCT TTCGGTCGCC AAGCCCGAGC AGGTGACCTC GTACGAAGTC GGCGCGAAGA CGCAGTTCCT CGACCGCCGC ATCACGCTCA ACGTGGCCGG GTTCTACTAC GACTACCGCA ACCAGCAGTT CATCAACGTC GACCCGGTAC TGGGCTCGCA GACGCTGCTG AACATTCCCA AGTCGCGCAT CTATGGCGGC GAGGCCGAGC TGACGATCCG CGCCAGCGAC 
CGGCTGACCC TGCACAGCGG CATGGGCGTC CTTGCCACAA AGATCCAGCG CGGCAGCGTG AGCGGCGTGG ACGTTTCCGG CAACCGCCTG TCCAACGCAC CGACCTTTAC CTTCAACGCC ACGATCGACC TGACGCTGGT CGATGGCGAC ATGGGCAAGC TCTCGGTCCA CCCGGACGTG GCCTACCAGT CGAGCCAGTT CTTCGAAGTG CTCAACATCC CCCGCCTGCG CCAGACTTCC TACGCGCTGG TCGGCGGGCA CATCGACTGG GAAAGCGCCG ACGGGCGCTT CAATGCCTCG GTCTGGGGCA AGAACCTGTC CAACAAGTTC TACTTCACCT CGCGCGTGGA CCTGCTGGCG GGCTTCGGCT TCGACTACAA CCACATCGGC AATCCGCGCA CTTACGGCGT GACAGTGGGC GCGAAGTTCT GA
|
Protein sequence | MRERLYRGIA LAGVAAASIF AAAPVLAQEV QSDRAVPDGE IVVTATKRAE SLQSVPISVS AIGGDTLAKA RVGSVDSLVT KVANLQLTSI VGDNTPIFAL RGVSMSDYSL NQSSPVATYY DEVYKGNFAF LGVTMFDLER VEVLRGPQGT LYGKNTTGGA VNIIANSAKL GETSGYFSAG YGNYDRFDLN GAVNVPLGEK AALRIAGTYA RADGWFKNVV PGKPDLASTD EYAIRGTLNF EASDTVRFVL RASTSYQNPQ NYGIYAQPED VNRPGLDRWE IASNIATKRK ARTYSVALTS NFDVSDTLTV TSITSWDKGN LFFYEDTDGT ASQLLEIPYT DRATQFAQDL RLTSDTGGPF DFILGAYFNR EKVYNETAFE IGKDIDLTGD NIVTADDCVE GLTNEDGSDD GIACLFRNRF DQVKKSYAIY SDLKYQVTDA VTLRGGLRYT HDTGRQSGFR SDALGVDGSE VANLIPLSSL SYSQDNLSGK IGLDYKLADG NLLYASVSRG YRAPSFNAQA FFDPSELSVA KPEQVTSYEV GAKTQFLDRR ITLNVAGFYY DYRNQQFINV DPVLGSQTLL NIPKSRIYGG EAELTIRASD RLTLHSGMGV LATKIQRGSV SGVDVSGNRL SNAPTFTFNA TIDLTLVDGD MGKLSVHPDV AYQSSQFFEV LNIPRLRQTS YALVGGHIDW ESADGRFNAS VWGKNLSNKF YFTSRVDLLA GFGFDYNHIG NPRTYGVTVG AKF
|
| |
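The protein sequence follows from the gene sequence under translation table 11, and the GC content can be recomputed from the same string. The sketch below is a minimal illustration, not the database's own pipeline. The helper names `clean`, `translate`, and `gc_content` are hypothetical. It relies on table 11 sharing its codon-to-amino-acid assignments with the standard code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here).

```python
# Codon table built from the NCBI-style amino-acid string (base order T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
prefix = "ATGAGGGAAC GTCTTTACCG TGGAATCGCG"
print(translate(prefix))  # MRERLYRGIA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content reported in the table.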