Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2803 |
Symbol | |
ID | 3916963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 3026531 |
End bp | 3028921 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640445582 |
Product | TonB-dependent receptor |
Protein accession | YP_498073 |
Protein GI | 87200816 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
| 


|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000028196 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACAAG CCAATCGTAC CACGTCGTTC ATTCGACTTT TCGCTGCGGG TAGCACTGCT GCGCTCGCGC TCGGCATTGC CACGCCGTCG TTCGCGCAAG ACGCCCAGGC CCAGGCCGAC CAGGCGCCGA CCACCGGCGA GATCGTCGTC ACCGCACAGT TCCGCGAGCA GCGCCTGCAG GATACCCCGC TGTCGATCAC CGCGGTCGAT GCCAGCCTGC TCGCCTCGCG CAACCAGACC GACATCTCGC AGATCGCGGC TCAGGCGCCG AACGTGCAGC TCACCCAGAT GGGCGGCGCC TTCGGTTCGT CGATGGCGGC CTATATCCGC GGCATCGGCC AGTACGACTT CAACCCGGCC TACGAACCGG GCGTGGGTAT CTATGTCGAC GACGTCTACT ACGCGACGCT GACCGGCTCG GTCATGGACC TGCTCGACCT CGACCGCGTC GAAGTGCTGC GCGGTCCGCA AGGCACGCTG ACTGGCCGCA ACTCGATCGG CGGCGCGATC AAGCTGTTCT CGGCCAAGCC TACCGAAGGC AACAGCGGCA CCGTCGAGGC GACCTATGGC TCGCGCCAGC GCGTCGACCT GCGCGCCACG GCCAACTTCG AGCTGACCGA CGGCCTCTAT GCGCGCATCT CGGGCGTGTT CAAGCGCCAG GACGGCTATG TCGACCAGAT CGACTATGGC TGCGCCAACC CGGACAACGA ACTCGGCATC GGCGGCAATG CCTCGACGCC CGCGGACTGC GTCGTCGCCA AGCTGGGCGA GAAGAACTAC TCGGGTATCC GCGGTTCGCT GCGCTACAAT CCGTCGGATA CGATCGACTG GATTGTCACC GGCGACTACA CCTATGAAAA TCGCACCAAC GCCGCCGGCG TGATGAGCGC GACTGACCCG TCCAAGACCG GCGGCGTCGA TTTCACCTGC GGCAAGTTCT GCACCTATGC CAGCTGGTAC ATGCCGGAGG GCGGTCAGGC GACCCAGGCC TACTACAACC CCAACACCAC CAAGTTCGAA GGCTGGGGCG TTTCGAGCAA CCTGACGGTC GGAATCTCGG ACTCGCTGAA GCTCCAGGCG ATCACTGCCT ATCGCAAGTA CAACCAGATC TTCGGCACGG ATGATGACTA CACCCCCTAC AGCCTGATCG GCGGTTCGGG CTTCAACGAC CTCGACTTCA AGTTCTTCAG CCAGGAACTG CGCCTCAACG GCCAGGTCGG CGACAATATC GACTGGACCA TCGGCGGGTT CTACAACAAC CAGACCTCGG TCTACTTCAC CCGCCAGGAC ATCCGCTACA TCGTGCCGAT CGGCGTGCCC TCGCTGTTCC TGCAGTTCCA GGGTAACGAC CCGATCAAGG CCAACAGCAA GGCCGCGTTC GGTACGGTGA TCTTCCACCC GACCGAAGCG ATGACCGTGA CCGGCGGCAT CCGCTACACC AAGGAGCACA AGGACTATAC CTTCGTGCGC CAGGCATGGG ATGGCGGTAC GCTGACCGAT CCGTTCGGCG TCGGTGCGCT CGATGGTTCC AAGGCGGTCT ACGACGGCGA CAAGGTCGAC TGGCGCCTTT CGCTCGACTA CCGCTTCAGC CCCGAAGTCC TGGCCTATGC GACGGTCAGC ACGGGCTTCA AGGGCGGCGG CGTCACGGCG CGTCCGTTCA CCAAGAACCA GGCGATCAAC GGCACGTTCG ATCCGGAAAC GCTCCATGCC TATGAAGTGG GCCTGAAGAC CGACCTGTTC GACCGCAGGC TGCGCCTGAA CCTGTCGGGC TTCTACAACG ACTACAAGAA CATCCAGCTT CCGATCGGAG ACTGCTCGGC GCTCGACGGG TTCGAACCCG GCACCGATCC GTTCCCCTGC GCGGCGATCC AGAACGCCGG TGACGGCGAG ATGTACGGCC TCGAGGCGGA ACTCTCGGCC CACCCGGTCG AAGGTCTCGA CATCGATGCT TCGCTGAGCT GGATCGACGG CAAGTGGAAG CGGATCGACA CCGCGGCGCA GGGCGCGCTG CGCGTGACCG ACCCGATCAC CACGCCGGCC TGGCGCGGAA GCTTCGGCAT CCAGTACAAG GCGCTGCTGG GCAACAACGC GGGTTCGATC ACGCCCCGCT TCGACCTGTC CTACACCGGC AAGCAGACGA TCGGCCGTCT GATCAACTCC GGCGAGTTCG GCCCGCTGCA GTACAATCCG TCGATCACGC TGGCGAACGC CCGCGTCACC TGGAAGAACG AGGACGAGAA CCTTGCGGTC TCGTTCGAGG TCCAGAACCT GTTCGACAAG TACTACTACC TGCCGCTGCG CTTCGCTGCG GTCTATGCCT TCGTCGGCAC GGCCTACTCC AACGTCGGTC GCCCGCGCGA ATGGGCGGTC ACGGTTCAGA AGAAGTTCTG A
|
Protein sequence | MAQANRTTSF IRLFAAGSTA ALALGIATPS FAQDAQAQAD QAPTTGEIVV TAQFREQRLQ DTPLSITAVD ASLLASRNQT DISQIAAQAP NVQLTQMGGA FGSSMAAYIR GIGQYDFNPA YEPGVGIYVD DVYYATLTGS VMDLLDLDRV EVLRGPQGTL TGRNSIGGAI KLFSAKPTEG NSGTVEATYG SRQRVDLRAT ANFELTDGLY ARISGVFKRQ DGYVDQIDYG CANPDNELGI GGNASTPADC VVAKLGEKNY SGIRGSLRYN PSDTIDWIVT GDYTYENRTN AAGVMSATDP SKTGGVDFTC GKFCTYASWY MPEGGQATQA YYNPNTTKFE GWGVSSNLTV GISDSLKLQA ITAYRKYNQI FGTDDDYTPY SLIGGSGFND LDFKFFSQEL RLNGQVGDNI DWTIGGFYNN QTSVYFTRQD IRYIVPIGVP SLFLQFQGND PIKANSKAAF GTVIFHPTEA MTVTGGIRYT KEHKDYTFVR QAWDGGTLTD PFGVGALDGS KAVYDGDKVD WRLSLDYRFS PEVLAYATVS TGFKGGGVTA RPFTKNQAIN GTFDPETLHA YEVGLKTDLF DRRLRLNLSG FYNDYKNIQL PIGDCSALDG FEPGTDPFPC AAIQNAGDGE MYGLEAELSA HPVEGLDIDA SLSWIDGKWK RIDTAAQGAL RVTDPITTPA WRGSFGIQYK ALLGNNAGSI TPRFDLSYTG KQTIGRLINS GEFGPLQYNP SITLANARVT WKNEDENLAV SFEVQNLFDK YYYLPLRFAA VYAFVGTAYS NVGRPREWAV TVQKKF
|
| |