Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3403 |
Symbol | |
ID | 5077552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 2678 |
End bp | 4873 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640481127 |
Product | TonB-dependent receptor |
Protein accession | YP_001165789 |
Protein GI | 146275629 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.181941 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGATC TTACACTGCG CAGTCGCGCT GTGCTCGCCG TACTCATGTC CACCGTTGCC ACGCCAGCGC TTGCGCAAGC GGTCGACGAG CCCGCGAACG ACGGCGGCCT CGAAGCCATC GTCGTCACCG CAGAGCGCCG CGAGCAGAGC CTCCAGGCCG TGCCGATCTC GGCCACCGTG CTTTCGGGCG AAGAGCTTCA GCGCAAGGGC GTTTCCAACC TCAACGACAT CCAGCAGGTC GCACCCTCGG TTGCCATCAA CACCTTCAAC CGCTCGACCT TCATCAACAT CCGCGGCGTC GGCATCGCCC AGTCCGCGCC CACCTCGAAC CCCGGCGTCG CCTACTACAT CGACGGCCAG CTCATCCCGC ACGAGCAGTT CATCGGCCAT TCGTTCTTCG ACATCGGCAC GATCGAGGTC CTGCGCGGCC CCCAGGGCAC GCTGACCGGC CAGAACTCCA CCGGCGGCGC GATCTACGTC CGCACGCCCG AGCCCGAGTT CGACAGCACC TTCGCCATCG GTGACGTCAC CGTCGCCAAC TACGACCGCT ACCGCGCCGT CGCCGCGCTG AACCTCGGCG GCGAGGACGT CGCCCTGCGC ATCGCCGGCG TGCACGAGGA GCGCGACAGC TTCACGCGCA ACATCGCGGC CAACGCGCAG AGCCAGCCCG GCAACCTCAA CATGGACGCG ATCCGCGCCA ACCTGCGCCT GCGCGACATG GACGGCCGCC TGACCGTCAA CGTGCGCGGC GAATACTTCG ACGTCCGCTC CGACAACAAC GCGGTCAAGA ACCGCAACGA CAAGGTCAGC AGCAATCCGT TCGAGATCGA GGAAGACGCG CTCTCGTTCC AGAACCAGGC CGGCTACCGC ATCTCGACCG AGGCACGCTA CGACGTGTCC GACAGCGTCC AGGCGCGGGG CCTCGTGTCC TGGCAGGACG GCTATACCCA CGACCAGACC GACGGCGACC GCACCGCGAC CGCCCAGGCC GTCCCCGCCA ACCTGTCCAC CAGCAGTGCC AACACCCGGA CCTATCCCGG CCGCGTCAGC AATGGCGACA CGCGCTTCAA GACCCTGATC GGCGAGTTCA ACCTGCTCTC CACCGACAAG GGGCCGCTGC AGTGGGTCGT CGGCGGCTTC GTGATGGACG AAACCGTCCC CGTCACCTTG CTGCGCGACA ACCGCAACAC GCTCGACCTG CTCCAGTCGA ACAGCTCGAT CATCACCGAG GCGAAGAACA CCTCGCAGTC GGTCTTCGGC CAGGTGAACT ACTACGTCAC ACCCGCGATC GAAGTTCTGG CGGGTGCGCG CTACAGCTTC GACAAGCAGG TCTATACCCG CTTAGCCGTC CCGGGCGCAG GCTTCACCCT GCCTTTCACC AGCGAAGCGA AGTCGGAACA GCTCACTGGC AAGATCGGCC TCAACTACCA CTTCGGCGCC GACAACCTGC TCTACGTGAC CGCATCGAAG GGCTACAAGG CGGGCGGCGT GAACCTCACG CCCAACACTC CCGACTTCAA GCCGGAACGC AACTTCGTCT ACGAAGCAGG CTTCAAGACC GAACTCCTCG ACCGCCACCT GCGCGTGAAC GGCGACGTGT TCTACTCGGA TTACAAGGAC ATCCAGCTTT CCAGCCTCGT CGGCGGCCTG CCCACCACGC AGAACGCGCT GGCTGGCCGT GCCTATGGCG GCGAACTTGA AGTCACTGCC CAGTTCGGCG GCTTCGCGGC GAACGCCGGC CTCGGCTACC TCGATGCCAA GTTCAAGAAC TCGGCCTGCA TTTCCGACAC CAACGCCGCC GGCACCGATC CTGGCTGCGC CACCAACCTG CGCTTCGTGC CCAAGGGCCG CGTCCTGCCG TTCTCGCCGG AATGGACCGT CAACGCGGGC GTCCAGTACA CGCTCTCGCT CGGCAGCGTG GACGTGACTC CGCGCGTGCA GTGGTCGTAC CTGTCGGAAC AGTACGCCAC CCCGTTCCCC AGCGTGAACA CGCTGGTCCC GGGCCGCAAC CTGTTCGACG CGCGCCTCAC TTTCGACCTC GGTCGCAAGT ACAAGCTCGA AGGCTTCGTC AACAACCTGA CCAACAAGAC CTACATCGCC ACGCAGATCC AGAACAGCTC GAGCGCGGAC GGCGGCATCA TCTACGGTGC ACCCCGCACC TGGGGCGTTC GCCTGAAAGT CGAGATCGGC AACTGA
|
Protein sequence | MPDLTLRSRA VLAVLMSTVA TPALAQAVDE PANDGGLEAI VVTAERREQS LQAVPISATV LSGEELQRKG VSNLNDIQQV APSVAINTFN RSTFINIRGV GIAQSAPTSN PGVAYYIDGQ LIPHEQFIGH SFFDIGTIEV LRGPQGTLTG QNSTGGAIYV RTPEPEFDST FAIGDVTVAN YDRYRAVAAL NLGGEDVALR IAGVHEERDS FTRNIAANAQ SQPGNLNMDA IRANLRLRDM DGRLTVNVRG EYFDVRSDNN AVKNRNDKVS SNPFEIEEDA LSFQNQAGYR ISTEARYDVS DSVQARGLVS WQDGYTHDQT DGDRTATAQA VPANLSTSSA NTRTYPGRVS NGDTRFKTLI GEFNLLSTDK GPLQWVVGGF VMDETVPVTL LRDNRNTLDL LQSNSSIITE AKNTSQSVFG QVNYYVTPAI EVLAGARYSF DKQVYTRLAV PGAGFTLPFT SEAKSEQLTG KIGLNYHFGA DNLLYVTASK GYKAGGVNLT PNTPDFKPER NFVYEAGFKT ELLDRHLRVN GDVFYSDYKD IQLSSLVGGL PTTQNALAGR AYGGELEVTA QFGGFAANAG LGYLDAKFKN SACISDTNAA GTDPGCATNL RFVPKGRVLP FSPEWTVNAG VQYTLSLGSV DVTPRVQWSY LSEQYATPFP SVNTLVPGRN LFDARLTFDL GRKYKLEGFV NNLTNKTYIA TQIQNSSSAD GGIIYGAPRT WGVRLKVEIG N
|
| |