Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3785 |
Symbol | |
ID | 5077933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 427226 |
End bp | 429472 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640481508 |
Product | TonB-dependent receptor |
Protein accession | YP_001166170 |
Protein GI | 146276010 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.440403 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCGA CACAACACCG CGTTCGCGCG CACGTTGCAC TAGGTGCGGG ACTTGCCGCA CTTTCATCCA CGGTCGCCGT GGCCCAGCCC CCCGGCGAAA CGATTGAAGG CGCTGGTGAA ATCGTCATCA CCGCACAGAA ACGGCCGGAA CTCGCAGGCG AAGTGCCGCT GTCCATTTCG GTGATCGACG GCGAGACCTT GCAGGCTGCC CGGCTCAGTC AGGCCGACGA TATCGCGGCA TTGGTCCCCA ATCTCCGTTT CAGCGCAACG GTCGGGGAAA ACACGCCGAT CTTCGCGCTG CGCGGAGTGT CCATGTCGGA CTTCAGCCTC AACCAGGCCG GACCGGTCGC GACCTATTAC GACGAGGTCT ACAAGGGCAA CTTCGCCTTC CTTGGGGTCC AGCTTTATGA CCTTGCACGG ATCGAGGTCC TGCGCGGACC GCAGGGCACG CTCTACGGCA AGAACACCAC TGGCGGCGCG ATCAACTACC TTGCCGAACG GCCACGGTTC GAGAACGGGG GGTACCTGAA AGCCGGCATC GGAAACTTCG GGAGAGCCGA AGGCCAGGGC GCGTTGAACC TCGCCATCTC GTCCACGCTG GCCGCGCGCA TCGCCTTCAC CGCCGCGCGC GCCGACGGTT GGTTCCGCAA CCGGCTGGCG GGTAGCCCGA ACCTTTCAGC CACCCGCGAG TACGGCGTGC GCGGCTCGAT CCTGTGGAAG CCGTCGGATA GCGCCGAACT GGTCCTGCGC CTGTCGACGA GCCTCCAGAC GCCGCAGAAC TACGGCACCT ATTCGGTACC GGGGCCCGGC GGCACAGGCG CGGGCGTCTA TGAAGCATAC GGTCGGGGCA TGAGCTACTT TCGCACCGGC ATCGGCAAGC GGGAAATCGA AGCGAACTTC ACGCCGCGTC GCCGCGCCCG CACATGGTCG GCGGCATTGA CCGGCACGTT CCGGCTGAGC GACAACCTGT CGCTCGTGTC CGTGACGGGA TGGGACCGCG GCAGCCTCTT CGTGCCGGAG GACACGGATG GCAGCCCGAC CCGGACCCTC GAAATTCCCT ACACGGACCG CGGCACGCAG TTCGGGCAGG AACTCCGCCT TGCATACGAC GGCGATGGAG CGCTGAGCCT GATCCTGGGC TTGCACCACC ATCGCGAGGA CCTGTTCAAT GCGACCGACC TGAACTTCTG GACCGACCTC GACGTCGATG GCAACGGGCG GGTCGACGTT GATGACTGCT CCGCGAACGC CAGCCTTATG GCCTGTGCCA TTTCCAACCG CTTCGACCAG CGCAAGCGGA GCTGGGCACT GTTCGGCGAT GCGCACATGA AGCTCGGTAC CAGGACGGGC CTGCGGGGCG GGCTGCGCTT CACCCGGGAC ATCGGGCTGC AGGCGGGGCT GACGTCGCAA TTACGCGGCG TCGATGGCGT GCTTGTCGCA ACGCCCATCC TCCCGCTCGA CCGCAGCTTT GCGGGCAGCA ACCTTTCCGG CAAGATCGGT ATCGACCACA AGCTGGCCGA TGGCACCATG TTCTTCGCCA GCTACAGCCG GGGCTACCGG GCAAGCGGGT TCAATGCGCA GGCTTTCTTC GATGCCGCCG AAGCCGGTGT GGCCCGACCC GAGACGATCG ATGCGTTGGA AGCTGGCGCC AAGACGCGAT TGGCCGGTAA CGCGCTGGCG GTCGCGGTGA CCGGCTTTCA CTACATCTAC CGCAACCAGC AGTTCCTTTC GGTCGACCCT GCCGATGCGA CGCAGACACT CGTCAATCTC GACCGGTCGC GCATCTATGG GGCCGAGATA GAGCTGGAGG CTCGGCCCAC GTCCGAGATT GCGGCTCAGA TCGGCGTCGG CATCCTGCAT GCGCGGGCAA CGCAAGGCAT GATCGGCGGC CTCGACGTGA GCGGCCACAG CCTTTCCAAC GCGCCCTCGC TTACTCTCAA CGCCGCAGTG GCTGCGACGA TATGGGAACG CGGGCCGGCG CGCATGGCGC TGCGCGGGGA CGCCAGCTAC ACGTCCTCGC AGTTCTTCGA GATCGTCAAC ATCCCCCGCC TGCGTCAGCC CGGATATGCG CTGCTTGGCG CAAGTGTCGA TTACGCGCGC GGTCCGATGA TCCTATCGAT CTGGGGCAAG AACCTTGGCG ACAAGGTCTA TTTCACCTCG AGCATCGATC TTTCTGGATT CGGCTTCGAT TACAATCACG TGGGGACACC CCGCACCTAT GGAGCGACCG CCAGGGTCAG CTTCTAG
|
Protein sequence | MSATQHRVRA HVALGAGLAA LSSTVAVAQP PGETIEGAGE IVITAQKRPE LAGEVPLSIS VIDGETLQAA RLSQADDIAA LVPNLRFSAT VGENTPIFAL RGVSMSDFSL NQAGPVATYY DEVYKGNFAF LGVQLYDLAR IEVLRGPQGT LYGKNTTGGA INYLAERPRF ENGGYLKAGI GNFGRAEGQG ALNLAISSTL AARIAFTAAR ADGWFRNRLA GSPNLSATRE YGVRGSILWK PSDSAELVLR LSTSLQTPQN YGTYSVPGPG GTGAGVYEAY GRGMSYFRTG IGKREIEANF TPRRRARTWS AALTGTFRLS DNLSLVSVTG WDRGSLFVPE DTDGSPTRTL EIPYTDRGTQ FGQELRLAYD GDGALSLILG LHHHREDLFN ATDLNFWTDL DVDGNGRVDV DDCSANASLM ACAISNRFDQ RKRSWALFGD AHMKLGTRTG LRGGLRFTRD IGLQAGLTSQ LRGVDGVLVA TPILPLDRSF AGSNLSGKIG IDHKLADGTM FFASYSRGYR ASGFNAQAFF DAAEAGVARP ETIDALEAGA KTRLAGNALA VAVTGFHYIY RNQQFLSVDP ADATQTLVNL DRSRIYGAEI ELEARPTSEI AAQIGVGILH ARATQGMIGG LDVSGHSLSN APSLTLNAAV AATIWERGPA RMALRGDASY TSSQFFEIVN IPRLRQPGYA LLGASVDYAR GPMILSIWGK NLGDKVYFTS SIDLSGFGFD YNHVGTPRTY GATARVSF
|
| |