Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2215 |
Symbol | |
ID | 3916531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2357459 |
End bp | 2360374 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640444970 |
Product | TonB-dependent receptor |
Protein accession | YP_497487 |
Protein GI | 87200230 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.169336 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGGCA AGCTTTCCCT GCTGGCCGGC GTGTGCGTAG CGGCCATTTC CACCCCCGTA TTCGCCCAGT CGGCGAACGA GAATGTGGGT CTTGAAGAAA TCATCGTTAC TGCGCAAAGG CAGGCGCAAT CGCTTCAGGC CGTGCCTATC GCGGTGTCCG CATTCTCGGC GGAGAACCTC GAGAAGCAGC AGATCCAGAA CTCTTCCGAT CTTCAGCTCT CGCTGCCGAA CGTAACCTTC ACCAAGACCA ACTTCACCTC CTCCAGCTTC ACCATCCGCG GCATCGGCGA CCTTTGCGTC GGCGTCTCGT GCGATGCCGC CACGGGCATC CACCTGAACG ACATGCCGAT GGTATCGAGC CGCCTGTTCG AAACCGAATT CTTCGATCTC GAACGCATCG AAGTTCTGCG CGGTCCGCAG GGCACCCTGT TCGGGCGCAA CGCCACTTCC GGCGTGGTCA ACATCATCAC CGCCAAGCCT GATCTTTCCG GCTTCGGCGC GTCGGGCGAG GCCGACTACG GCAAGTACAA TTCGGTCCGC GTCAAGGGCA TGATCAACGT TCCGCTGGGC GAAACGCTCG GCGTGCGCGT TGCGGGCATG TACACAAAGC GCGATGGTTA CACCTACAAC ATCGGCTCCA ACAGCCGCAT CGACGGTCGC GACATGTACG CCCTGCGCGG CTCGCTGCGC TGGGAGCCGA GCCCGGATAC CACCATCGAC CTTTCGGCTT ACTACTTCCG CGAGAAGGAC AGCCGCAGCC GCGTCCAGAA GCAGCTCTGC CATCGCGACC CGACCGGCAT ACTTGGTTGC TTGCCCGACA AGCTGGCCAA CGAGACAACC AACGCGGACT CGACCCTCGC GGCACTGCTG ACGTCGAAGG AATTTTTCGG CATTGCCGTT GCACCGAGCT TTGCTGCGCT CGGCCTCGGC AGCATCTATG GCGACGACGG CGACAATTTC TCGGGCGCGG TCAATCCGTC CGACGTTCGT ACCGTCAACA TCGACTACCT GCCGACCTAC TTCGCCGAGG AAGAAGTCTA CCAGGGCAAG CTCCAGCAGG CTCTCGGCAA CAACCTGACG TTGAATGTCA CCGCTGGCTA TTCGCGCAAC GAGGTTCGCA GCCGTACCGA CTACAACCTC GCGACCGAGC GCAGTCTCGT CGGCAACGCG GGCCTCTATG CCCTCGCGGC CTTCGCGCAG AGCCCGACCC TGGGCTTTGC CTTCGCGCCT ATCGCGGCTC GCCTCATCCC TAACGGCCCG ACCGGCCAGA TCTGCCAGTC CGACATCGAC CCCAACAACG TTGGTGTCTT CGGCGCGGGC AACAAGGCGA TCTGCGCGAA CACCTCGCTC GACTTCGATG AATCGAGCCA GACCTACCAG CAGTACGCTG CCGAAGCGCA CATCGACTCC GACTTCGACG GCATGTTCAA CTTCCTGATC GGCGCGAACT ACCTGCGCGG CGTGACGACG AACAACAGCT ACTACGTCAA CTCGTTCGGT CTGGACTACG CATCGGGCCT GCTCGGTGCT GCGGCGGCGA CGACCGGCGC CTTCGGTGCC AATGCGGTGT TCCGTCCTTC GCCGTTCTTC CGCAGCAACA CCGACAAGGG CACGCTCGAC AGCTACGGCA TCTTTGGTGA GACCTACTTC AAGTTCAACG ACAAGATTAA GCTCACCCTC GGCCTGCGCT ACAACCACGA CAAGAAGTTC ACCCGAGCCC GCACCACCTT GTTGAGCGAC GGCCTGTCGA GCGAATTCGC AGGTGGCAAC GCAATCTATG CGCTGGTCGG GGCGGCCTCG CTCGAGGATG CTCTGAACTG GGCCACTGCC GACTTCGACA AGGGCACGGA TGACGTCCAG GCCTTCCAGG AACGCTCGGT CAGCTTCGGT CGCATGACCG GCCGCGCGGT GCTGGATGTC CAGCTCACGC CCGACAACCT GCTCTACCTG TCCTACTCGC GTGGGTACAA GTCGGGCGGC ATCAATCCGC CGCTCTCGGT CGGGTCGGTC AACGACTCGT TCAAGCCGGA ATCGGTCGAT GCCTTCGAAA TCGGTTCCAA GAACCGCTTC GGCGCGCTGC AGCTCAACCT CTCGGCGTTC TACTACCGCT ACAAGGATCT TCAGCTCAGC CGCATCACCG CGCGCACTTC GGTCAACGAC AACATCGATG CCAACATCTA CGGCGTCGAG GCCGAAGCGA TCATGGCGCC CACGCGCAAC CTGCTGGTCA ATTTTTCGGC CAGCTACCTC AAGACCAAGG TCGTCGGGGA CCAGTTCTTC GTCGACACCC GCGACGTTTC GGCGGGCCGT TCGGACACGG TGATCATCAA GGACATCACC CTCGGCTCGA ACTGCGCGGT TACCGGGGCA AGCGCGGCGG CGGCCAATGC CTTCGTCAAC ACCATCAATA GCGGGGTCGG GCTGCGCGGC ACGACGCCGA TCCCCGGCAC CAACACCACC GGTGCCTTCT CGATCTGCGG CGTTCTGGCA ACCCAGGCCG CGGCGGTCGG CAGTGCATTC GGCGGCATCA CCGTCAGCTC GGGCATCGAA AAGAACGTGC GCGGCAACCA GCTCCCGCAG GCCCCTGAAT TCAAGTGGTC GGCGGGCATC CAGTACACGC TCGAACTGGG CGGGATGACC CTCGTTCCGC GCTTCGACAT CAACTACACG GGTGAAAGCT ACGGCACGAT CTTCAACGGC AACATCAACC GGATCAAGGG CTACGAGGTG ATGAACGCCC AGATCCAGCT GAATGGCCGT GACGACCGCT GGTTCCTGCG CGGCTACATC CAGAACATCG GCAACAACAA CGCCACGACC GGCCTCTACG TGACCGACCA GTCCTCGGGC CTGTTCACCA ACATCTTCAC GCTGGAACCG CGCCGCTACG GCGTCGCGGC CGGGTTCAAG TTCTGA
|
Protein sequence | MRGKLSLLAG VCVAAISTPV FAQSANENVG LEEIIVTAQR QAQSLQAVPI AVSAFSAENL EKQQIQNSSD LQLSLPNVTF TKTNFTSSSF TIRGIGDLCV GVSCDAATGI HLNDMPMVSS RLFETEFFDL ERIEVLRGPQ GTLFGRNATS GVVNIITAKP DLSGFGASGE ADYGKYNSVR VKGMINVPLG ETLGVRVAGM YTKRDGYTYN IGSNSRIDGR DMYALRGSLR WEPSPDTTID LSAYYFREKD SRSRVQKQLC HRDPTGILGC LPDKLANETT NADSTLAALL TSKEFFGIAV APSFAALGLG SIYGDDGDNF SGAVNPSDVR TVNIDYLPTY FAEEEVYQGK LQQALGNNLT LNVTAGYSRN EVRSRTDYNL ATERSLVGNA GLYALAAFAQ SPTLGFAFAP IAARLIPNGP TGQICQSDID PNNVGVFGAG NKAICANTSL DFDESSQTYQ QYAAEAHIDS DFDGMFNFLI GANYLRGVTT NNSYYVNSFG LDYASGLLGA AAATTGAFGA NAVFRPSPFF RSNTDKGTLD SYGIFGETYF KFNDKIKLTL GLRYNHDKKF TRARTTLLSD GLSSEFAGGN AIYALVGAAS LEDALNWATA DFDKGTDDVQ AFQERSVSFG RMTGRAVLDV QLTPDNLLYL SYSRGYKSGG INPPLSVGSV NDSFKPESVD AFEIGSKNRF GALQLNLSAF YYRYKDLQLS RITARTSVND NIDANIYGVE AEAIMAPTRN LLVNFSASYL KTKVVGDQFF VDTRDVSAGR SDTVIIKDIT LGSNCAVTGA SAAAANAFVN TINSGVGLRG TTPIPGTNTT GAFSICGVLA TQAAAVGSAF GGITVSSGIE KNVRGNQLPQ APEFKWSAGI QYTLELGGMT LVPRFDINYT GESYGTIFNG NINRIKGYEV MNAQIQLNGR DDRWFLRGYI QNIGNNNATT GLYVTDQSSG LFTNIFTLEP RRYGVAAGFK F
|
| |