Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1619 |
Symbol | |
ID | 3918727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1688627 |
End bp | 1692235 |
Gene Length | 3609 bp |
Protein Length | 1202 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640444359 |
Product | cadherin |
Protein accession | YP_496893 |
Protein GI | 87199636 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.42147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTACA TTCAGATCGA TGGTTCGCTG TCGGACTGGT CCTCGAACCT GAGGATCGAC GCGGGCGCCG TGGACGGCTA CCAGATCTAT GCCACGACCG ATGCGACCGA CTATATCTTC GCTTTCGCAG CGCCCACGGC GGTTGGTGCG AACACGACGA TCTGGCTCAA CACGGATCTC AACCAGGCGA CCGGCTACCA GCTCTGGGGG ACTGTTGGCG CCGAGTTCAA CGTCAACTTC AAGTCCGACG GCTCTGCAGC ACTTTATTCG GGGGCAGCCG GCGGAACGCT TGTCGCGGAC AACCTCGTCC TTGCCTACAA CGCCGATAAA ACAATGGTCG AGCTGCGCGT TCCGAAGGAT CTGCTTGGCA ACCCCGGTTC GATCGACACC GTGTACGACA TAAACGACAC GGCGATCATT CCCAGCTTCT ACCAGGACAA CGCCATCCGG GTCTGGGACG ATTCCGAGCT GGCCAGCGTC ATCCCGGCGA CAGATACCCG TATTGCGATC GTCTATTCAG CGACGACGGC GGCCAACTAC TTCAGCCAGA CTGCCTATTC GGACCTGTTC ATGGCCGCCC AGTCGCAGGC GGCGCAGGCT GGCGTGCCCT TCGACATCAT CACCGAGGCC GACCTTACCG ACATCAACAA GCTCGCCCAG TACAAGGCGA TCGTCTTCCC CTCGTTCCGC AACGTGCAGG CAAGCCAGGC CGACGAGATC GCGCACACCC TTCAGCTCGC ATCGCAGGAG TTCCACGTCG GCTTCATCGT GTCCGGCGAG TTCATGACCA ATGACGAGAA CGGGAATGCC ATGGCCGGCA ATTCCTATTC CCGCATGGCG ACGCTGCTTG ACGCCACGCG CGTTACGGGT GGCACCGCTA CGTCACTGAC GGTCACGGCC ACCGACCCGA CCGGCGTCGT CCTTGATGGC TATGCCAACG GGGAACTGGT CAATCAGTAT GCCAACGTCG GCTGGAACGC TTTCCAGAGC GTGAGCGGCA CCGGCCAGAC CATCGCCACC GAAACCATAA ATGGTTCGTC GACATACGCT GCGGTTCTCG CCACCCAGAC TGGTGGTCGC AACGTCCTCT TCTCGAGCGA TGCGGTGATG GCCGACGCCA ACATGCTGCA ACGCGCCATC GATTATGCCG TGAGCGGCGA GACGGTGACC GTTTCGCTGA ACATGACCCG CGACGCCGGC CTTGTCGCCG CGCGCGTGGA CATGGACCAG AGCATGTACA TCGAGGACGT CAACGGCGGC ATCTATGACC AGCTCGTTCC GCTGCTTCAG CAATGGAAGG CGCAATACAA CTTCGTCGGC TCGTTCTACG TCAACATCGG CGACAACACC CAGCAGGGGA TCTATACCGA CTGGAACAAG TCGCTGCCGT ACTACACGGC GATGATCGGC CTCGGAAACG AGATCGGCAC GCACACCTAT ACCCACCCCG AAGACACCAA CCTGCTGAGC CCGTCTCAGT TGCAGTTCGA ATTCGAGCTG AGCACCCAGA TCCTCGAGCA GAAGCTCAGC GCGGCGCTGG GTTATGCCTA TACCATCGAA GGTGCCGCGA TCCCCGGCGC GCCGGAAACG CTGACGACTT CGCTGGCCAT CGAACAATAC GTCAAGACCT ATCTGACTGG CGGCTACACT GGTCAAGGCG CGGGCTATCC CAACGCCTTC GGCTATCTGA CGCCCGGTAG CCAGGACAAG GTCTATATCG CGCCGAACAC CTTTTTCGAT TTCACCCTGT TCGACTGGCT GCACCTTTCG GCGGCGGATG CCAGCGCGTT GTGGCAGTCT CAGTACGAGA AGATCGTCAG CCAGGCCGAC TCTCCCGTCG TCGTCTGGCC ATGGCACGAT TACGGTGCGA CGGCGTTCAA TTCGCCCAAC TACGCGCCCG AAATCTTCAA CACGTTCCTT GCCCAGGCCG CAGCCGACGG CATGGAATTC GTGACCCTCG CCGACCTGGC CAATCGCATC AACGCCTTCC ATGGCGCCAA GGTGACGACT TCGGTCTCAG GCAACACGAT CACCGCCAAT GTCACCGCTT CCGGCAACGT CGGTACGTTT GCTTTCGATC TGCAGGGGCA GGGCAGCCAG GTCATTTCGA GCGTGGCCGG TTGGTATGCA TACGACAGCG ACAGCGTGTT CCTGCCCCAG AATGGCGGCA CGTTCGTCAT CACCCTGGGC GCGGCGCAGA CGGATGTGAC TCACATCATC GATCTGCCCA TGCGGGCGAC GCTGATGTCG GTGACGGGGA ATGGCACGAA CCTGTCGTTT CAGATCCAGG GCGAAGGCAC GGTTGTCATC GATCTTTCCG ATCCGACCAA CAAGAGCGTC CAGGTGTCTG GCGCCACCAT CGTTTCCCAG GTCGGAGACA AGCTTACCAT CGATATCGGG CCGGTCGGGT CGCACACCGT TACCGTCACC CAGACTTCGC TCAACCACGC GCCGGTTATC GAGTCCAATG GCGGTGGAGA CACCGCGGCG ATTTCGCTGG CGGAGAACCT CCTCGCAGTG ACTGCGGTCA TCGCGACCGA CGCTGATGCC AATGCCCTGA CCTACTCGAT CACGGGAGGG GCGGACGCAT CGAAGTTCAC GATCAATGCG ACGACGGGTG CGCTTGCGTT TCTGGCCGCG CCGAACTTCG AAGTCCCGAC CGATGTCGGC GGCAACAACG TATACGATGT CGTGGTGACT GCATCCGACG GAGCGCTCAC CGATAGCCAG GCGCTGGCCG TGACGGTCAC AAACGTCAAC GAGGCGCCGG TAATCACGTC GAATGGCGGC GGTGCGACCG CCTCTATCTC GCTTGCCGAG AACAACGCGG CGGTCACCGT GGTGACCTCG ACCGATCCGG AAAACACCGC GCGGACGTAT TCGCTCTCGG GTACGGACGC TGCTCGCTTC ACGATCGACG CCGCGACCGG CGCGCTCAGT TTCGTCAACG CGCCAGATTT CGAAAACCCC ACGGATGTGG GGGCCAACAA CGTCTACAAC GTGGTCGTGA CCGCTTCCGA CGGCAGCCTG ACGGATACCC AGGCACTGGC AATCACCGTC ACGAACAAGA AGGGCGTAAC CCTCAATGCT TCGTCGAGCA CCGGCAGCGT TCTGAACGGG ACGGGCGAGG AAGACCAGCT CAATGGCTGG AAGGGTGCCG ATACCCTCTA CGGTCTCGGC GGTAACGACC GTCTCGACGG TGCGGGCGGA AACGACCGCC TTTATGGCGG TGATGGCAAG GACGTTCTGA TCGGCGGCGC CGGTACGGAT ATCATGTCTG GCGGGGCTGG TGCGGACCGC TTCGAGTTCA ACGCGCTGGG GAACAGCGTG ACGGGTGCAT TGCACGACGT CATCACCGAC TTCGAAGCGG GCATCGACTT GATCGATGTG TCGAGCATCG ACGCGAATTC CGGCAAAGGC GGGAACCAGA CCTTTGTCCT GCTAGCGGAA GGTGCGGCGT TCACGGGGGT CGGGCAACTT CGTTACTTCT ACGACAGTGC GACCGACCAG ACGATTGTCC AAGGTAACGT GAACAACAAT CTGGCAGCCG ATTTCGAAAT CGCATTGTCC GGACATCAAA CCCTGTCCGC AAGCATGTTC ATCCTCTGA
|
Protein sequence | MTYIQIDGSL SDWSSNLRID AGAVDGYQIY ATTDATDYIF AFAAPTAVGA NTTIWLNTDL NQATGYQLWG TVGAEFNVNF KSDGSAALYS GAAGGTLVAD NLVLAYNADK TMVELRVPKD LLGNPGSIDT VYDINDTAII PSFYQDNAIR VWDDSELASV IPATDTRIAI VYSATTAANY FSQTAYSDLF MAAQSQAAQA GVPFDIITEA DLTDINKLAQ YKAIVFPSFR NVQASQADEI AHTLQLASQE FHVGFIVSGE FMTNDENGNA MAGNSYSRMA TLLDATRVTG GTATSLTVTA TDPTGVVLDG YANGELVNQY ANVGWNAFQS VSGTGQTIAT ETINGSSTYA AVLATQTGGR NVLFSSDAVM ADANMLQRAI DYAVSGETVT VSLNMTRDAG LVAARVDMDQ SMYIEDVNGG IYDQLVPLLQ QWKAQYNFVG SFYVNIGDNT QQGIYTDWNK SLPYYTAMIG LGNEIGTHTY THPEDTNLLS PSQLQFEFEL STQILEQKLS AALGYAYTIE GAAIPGAPET LTTSLAIEQY VKTYLTGGYT GQGAGYPNAF GYLTPGSQDK VYIAPNTFFD FTLFDWLHLS AADASALWQS QYEKIVSQAD SPVVVWPWHD YGATAFNSPN YAPEIFNTFL AQAAADGMEF VTLADLANRI NAFHGAKVTT SVSGNTITAN VTASGNVGTF AFDLQGQGSQ VISSVAGWYA YDSDSVFLPQ NGGTFVITLG AAQTDVTHII DLPMRATLMS VTGNGTNLSF QIQGEGTVVI DLSDPTNKSV QVSGATIVSQ VGDKLTIDIG PVGSHTVTVT QTSLNHAPVI ESNGGGDTAA ISLAENLLAV TAVIATDADA NALTYSITGG ADASKFTINA TTGALAFLAA PNFEVPTDVG GNNVYDVVVT ASDGALTDSQ ALAVTVTNVN EAPVITSNGG GATASISLAE NNAAVTVVTS TDPENTARTY SLSGTDAARF TIDAATGALS FVNAPDFENP TDVGANNVYN VVVTASDGSL TDTQALAITV TNKKGVTLNA SSSTGSVLNG TGEEDQLNGW KGADTLYGLG GNDRLDGAGG NDRLYGGDGK DVLIGGAGTD IMSGGAGADR FEFNALGNSV TGALHDVITD FEAGIDLIDV SSIDANSGKG GNQTFVLLAE GAAFTGVGQL RYFYDSATDQ TIVQGNVNNN LAADFEIALS GHQTLSASMF IL
|
| |