Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2374 |
Symbol | |
ID | 3915719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2519335 |
End bp | 2523933 |
Gene Length | 4599 bp |
Protein Length | 1532 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640445129 |
Product | multihaem cytochrome |
Protein accession | YP_497644 |
Protein GI | 87200387 |
COG category | [S] Function unknown |
COG ID | [COG5276] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGGC TCATGACCCA AAAGCGCCGC AATCGCACTC TCCCGCTGGC ATGGCTCGTC GCGGCGCTCG CTGCGTTTCT GGTGCCGAGC GCGCTCTACG CGTCGGGCGA GGAGAAGGCG CAGAAAGTCA CCTATACCCG TGCTCCTGCG GCACCGCGCG GCCAGCCCGG CGACACGGTC GAGGAGCAGT GGGTCTTTGT CGACAAGCAG AACGCCGGCT GCGTAAGCTG CCACACGGCG AGTGATCATC GCACGATGCA CGCGAGCCCG GCGGTCGTGC TGTCCTGTGC CGATTGCCAC GGCGGCAACA CGAAGGTCGT TGCCGACCGC TCGTGGAACC GCAATTCGAT GGAATACGTC GGCGCGATGA AGGACGCCCA CGTCCTGCCC CGGTATCCCG TTGCCTGGGG CTGGCCCTCG TCCGCAAATC CCAAGCGTTC CTATGCTCTT CTGGCCAAGG AAAGCCCTGA GTTCGTCCGT TTCATCAACC CTTCGGACTA TCGTGTCGCA CGCGATTCCT GCGGTGCATG CCACATGTCC ATCATCGAGG CGTCCGAGCG TTCGCTGATG TCGACAGGCG CGATGCTGTG GGGCGGCGCG GCCTACAACA ACGGCATTGT CCCCTTCAAG AACTACCTCT TCGGCGAAAG CTTCACCCGC AATGGCGAAG CGGCGCTGAT CAAGTCTCCG TCCAAGGAGA TCGGCCCCGA CGGCCAGCCC ATGTGGGGCA CTGTCACGCC CAGGGAAAAG GCGCGTGGTG CGCTGCCGAT CATGTATCCG CTGCCGCGCT GGCACACGAT TCCGCCGGGC GACGTGTTCC GCGTTTTCGA GGACGGTGGT CGCACGATCA ACCCGCAGTT CCCCGAGATC GGGCTTCCCA ATTCCACCGG CCTTATCCAG CGCCTCGAAG AGCCTGGCCG TCCCGACCTC AAGCAGTCGA ACCGCGGACC CGCGACCGGT CTGCGCGTCG CGATCCCGGT CCTGAACATA CACAAGACGC GCCTGAATGA CCCCTTCCTC TACCAGATGG GCACCAACGA CCAGCCGGGC GACTATCGCT CCTCCGGCTG CGCTTCGTGC CACGTCATCT ACGCGAACGA CCGCGAACCT CGGCACAGCC TGAATTATGG CAAGTGCGGC CGTGACGGCC AGACCGCGAC CGCCGACCCC ACCATCAATT CCCTGCGCGA AGGCCAGCAC CGCAAGGGCG CCTACGGCAG CTACGATGCC GCCAAGCAGG AACACCACGA AAAGGTCCGG GCTACCGTCC TGGGCGGCAA GGGCACCTAT CTTGGTGGCG ACAGGTCCGA TCCCGAGCAG TCAGCCGACA TGGCAGCCGA TTGCGCCCGT GCGATTTCGT CGGCCGAACA GCTCGTCTAT GCGCCGGCAA AGGGCGTCGC GCCGCATGGC ACGATGGACC ACGACAAGAT GGACCAGGCC TCCGCCCACG CCCTGGCAGG TCTGGATGAA AAGGGCCATG CGAAGAAGGG CGGCAATAAC GCCGCAGGGC ACGGCGACGA ACATGAAGGC CCCGTGGCCA TGCAAGACCG CGAACGCGGC CACCCGCTAG TCCACGCCTT CACCCGCGCC ATACCCACAG CGCAGTGCAT GAACTGCCAC ATGCACCAGC CGAACATCTT CCTGAACTCC TATCTCGGCT ACACGATGTG GGACTACGAG TCCGATGCTC CCACCATGTG GCCGGGACCG GAGAACGTCG CGCCCAAGCC CGCCGGCATG TCCGACGCGG ACTACGAGAA GACGTACAAG AAGCAGTTCT ATCCCACGAT CGACGAGCAG CGCGAGGTCC TCGACCGTAA TCCGGAGGGC GCTTCGACGC GCGGGCTCTG GCGCGACGTG GAATTCCTGC GCAACGTCTA TGACCTCAAC GGGCAGCGCA AGGACACGCA GCTCGCCGAC TATCACGGGC ACGGCTGGAA CTTCCGCGCG ATCCTCAAGC GCGACCGTCG CGGCAACCTG CTCGACGATG AGGGCGATAT GTCGTCCTAC GGCACCGACA AGGCGCACAT CGTCTCGCCC GACGATCCGG AGAAGTTCCG CAAGGCGGGT GAGGGCAAGT TCGTCGATCC CGGCGAAAGC AATCCCGGCA AGGCCGTCCA CATGATGGAC ATCCATGCCC AGCTCGGCAT GCAATGCGCG GATTGCCATT TCTCGCAGGA CAGCCACGGC AATGGACTGA TCTACGGTGA AGTCGCCAAT GCGGTCGAGA TCGGCTGCAA GGACTGCCAC GGCACGCCTG ACGCCTACCC GACGCTCCTG ACCTCGAACA TGGCCGCCCC GGAGAAGGGC AACAACCTTG CACTGCTGCG CAACGCCGAT GGCCAGCGCC GCTTCGAGTG GTTCAACGAG GCTGACGGGC GCCGCGTACT GGTCCAGCGT TCGATCATCG ATCCCAACCT GTCCTGGCGC GTCAGCTTGG TGAAGGATTC GGTAGACGCG CGCTTTGCCG GCAAGATCGA CACTGCCGGC AAGCCGATCT TCAATCCGAA GGCCGCCCGC GCCAAGCTCA TGGCAAAGTC CGCGTCCGAG GACGGCGTCT ACAAGTTCGG CACCGGCGTG CCCAAGGAAG CGCGCGCGCA CCGCGACGAC GACATGGCCT GCTTTACATG CCACCTTTCG TGGACCACTT CGTGCGCAGG CTGTCACCTG CCTATCGAGG CGAACTGGAA GTCGGCCACG CACAAGTACG AGGAAGACTA CACCCGCAAT TATGCGACCT ATAACCCTCA GGTCGCGCGC GATGACATGT TCCAGTTGGG CAAGCACCAG CGCAACAAGT CGTCGGGCTC CGATCCGGTG CAGTTCGATG CCTCGGGCAA CCCTGTGTCG GGCAAGGCGA TAACCGCGCC TATCCGTTCG TCATCGGCGC TGATCCTGTC GTCGACCAAC ATCAACCGCG AACGCATCTA TGTGCAGCAG CCGCCGATCT CGGCCATCGG CTATTCCAGC CAGGCCTTTG CGCCTCACTT CCCGCACACC GTCCGCAAGA ACGAGACGAA GCAATGCACC GACTGCCACG TCAGCCAGGA TGACGACAAC AACGCCACGA TGGCGCAACT CCTGCTGCTG GGCACGAACT ACGTGAACTT CGTTGGCATG AACGCCTGGT TCGGTCTCGA CGGCGGCTTC GAGGCGGTGC GAGTGACCGA ATGGGACGAA CCCCAGGCGG TGATCGGCTC GTACCTCCAT CGCTATGCCT ATCCGGACTT CTGGAAGCAG CATGTCGAGA AGAACGGCCG CGAGTTGAAG AACTGGACGC GCGGCAAGCC TTTCGACGGC AAGCTTTCCG GAGAGACGAC GGGCCACGAG GAGTTCTCGA ACGTGGTCGA AGGCACGAAG GACGCGGTCC GGTGCCTGCA GATGCGCGGG GAGTACATGT TCGTGGCCGA AGGAAAGGGC GGCTTCCGCG CCTACGATAT CGCGTCAGTC GCCAACAAGG GCTTCTCCGA GCGCATCGTC AACGCGCCGT TCTCGCCGCT CGGCCAGGAC ACCCATGTCG CGACGACCAA CGCGACCTGC ATGGCGCTGC CGACCAACCA GTCGATCGCA CCCACCCGCA ACACGGCGGA GCTGCGCGGC ATCAACCAGG AACAGCCGTT CTCGCCGATC TACCACTACG CCTTCGTCAC CGATGCCGTG GAAGGCCTGA TTGCGGTCAA CGTCGACACA CTGGCGGATG GCGAGTTCCG CAACAACTTC TTCACCCGCT CGCTGACCTT CAATCCGGAT GGCGTACTGA CCGGTGCGCG CCACATCACG CTGGCCGGTG ACTACGCCTA TGTCGTCACC GACAAGGCGG TGGTCACGGT CCATCTGACG AAGCCCTGGT CGCCGGACAA GCCATGCGAG GTCAGCGACA AGAAGGGCGG CGAGACGTGC CTCGACCCGC GCATCACATC GGTCGTTCCC TTGCGCGATC CGCGCGCCAC GGCGGTGCAG TTCCGTTATC TCTGGGTGAC GACTGCCAAC GGGCTCGAAC TGATGGACAT TACCAGCCTT GCCCAGCCCA GGCCGGTGCC TTCGGCAACG GTCCCGCTGG CCGATGCTCG CCGCGTCTAT CTTGCCCGCA CCTACGCCTA TGTCGCGGCC AAGCAGCAGG GCCTCGTGAT CGTCGACATC ACCGCGCCCA CCCGGCCGGC GATCTACACG AGCTATACCG CCGACGGGCA GCTCAACGAT GCCGAGGACG TTATCGTCGC ATCGACCAAC GCTTCGCTCT TTGCCTACGT CGCGGATGGG CGAAATGGCA TGAAGGTGCT GCAGCTTACA TCGCCGGCCA GCCAGCCCAA CTTCTACGGG TTCTCGCCCG AGCCGAGGCC GGAGCTGATC GCGTGGGCCC GCACGCCGAC GCCTGCGCTT GCGCTTTCCA AGGGGCTGGA CCGCGACCGC GGCGTGGATG AGACCGGCGG CCAGATCGCG ATTTTCGGTC GCCTGGGGTC GCGGCCGTTC AACAGGGCAG AAATGGAAGA TCTTTTCCTC AACAGCCGGG GAGAGGTTTT CCGGGTCACC AACAAGGTGG ACATGAGCCT GTGGGTCGGG GCGAAGACGG CGCCCATGCT CGCCAGGGTC GATCAGTAG
|
Protein sequence | MNRLMTQKRR NRTLPLAWLV AALAAFLVPS ALYASGEEKA QKVTYTRAPA APRGQPGDTV EEQWVFVDKQ NAGCVSCHTA SDHRTMHASP AVVLSCADCH GGNTKVVADR SWNRNSMEYV GAMKDAHVLP RYPVAWGWPS SANPKRSYAL LAKESPEFVR FINPSDYRVA RDSCGACHMS IIEASERSLM STGAMLWGGA AYNNGIVPFK NYLFGESFTR NGEAALIKSP SKEIGPDGQP MWGTVTPREK ARGALPIMYP LPRWHTIPPG DVFRVFEDGG RTINPQFPEI GLPNSTGLIQ RLEEPGRPDL KQSNRGPATG LRVAIPVLNI HKTRLNDPFL YQMGTNDQPG DYRSSGCASC HVIYANDREP RHSLNYGKCG RDGQTATADP TINSLREGQH RKGAYGSYDA AKQEHHEKVR ATVLGGKGTY LGGDRSDPEQ SADMAADCAR AISSAEQLVY APAKGVAPHG TMDHDKMDQA SAHALAGLDE KGHAKKGGNN AAGHGDEHEG PVAMQDRERG HPLVHAFTRA IPTAQCMNCH MHQPNIFLNS YLGYTMWDYE SDAPTMWPGP ENVAPKPAGM SDADYEKTYK KQFYPTIDEQ REVLDRNPEG ASTRGLWRDV EFLRNVYDLN GQRKDTQLAD YHGHGWNFRA ILKRDRRGNL LDDEGDMSSY GTDKAHIVSP DDPEKFRKAG EGKFVDPGES NPGKAVHMMD IHAQLGMQCA DCHFSQDSHG NGLIYGEVAN AVEIGCKDCH GTPDAYPTLL TSNMAAPEKG NNLALLRNAD GQRRFEWFNE ADGRRVLVQR SIIDPNLSWR VSLVKDSVDA RFAGKIDTAG KPIFNPKAAR AKLMAKSASE DGVYKFGTGV PKEARAHRDD DMACFTCHLS WTTSCAGCHL PIEANWKSAT HKYEEDYTRN YATYNPQVAR DDMFQLGKHQ RNKSSGSDPV QFDASGNPVS GKAITAPIRS SSALILSSTN INRERIYVQQ PPISAIGYSS QAFAPHFPHT VRKNETKQCT DCHVSQDDDN NATMAQLLLL GTNYVNFVGM NAWFGLDGGF EAVRVTEWDE PQAVIGSYLH RYAYPDFWKQ HVEKNGRELK NWTRGKPFDG KLSGETTGHE EFSNVVEGTK DAVRCLQMRG EYMFVAEGKG GFRAYDIASV ANKGFSERIV NAPFSPLGQD THVATTNATC MALPTNQSIA PTRNTAELRG INQEQPFSPI YHYAFVTDAV EGLIAVNVDT LADGEFRNNF FTRSLTFNPD GVLTGARHIT LAGDYAYVVT DKAVVTVHLT KPWSPDKPCE VSDKKGGETC LDPRITSVVP LRDPRATAVQ FRYLWVTTAN GLELMDITSL AQPRPVPSAT VPLADARRVY LARTYAYVAA KQQGLVIVDI TAPTRPAIYT SYTADGQLND AEDVIVASTN ASLFAYVADG RNGMKVLQLT SPASQPNFYG FSPEPRPELI AWARTPTPAL ALSKGLDRDR GVDETGGQIA IFGRLGSRPF NRAEMEDLFL NSRGEVFRVT NKVDMSLWVG AKTAPMLARV DQ
|
| |