Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3563 |
Symbol | |
ID | 5077712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 181743 |
End bp | 183917 |
Gene Length | 2175 bp |
Protein Length | 724 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640481287 |
Product | catalase domain-containing protein |
Protein accession | YP_001165949 |
Protein GI | 146275789 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0753] Catalase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000309586 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACCC CCACGAAAGG CAAGGCCCCC GCTCGTGCCG CGAAGGAGCG CTCGCCCGCG CTCGACAATG CGCTTCGCGA CCATCAGCCC GGAGCGGGCC AGACCGATGA AGCGACTGCT CTCGGCAATG CCGGGGAAAT CCACCAGGAG GCTGCTGCCG AAAACGATGC GGCTGCCTTC CTGACCGACA ACTTCGGCCA TCGCCTGTCC GACAACCAGA ATAGCCTCAG GGCCGGAATG CGCGGTCCGA CGCTGATCGA GGACTTCATC CTCCGCGAGA AGATCTTCCA CTTCGACCAC GAGCGCATTC CGGAGCGGAT CGTCCACGCG CGCGGATCGG GTGCGCACGG CGTGTTCGAG GTGACGCGCG CGATCCCCGA CCTGACCAGG GCCGGGTTGT TCCAGAAAAA GGGCCAGACC TGCCCGGTCT TCGTGCGCTT TTCCACCGTG GCCGGCGGGG CCGGCTCGAT CGACACCCCG CGCGACGTGC GCGGCTTCGC GGTCAAATTC TATACCGACG AGGGCAACTG GGACCTGGTC GGCAACAACA TCCCGGTGTT CTTCATCCAG GACGCGATGA AGTTTCCCGA TCTGGTCCAT TCGGTGAAGA TGGAAGCCGA TCGCGGCTAT CCGCAGGCGG CCAGTGCCCA TGACACCTTC TGGGACTTCA TCGGGCTGAT GCCGGAATCG ATGCACATGA TCATGTGGGC GATGAGCGAC CGCGCCATTC CCCGCACGCT GCGCATGATG GAAGGCTTTG GCGTGCACAC CTTCCGCTTC GTCAATGCGG CGGGCGAGGG GCGGTTCGTC AAGTTCCACT GGAAGCCGGT GCTGGGCATG GAATCGCTGA TCTGGGACGA GGCGGTGAAG GTCGCCGGGG CCGATCCCGA TTTCCATCGC CGCGACCTGT TCGAATCCAT CGCTGCCGGG CACTTCCCGG CGTGGGACCT TGGGGTCCAG GTCTTCGACG AGGAATTCGC CGCGAGCCAG CCGTACGATG TGCTCGATGC GACCAAGCTG ATTCCGGAAG AGGACGTCCC CGTCGAGATC GTCGGGCGCA TGACGCTGAA CCGCAATGTC GACAACTTCT TCGCCGAGAC CGAGCAGGTG GCGTTCCTGC CATCGAACGT GATCCCGGGT ATCGACTTCT CGAACGACCC GTTGCTGCAA GGGCGCCTGT TCTCCTACCT CGATACGCAG AAATCGAGGC TTGGCACGAC AAACTTCCAC CAGATTCCGG TCAATGCGCC CAAGTGCCCG TTCCATAACA TGCAGCGCGA CGGCCTGATG CAGACGCTGG TGCCCACAGG CCGCGCCAAC TACGAGCCCA ATTCGCTCGA CGAAGCGGGC GAGGACAGCG GGCCGCGCGC CTGCCCGGAA ACCGGCTTCA CGTCGTTCCG CGAGAATGGC GAGCGCCACG ATCCGACCGA AAAGGTGCGC GTGCGGGCGG ATCTCTTCGC CGACCACTAC AGCCAGGCGG CGCTGTTCTT CCACTCGCAG ACCGAGAGCG AACAGGCGCA CATCGCCTCT GCGCTGGTGT TCGAACTGTC CAAGGTCGCG CTGGAACATG TCCGGGCGCG GGTCGTGTCG CGGTTGCGCA ACATTGACGA GACGCTGGCG CAGCGCGTTG CCGATGGCCT TGCGATGGAC CTGCCGGAAA AGGCGCCTGC CGCACGCCAG CCGGTGAAGA TGAAGCCATC GGACGCCCTG TCGATCCAGA AGCAGGCGAA GAAGACCTTT GCCGGACGCA AGGTCGGCAT TCTCTTTGCC GAAGGATCGG ACAAGGCGAC GATCGACAAG CTGAAGGCGG GTGTGGAGGA GGCGGGTGGC ACCGTCTTCC TCGTCGCGCC CAAGGTCGGC GGCATCCCGG TCAAGGGCGG CACGCTGAAG GCCGATGGCA AGCTGGATGG ATCGCCCTCC GTCCTGTTCG ACGCGGTGGC ATCGGTGCTG ATGCCGGAAG CGGCGGCGAA GCTCGCCATG CAGGGTGCGG CCGTGCAGTG GTTCATGGAT GCCTATGGCC ACTGCAAGAC AATCGCCCAC TGCAACGGCA CCCGGATCAT CCTCGAGAAG GCCGGGGTGG AGCCTGACGA GGGCGTGGTG CCCAATGAAA AGCTGCTCGA AGTCGGCCCT GTGCGCCACT TCGCGCGTGA GCCGAAGGTT CGCGATCTGG CCTGA
|
Protein sequence | MATPTKGKAP ARAAKERSPA LDNALRDHQP GAGQTDEATA LGNAGEIHQE AAAENDAAAF LTDNFGHRLS DNQNSLRAGM RGPTLIEDFI LREKIFHFDH ERIPERIVHA RGSGAHGVFE VTRAIPDLTR AGLFQKKGQT CPVFVRFSTV AGGAGSIDTP RDVRGFAVKF YTDEGNWDLV GNNIPVFFIQ DAMKFPDLVH SVKMEADRGY PQAASAHDTF WDFIGLMPES MHMIMWAMSD RAIPRTLRMM EGFGVHTFRF VNAAGEGRFV KFHWKPVLGM ESLIWDEAVK VAGADPDFHR RDLFESIAAG HFPAWDLGVQ VFDEEFAASQ PYDVLDATKL IPEEDVPVEI VGRMTLNRNV DNFFAETEQV AFLPSNVIPG IDFSNDPLLQ GRLFSYLDTQ KSRLGTTNFH QIPVNAPKCP FHNMQRDGLM QTLVPTGRAN YEPNSLDEAG EDSGPRACPE TGFTSFRENG ERHDPTEKVR VRADLFADHY SQAALFFHSQ TESEQAHIAS ALVFELSKVA LEHVRARVVS RLRNIDETLA QRVADGLAMD LPEKAPAARQ PVKMKPSDAL SIQKQAKKTF AGRKVGILFA EGSDKATIDK LKAGVEEAGG TVFLVAPKVG GIPVKGGTLK ADGKLDGSPS VLFDAVASVL MPEAAAKLAM QGAAVQWFMD AYGHCKTIAH CNGTRIILEK AGVEPDEGVV PNEKLLEVGP VRHFAREPKV RDLA
|
| |