Gene Information |
Locus tag | Saro_0606 |
Symbol | |
ID | 3915618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 651140 |
End bp | 654733 |
Gene Length | 3594 bp |
Protein Length | 1197 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640443336 |
Product | allophanate hydrolase subunit 2 |
Protein accession | YP_495887 |
Protein GI | 87198630 |
COG category | [E] Amino acid transport and metabolism; [I] Lipid transport and metabolism |
COG ID | [COG0511] Biotin carboxyl carrier protein; [COG1984] Allophanate hydrolase subunit 2; [COG2049] Allophanate hydrolase subunit 1; [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain; [TIGR02712] urea carboxylase |
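
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in Protein Length. The variable names are illustrative only and are not part of the database record.

```python
# Values copied from the Gene Information table.
start_bp = 651140
end_bp = 654733
reported_gene_length = 3594      # bp
reported_protein_length = 1197   # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
remainder = gene_length % 3
codons = gene_length // 3

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, codons, protein_length)
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields in the record.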
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0957568 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAACTTCG ACACCGTCCT CATCGCCAAT CGCGGCGCCA TCGCCACGCG GATCATCCGT ACCCTGCGGC GCATGGGACT GCGTTCGGTC GCGGTCTATT CCGAGGCCGA CAAGGATTCG CTGCACGTCG TGCTTGCCGA CGAGGCAATC TGCATAGGCG CGGCGCGGGC GGCGGAAAGC TACCTCAACA TCCCTGCAAT TCTCGATGCC GCCCGCCGCA CCGGCGCCGG GGCAATCCAC CCCGGCTACG GCTTCCTTGC CGAGAACGTG GAGTTCGCCG AAGCCTGCGA GAAGGAAGGC ATCGTCTTTA TCGGTCCCAC GCCCAACAAC ATCAGGACCT TCGGCCTGAA GCACAGCGCC CGCGCGCTTG CCGCGGCCCA CGGTGTCCCG CTGGCTCCCG GCACCGATCT GCTGACCGAC GAGACCGAGG CGGTCCAGGC CGCGAACGGC ATCGGATACC CGGTAATGCT CAAGGCCACC GCTGGAGGCG GCGGCATCGG CATGCGCGTC TGCGAGGACG AAGCGGACGT CCGCGAAGGC TTTTCCGCGG TGGCGCGGCA GGGCCTCGGC AATTTCGGCG ATGCCGGCGT CTTTCTCGAG CGCTACATTC GCCAGGCGCG GCACATCGAG GTTCAAATCT TCGGCGACGG TCGCGGCCGT ATCGTGGCGC TCGGCGAACG GGACTGCTCG CTCCAGCGGC GCAACCAGAA GGTCGTGGAG GAAGCGCCCG CTCCCCTCCT TCCACCGGCG GTGCGCAGCG AACTGATCGC CGCCGCCATC CGCCTCGGTC AGGCGGCCGG CTATCGCTCG GCCGGAACGG TCGAATTTCT CTATGATGCC GAGCGCGAGG AATTCTTCTT CCTCGAAATG AACACCCGCC TCCAGGTCGA ACACGGCGTT ACCGAAGAGG TCATGGGCGT CGATCTGGTC GAATGGATGG TCCGGGGCGG CGGCGGCGAT TTCGGGTTCC TCGACGACGA CCCGCCGCGT CCATCCGGGC ACTCGATCCA GGTCCGGCTC TATGCCGAAG ATCCCGCACT CGACTATCGT CCGACCTCGG GCACCCTTAC CGCAGTGACT TTCCCGGAAG GCGTCCGGAC AGAGACATGG TGCATGGCCG GAACCACGGT CAGCACGTGG TACGACCCGA TGCTGGCCAA GCTGATCGTC CACGCCGAAT CACGCGAGGC AGCAGTGGCG GCGATGCAGG ACGCGCTCGA CCGCAGTCGC ATTGACGGGT TCGAAACCAA CCTGCGCTGG CTGCGTGATG TCGTGCGCTC GCCCGCCTTC ACCAGCGGCG AGGTCTCCAC CCGCGCCCTG TCTCACGTCG CTCACGTGCC ACGCAGCATA ACCGTCGTCA GCGGCGGCAC CGCGACGATG GCGCAGGACT GGCCCGGACG GCAGCGGCTC TGGGCAGTGG GCGTCCCGCC ATCCGGACCG ATGGACGACC TGTCGTTCCG GCTCGGCAAC CGCCTGCTCG GCAATCCCGA GGGTACTGCC GGTCTCGAAG TGGCGATCAC CGGCCCGACC CTCACGTTCA ATACAGCCGC CCGCGTGTGC GTGACCGGCG CGGATTTCGG GGCGCGGCTC GACGGGCAGC CTGTCCCGCG CGGCATGGCC ATCGACATTG CCGCTGGCCA GACGCTCGCG CTCGGTCGCG CCTCGGGCGG AGGGATGCGC GGCTACATCC TGTTTGCCGG CGGCCTCGAC ATCGCGCCGT ACCTCGGCAG CCGCAGCACG TTCGAACTCG GCCAGTTCGG CGGTCACGCC GCGCGCCGCC TGCTGGCGGG CGACACGCTC 
CATCTCGGCG ACGAACCGGC GCAGCCCGCC CTTCCCGCCG CAAACCTGCC CGAACTGTCG AACGAATGGG CGCTGCGCGT CATGTACGGG CCGCACGGCG CACCCGACTT CTTCACCCGC GAAGACATCG ACACGCTGGT CGCCGCTGAA TGGCAGGTGC ACTACAACAG CAACCGCACT GGCATCCGGC TTGTCGGCCC CAAGCCGCAG TGGGCGCGCG AGGACGGCGG CGAGGCGGGC CTGCACCCCT CGAACATCCA CGACAATCCC TATGCGATCG GCGCGGTCGA CTTCACCGGC GACATGCCGA TCATCCTCGG ACCGGACGGT CCCTCGCTCG GCGGCTTCGT CTGCCCCTTC GTGGTGATCG CGGCGGACCG CTGGAAGATC GGCCAGCTCA CGCCGGGCGA CAAGCTCCGC TTCGTTCCGG TAAATTGCGC GGATGCCGCC GCGGCGAACG ACCAGCAACG GCGCTTCCTC GAAACCGGCA AGCCCGCGCA GGGCTGTCCG GGGAGGCCGA TGGAGACGCT CTCGCCCATC CTCGCCGTCA TCGATGAAAG CCCGCGCAGC CCCCGGACCG TCTATCGCCA GCAGGGCGAC CGCAACATCC TGGTAGAATA CGGGCCGATC GTACTCGACA TCGAATTGCG CATCCGCGTG CAGGCGCTGA TGACCGAGCT GGAACGGCTC GCCCTGCCCG GCGTGATCGA CATCGTCCCG GGCATCCGCT CGCTCCAGTT CCATTTCGAC GGCGATGCGA TGACCCAGGA GGCCGCGCTC TCGGTCCTGA TCGCGGCAGA GGAGCGGCTG GGCGACCTCG AGGACTTCAC CATCCCCTCG CGCATCGTCC ATCTGCCGCT GAGCTGGCGC GATCCGGCCA CGATCGAAAC GATCGAGAAG TACATGGGCG CCGTGCGCGA CGACGCGCCA TGGTGTCCCG ACAACATCGA GTTCATACGC CGGATCAACG GCCTGCCCGA CGTGGCGGCG GTGGAGAACC TGATCTTCGA GGCGAACTAC CTCGTCCTCG GCCTGGGCGA CGTCTATCTC GGTGCACCGG TGGCGACGCC CGTCGATCCC CGCCATCGGC TGGTCACGAC CAAGTACAAC CCGGCGCGCA CCTGGACGCC GCCGAACGTG GTCGGCATCG GCGGTGCCTA CATGTGCATC TACGGGATGG AAGGCCCCGG CGGCTACCAG CTCTTCGGCC GCACCATCCA GGTGTGGAAC ACCCACCGCC AGACCGATGC CTTCATCGAT GGCAAGCCGT GGCTGCTGCG CTTCTTCGAC CAGATCCGCT TCTACCCGGT CAGCGCCGAA GAGCTTGAAG AATGGCGGCG GGACTTTCCC GCGGGCCGGC GCTCGATCCG GATCGAGCCT TCCGAGTTCC GCCTTGCCGA CTACCGTCGC TACCTGGCCG ACAATGCCGA AGGGATCGCC GAGTTCGAGG CAAGGCGTCA GGCCGCTTTC GACGAGGAAC GCGCCGAATG GCAAAGGCGC GGAGAATTCG ACCGCACCGA CCTGGTCGAA CCCGAAGCGG CCGAAGCGGG CACCGTCGAA GTGCCCGATG GCGCGGACCT CGTCGAAGCA CCCTTCGGCG GGAGCGTCTG GAAAATGCTC GTGTCGGTGG GCGACGAGGT CGAGGCCGGC GAGACCATCG CGATCATCGA GGCGATGAAG ATGGAATGCC GGGTCGAAAG TCCGGGGGCG GGCACTGTCG CCGCGCTCTA TGCGCAGGAG CGCCAGTCGG TCCAGCCCGG CACGCCGATG CTCGCCCTGA CGAGGCACGC ATGA
|
Protein sequence | MNFDTVLIAN RGAIATRIIR TLRRMGLRSV AVYSEADKDS LHVVLADEAI CIGAARAAES YLNIPAILDA ARRTGAGAIH PGYGFLAENV EFAEACEKEG IVFIGPTPNN IRTFGLKHSA RALAAAHGVP LAPGTDLLTD ETEAVQAANG IGYPVMLKAT AGGGGIGMRV CEDEADVREG FSAVARQGLG NFGDAGVFLE RYIRQARHIE VQIFGDGRGR IVALGERDCS LQRRNQKVVE EAPAPLLPPA VRSELIAAAI RLGQAAGYRS AGTVEFLYDA EREEFFFLEM NTRLQVEHGV TEEVMGVDLV EWMVRGGGGD FGFLDDDPPR PSGHSIQVRL YAEDPALDYR PTSGTLTAVT FPEGVRTETW CMAGTTVSTW YDPMLAKLIV HAESREAAVA AMQDALDRSR IDGFETNLRW LRDVVRSPAF TSGEVSTRAL SHVAHVPRSI TVVSGGTATM AQDWPGRQRL WAVGVPPSGP MDDLSFRLGN RLLGNPEGTA GLEVAITGPT LTFNTAARVC VTGADFGARL DGQPVPRGMA IDIAAGQTLA LGRASGGGMR GYILFAGGLD IAPYLGSRST FELGQFGGHA ARRLLAGDTL HLGDEPAQPA LPAANLPELS NEWALRVMYG PHGAPDFFTR EDIDTLVAAE WQVHYNSNRT GIRLVGPKPQ WAREDGGEAG LHPSNIHDNP YAIGAVDFTG DMPIILGPDG PSLGGFVCPF VVIAADRWKI GQLTPGDKLR FVPVNCADAA AANDQQRRFL ETGKPAQGCP GRPMETLSPI LAVIDESPRS PRTVYRQQGD RNILVEYGPI VLDIELRIRV QALMTELERL ALPGVIDIVP GIRSLQFHFD GDAMTQEAAL SVLIAAEERL GDLEDFTIPS RIVHLPLSWR DPATIETIEK YMGAVRDDAP WCPDNIEFIR RINGLPDVAA VENLIFEANY LVLGLGDVYL GAPVATPVDP RHRLVTTKYN PARTWTPPNV VGIGGAYMCI YGMEGPGGYQ LFGRTIQVWN THRQTDAFID GKPWLLRFFD QIRFYPVSAE ELEEWRRDFP AGRRSIRIEP SEFRLADYRR YLADNAEGIA EFEARRQAAF DEERAEWQRR GEFDRTDLVE PEAAEAGTVE VPDGADLVEA PFGGSVWKML VSVGDEVEAG ETIAIIEAMK MECRVESPGA GTVAALYAQE RQSVQPGTPM LALTRHA