Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0520 |
Symbol | |
ID | 3918650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 561623 |
End bp | 562768 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640443250 |
Product | GTP cyclohydrolase II |
Protein accession | YP_495801 |
Protein GI | 87198544 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase [COG0807] GTP cyclohydrolase II |
TIGRFAM ID | [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.900028 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCCGG CCGAGATCTC CCCGATCGAA GATATCATCC GCGAAGCCGT GGAAGGCAGA CCCTTCATTC TCGTCGACGC GGATGACCGC GAGAACGAAG GCGACATCAT CATCCCGGCG CAGTTCGCCA CGCCCGCGCA GATCAGCTTC ATGGCTTGTC ATGCGCGGGG GTTGATCTGT CTTGCGATTA CGCAGGAGCG TTCCTCGCAA CTGCAGCTCA GGCCGATGGC ACCGCGCAAT GAGTCCGGCT ACGGCACGGC ATTCACCGTC TCCATCGAGG CGAAGGAAGG CGTCACGACG GGCATTTCCG CCCATGACCG GGCCAGGACA ATTGCCGTCG CGGTCGATCC GACGAAGGGA GTAGACGACC TGGTGACGCC CGGGCATGTA TTCCCGCTCA CCGCCCGGGA TGGCGGCGTG CTTGTCCGGG CCGGGCATAC CGAGGCTGCC GTCGACATTT CCCGGCTCGG CGGGCTTACA CCTGCCGGTG TGATCTGCGA GGTCATGAAT GACGACGGCA CTATGGCGCG CCTGCCGGAC CTGAAGATTT TCGCCGCGAA GCATGGCCTG AAAATAGGGA CGATCGCCGA TCTCATCGCC TACCGTCGCT CGTCGGAGCA GCTCGTGGAG GAGATGGCGT CGGCGCCGTT CCAGAGCCAC TTCTGTCCTT CGCCGATGAC GGTGCACGTC TACAGGAACA AGATTGACGG AGGCGAGCAT GTCGCACTGG TCAAGGGCGA GATCCGCGCC GACCAGGATA CGCTGGTACG CGTACACCAG GTCGACCTGA CCACCGACGT GCTCGGCTGG AACACGGCTT CGCCGGAATA CCTGCGACGT GCCCTTCGTT TCATTTCCGA TCATTCGGGA CCGGGCGTTG TCGTGCTGGT GCGCGATCCC GATCCGGAAT CCATTTCCCG CCGTGTCGCG GGCGGACGGC GCGAGTATCA CGAGAAGAAT GCCAACCGTG ACTACGGCAT CGGGGCGCAG ATCCTGATCG ATCTCGGCGT CCGGCAGATG ACCTTGCTGA CTTCGAGCAA GGCGAAGCTG GCCGCGCTTC AGGGGTTCGG CCTGACGATC AACGGACGCA CCGAACTGCG GGAGAACCGT CCGGATTCCC CGATCCGCGT ACGCTCCGAT TTCTGA
|
Protein sequence | MTPAEISPIE DIIREAVEGR PFILVDADDR ENEGDIIIPA QFATPAQISF MACHARGLIC LAITQERSSQ LQLRPMAPRN ESGYGTAFTV SIEAKEGVTT GISAHDRART IAVAVDPTKG VDDLVTPGHV FPLTARDGGV LVRAGHTEAA VDISRLGGLT PAGVICEVMN DDGTMARLPD LKIFAAKHGL KIGTIADLIA YRRSSEQLVE EMASAPFQSH FCPSPMTVHV YRNKIDGGEH VALVKGEIRA DQDTLVRVHQ VDLTTDVLGW NTASPEYLRR ALRFISDHSG PGVVVLVRDP DPESISRRVA GGRREYHEKN ANRDYGIGAQ ILIDLGVRQM TLLTSSKAKL AALQGFGLTI NGRTELRENR PDSPIRVRSD F
|
| |