Gene Saro_0520 details


Gene Information

Locus tag: Saro_0520
Symbol:
ID: 3918650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 561623
End bp: 562768
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 63%
IMG OID: 640443250
Product: GTP cyclohydrolase II
Protein accession: YP_495801
Protein GI: 87198544
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
        [COG0807] GTP cyclohydrolase II
TIGRFAM ID: [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase
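
The start, end, gene length, and protein length listed above are mutually consistent, which a little arithmetic can confirm. The Python sketch below assumes inclusive 1-based coordinates and a gene length that includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(561623, 562768)
    print(n_bp, "bp")
    print(protein_length(n_bp), "aa")
```

With the values from this record, the two functions give 1146 bp and 381 aa, matching the Gene Length and Protein Length fields.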


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.900028
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCCGG CCGAGATCTC CCCGATCGAA GATATCATCC GCGAAGCCGT GGAAGGCAGA 
CCCTTCATTC TCGTCGACGC GGATGACCGC GAGAACGAAG GCGACATCAT CATCCCGGCG
CAGTTCGCCA CGCCCGCGCA GATCAGCTTC ATGGCTTGTC ATGCGCGGGG GTTGATCTGT
CTTGCGATTA CGCAGGAGCG TTCCTCGCAA CTGCAGCTCA GGCCGATGGC ACCGCGCAAT
GAGTCCGGCT ACGGCACGGC ATTCACCGTC TCCATCGAGG CGAAGGAAGG CGTCACGACG
GGCATTTCCG CCCATGACCG GGCCAGGACA ATTGCCGTCG CGGTCGATCC GACGAAGGGA
GTAGACGACC TGGTGACGCC CGGGCATGTA TTCCCGCTCA CCGCCCGGGA TGGCGGCGTG
CTTGTCCGGG CCGGGCATAC CGAGGCTGCC GTCGACATTT CCCGGCTCGG CGGGCTTACA
CCTGCCGGTG TGATCTGCGA GGTCATGAAT GACGACGGCA CTATGGCGCG CCTGCCGGAC
CTGAAGATTT TCGCCGCGAA GCATGGCCTG AAAATAGGGA CGATCGCCGA TCTCATCGCC
TACCGTCGCT CGTCGGAGCA GCTCGTGGAG GAGATGGCGT CGGCGCCGTT CCAGAGCCAC
TTCTGTCCTT CGCCGATGAC GGTGCACGTC TACAGGAACA AGATTGACGG AGGCGAGCAT
GTCGCACTGG TCAAGGGCGA GATCCGCGCC GACCAGGATA CGCTGGTACG CGTACACCAG
GTCGACCTGA CCACCGACGT GCTCGGCTGG AACACGGCTT CGCCGGAATA CCTGCGACGT
GCCCTTCGTT TCATTTCCGA TCATTCGGGA CCGGGCGTTG TCGTGCTGGT GCGCGATCCC
GATCCGGAAT CCATTTCCCG CCGTGTCGCG GGCGGACGGC GCGAGTATCA CGAGAAGAAT
GCCAACCGTG ACTACGGCAT CGGGGCGCAG ATCCTGATCG ATCTCGGCGT CCGGCAGATG
ACCTTGCTGA CTTCGAGCAA GGCGAAGCTG GCCGCGCTTC AGGGGTTCGG CCTGACGATC
AACGGACGCA CCGAACTGCG GGAGAACCGT CCGGATTCCC CGATCCGCGT ACGCTCCGAT
TTCTGA
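
The gene sequence above is printed in blocks of ten bases. As a rough sketch, the Python below strips that layout and computes a GC fraction, which could be compared against the GC content field once the full sequence is pasted in. `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample input is only the first line and the final block of the record, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # Only a fragment of the record, used to illustrate the parsing step.
    fragment = """
    ATGACCCCGG CCGAGATCTC CCCGATCGAA GATATCATCC GCGAAGCCGT GGAAGGCAGA
    TTCTGA
    """
    seq = clean_sequence(fragment)
    print(len(seq), "bases in fragment")
    print(f"GC fraction of fragment: {gc_fraction(seq):.2f}")
```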
 
Protein sequence
MTPAEISPIE DIIREAVEGR PFILVDADDR ENEGDIIIPA QFATPAQISF MACHARGLIC 
LAITQERSSQ LQLRPMAPRN ESGYGTAFTV SIEAKEGVTT GISAHDRART IAVAVDPTKG
VDDLVTPGHV FPLTARDGGV LVRAGHTEAA VDISRLGGLT PAGVICEVMN DDGTMARLPD
LKIFAAKHGL KIGTIADLIA YRRSSEQLVE EMASAPFQSH FCPSPMTVHV YRNKIDGGEH
VALVKGEIRA DQDTLVRVHQ VDLTTDVLGW NTASPEYLRR ALRFISDHSG PGVVVLVRDP
DPESISRRVA GGRREYHEKN ANRDYGIGAQ ILIDLGVRQM TLLTSSKAKL AALQGFGLTI
NGRTELRENR PDSPIRVRSD F
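
The protein sequence above can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, hand-rolled translator, not the database's own method. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted, which this sketch does not model). The `translate` function and `CODON_TABLE` name are illustrative.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a cleaned DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First seven codons and last three codons of the gene sequence above.
    print(translate("ATGACCCCGG CCGAGATCTC C"))
    print(translate("GATTTCTGA"))
```

Applied to the start and end of the gene sequence, this yields `MTPAEIS` and `DF`, which agree with the first and last residues of the listed protein sequence.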