Gene Saro_1473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1473 
Symbol 
ID3916138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1513254 
End bp1514396 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content60% 
IMG OID640444216 
ProductGTP cyclohydrolase II / 3,4-dihydroxy-2-butanone 4-phosphate synthase 
Protein accessionYP_496750 
Protein GI87199493 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
[COG0807] GTP cyclohydrolase II 
TIGRFAM ID[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAAG TTTCACCGAT CGAGACGATC ATCGAGGACG CTCGCAATGG CCGGCCGTAC 
ATCCTTGTGG ATGCGCCTGA TCGCGAGAAC GAGGGTGACG TGATCATCCC AGCGCAGTTC
GCAACTCCCA ATGCCATCAA TTTCATGGCG ATGCACGCGC GAGGGCTCAT TTGCCTTGCA
ATCTCGAGCG AGCGCGCCAG TCACCTCGAT TTGCCAATGA TGGAGGCGCG CAACCAATCT
TCCCTTAGCA CTGCCTTCAC CGTCTCGATC GAGGCGCGTG ACGGAGTTAC GACAGGAATA
TCTGCACATG ACCGCGCCCA CACCATTGCG GTTGCCGTCG ATCCTTCCAA AGGGCCGGAA
GACCTGGTCT CTCCCGGCCA TGTGTTCCCG CTGGTTGCGC GAGATGGTGG GGTGCTTGTC
CGCGCTGGGC ACACCGAAGC CGCCGTTGAT ATCTCGCGGC TGGCGGGGTT GGCGCCTGCT
GGCGTCATCT GCGAAATCAT GAATCCTGAC GGGTCTATGG CGCGTCTACC GGAATTGATT
GAATTCGCGC GCAAGCACGA GATGAACATC GGCACCATCG CCGACCTCAT CGAATATCGC
CGCCGTTCCG AGTCTCTTGT AGAGCGCGTG GTGGAGGCGC CGTTCGACAG CTGGCACGGC
GACTTCCGTA TCATCGTTTA TCGTAATGTG ATCGACAGCG GCGAACACGT GGCGCTGGTT
CGCGGGGATC CCCACCAGGA CATGCCAACG CTGGTGCGGG TCCATCAGCT TGACCTAACT
GCGGATGTGC TGGGATGGCG CGCGGCGCAT CGCGACTATG TTCCGGCAGC GCTAGAGGTG
CTTGCTGCGC ACGATGGGCC TGCCGTGGCT GTTTTCGTCC GCGACAACAG TCCCACGTCA
ATTTCGGAAA GAGTTAAGGG CAATCGTAAG GCCTATGCCG ACACGCATGG CTACCGAGAT
TATGGCATAG GTGCTCAGAT ACTTCGCGAC GTTGGTGTGC GCGAGATGGT CCTACTGTCT
TCCAGCGCGG GCAAGCTCGC AGCGCTGGAG GGTTTCGGGC TTTCGGTGGT AAACCGTGTG
CCGCTGGTAG AGGACGAACG TGGAAAGCCG TCACGGTCGG ATCAGCCGTT CGCATCAGCT
TGA
 
Protein sequence
MTEVSPIETI IEDARNGRPY ILVDAPDREN EGDVIIPAQF ATPNAINFMA MHARGLICLA 
ISSERASHLD LPMMEARNQS SLSTAFTVSI EARDGVTTGI SAHDRAHTIA VAVDPSKGPE
DLVSPGHVFP LVARDGGVLV RAGHTEAAVD ISRLAGLAPA GVICEIMNPD GSMARLPELI
EFARKHEMNI GTIADLIEYR RRSESLVERV VEAPFDSWHG DFRIIVYRNV IDSGEHVALV
RGDPHQDMPT LVRVHQLDLT ADVLGWRAAH RDYVPAALEV LAAHDGPAVA VFVRDNSPTS
ISERVKGNRK AYADTHGYRD YGIGAQILRD VGVREMVLLS SSAGKLAALE GFGLSVVNRV
PLVEDERGKP SRSDQPFASA