Gene Saro_0899 details


Gene Information

Locus tag: Saro_0899
Symbol:
ID: 3917985
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 955550
End bp: 956656
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 66%
IMG OID: 640443633
Product: phosphoribosylaminoimidazole synthetase
Protein accession: YP_496178
Protein GI: 87198921
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase
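
The coordinates and lengths in this record are mutually consistent, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the stop codon is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 955550
end_bp = 956656

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 nucleotides; the final (stop) codon encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1107
print(protein_length)  # 368
```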


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.514868
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGACG AGCAGAACAA GCCAGAGAGC TACAGCTACG CCAAAGCCGG CGTGAACATT 
GCTGCGGGCA ACGCATTGGT GAAGGCCATA GGCCCCCTTG CGAAGTCCAC CGCCCGCCCC
GGCGCGGACG CGGAACTCGG CGGCTTTGGC GGCTTCTTCG ATCTCAAGGC CGCTGGCTAC
AACGACCCTC TGCTGGTCGC CGGCAACGAC GGCGTGGGCA CCAAGGTCAA GCTCGCCATC
GACCATGACC GGCACGACCA GATCGGCATC GATCTTGTCG CGATGTGCGT CAACGACCTC
ATCGTCCAGG GCGCGGAGCC CCTGTTCTTC CTCGACTATT TCGCCACCGG ACGTCTCGAT
AACGGCGTTG CCGAACGTGT CGTCGCGGGC ATCGCCGATG GCTGCAAGCT GGCCGGTTGC
GCCCTCATCG GCGGCGAGAC CGCCGAAATG CCCGGCATGT ATGCCGACGG CGACTACGAC
CTTGCCGGCT TCTGCGTCGG TGCGGTCGAA CGGGGCGAGC AACTGACCGG AGACCGCGTG
GCCGAAGGCG ACGTGCTGCT CGGCCTCGCT TCCTCGGGCG TCCATTCCAA CGGCTATTCC
CTCGTCCGCC GCCTTGCCGC CGACAAGGGC TGGAAGCTCG ATCGCCCGGC CCTGTTCGAC
AACGAGCGCC TGTTGATCGA TTACCTGATC GAACCGACCC GCATCTATGT GAAGAGCCTG
CTGCCCTTCA TCCGCAGCGG CCGGATCAAC GCGCTGGCCC ACATCACCGG CGGCGGCCTG
CTTGAGAACG TCCCGCGCGT CCTGCCGCGT GGCCTCCACG CCCGGATCGA CGCCGACAGC
TGGGAACAAA GCCGCCTGAT GGCCTTCCTC CAGGCCCAGG GCAACATCGA GCCCGAGGAA
ATGGCCCGCA CCTTCAACTG CGGCATCGGC ATGATCCTCG CGGTCAATCC GGCACAGGCG
GATGCGCTTG CCGCAGACCT CGCCGCCGCC GGCGAGACGG TCTACCGCGT CGGCACCATC
GTGAAGGGCG AAAAGGGCTG CACCGTCACC GGAAGCGCCG AGACCTGGTC CGCCCGCTCG
GCCTGGGAAG CCACCCACAT TGGCTGA
 
Protein sequence
MSDEQNKPES YSYAKAGVNI AAGNALVKAI GPLAKSTARP GADAELGGFG GFFDLKAAGY 
NDPLLVAGND GVGTKVKLAI DHDRHDQIGI DLVAMCVNDL IVQGAEPLFF LDYFATGRLD
NGVAERVVAG IADGCKLAGC ALIGGETAEM PGMYADGDYD LAGFCVGAVE RGEQLTGDRV
AEGDVLLGLA SSGVHSNGYS LVRRLAADKG WKLDRPALFD NERLLIDYLI EPTRIYVKSL
LPFIRSGRIN ALAHITGGGL LENVPRVLPR GLHARIDADS WEQSRLMAFL QAQGNIEPEE
MARTFNCGIG MILAVNPAQA DALAADLAAA GETVYRVGTI VKGEKGCTVT GSAETWSARS
AWEATHIG
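
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal Python sketch of that mapping, not the database's own pipeline. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Compact encoding of the standard codon assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 27 bases of the gene sequence in this record.
first_block = "ATGAGCGACG AGCAGAACAA GCCAGAGAGC"
last_block = "GCCTGGGAAG CCACCCACAT TGGCTGA"

print(translate(first_block))  # MSDEQNKPES
print(translate(last_block))   # AWEATHIG
```

The last block is in frame because the 1080 bases preceding it are a multiple of three, so its output matches the final eight residues of the protein sequence.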