Gene Saro_0172 details


Gene Information

Locus tag: Saro_0172
Symbol:
ID: 3916160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 172861
End bp: 174150
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 66%
IMG OID: 640442898
Product: phosphoribosylamine--glycine ligase
Protein accession: YP_495455
Protein GI: 87198198
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0151] Phosphoribosylamine-glycine ligase
TIGRFAM ID: [TIGR00877] phosphoribosylamine--glycine ligase
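
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both assumptions are inferred from the values in this record, not stated by it, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(172861, 174150)
print(length)                  # 1290
print(protein_length(length))  # 429
```

Both results match the Gene Length and Protein Length fields of the record.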


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATATCC TGCTGCTGGG ATCGGGTGGG CGTGAACATG CCCTGGCATG GAAATTGGCC 
CAATCGCCCC TCACGGGCAA GCTCTATGCC GCCCCGGGCA ACCCGGGCAT CGCCCAGCAC
GCCGATCTGG TCGCGCTTGA CCTGACCGAT CATGCGAGCG TCGTCACCTT CTGCGAGACC
AACCGCATCG GGCTGGTCGT CGTCGGCCCG GAAGCGCCGC TGGTCGATGG CCTGACGGAT
TCGCTGCGCG CGGCGGGTTT TTCGGTGTTC GGGCCAAGCC GTGCGGCCGC GCAGCTCGAA
GGATCGAAGG GCTTTACCAA GGACCTGTGC GCGCGGGCGG ATATCCCGAC GGCAGGATAT
GTCCGCGCGA AGTCGCTTGA AGAAGCGCGG GCCGCGCTGG CCGAGTTCGG CGCTCCGGTC
GTGATCAAGG CCGACGGGCT GGCCGCGGGC AAGGGCGTTG TCGTGGCAAT GACCATGGAA
GAAGCCGAGG CAGCGGTCGA CGACATGTTC GATGGCGCAT TCGGTTCGGC CGGCGCGGAA
GTGGTGGTCG AGGAGTTCCT TACCGGTGAG GAGGCTAGCT TCTTCGCCCT GACCGACGGC
GCCACGATCG TGCCTTTTGC TTCCGCTCAG GATCACAAGC GCGTGGGCGA TGGCGATACC
GGACCCAACA CCGGCGGCAT GGGCGCCTAC AGCCCGGCGC GCGTGCTGAC CCCGGAACTC
GAGGCGCAGG CGCTTTCGCA GATCATTGCT CCGACCGTAA AGGCCATGGC GGACGAGGGC
ATGCCTTATT CAGGCGTGCT CTATGCTGGC CTCATGCTCA CGGAACAGGG CCCCAAGCTG
ATCGAATACA ACGCGCGCTT CGGCGATCCC GAATGCCAGG TGCTGATGAT GCGCCTCGAG
AGCGATCTGG TCGAACTGCT GCTGGCCTGC GCCGACAACA AGCTGTCGAG CATACAGCCG
CCGCGCTTCT CCAACGAAGT GGCGATGACC GTGGTCATGG CCGCGCAGGG CTATCCCGGC
ACGCCGAAGA AGGGCGGCCG GATCGATCAT ATCCCGGCAG CAGAGGCCGG CGGCGCGAAA
GTCTTCCATG CTGGCACCGT TCTTTCGGGC GATGGCGTCC TTTCGGCCAA TGGCGGGCGC
GTCCTGAATG TCACGGCCAA GGGACCGACT GTAAAGTCCG CGCGCGATGC CGCCTATGCG
GCGGTCGATG CGATCAGCTT TCCGGAAGGC TTCTGCCGCC GAGACATCGG CTGGCGCGAG
ATCGAGCGCG AGGAAGCCGC TAGCGTCTGA
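
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined into one string before it can be measured. The following is a minimal sketch of that clean-up and of a GC-content calculation. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied as displayed in the record.
first_row = "ATGAATATCC TGCTGCTGGG ATCGGGTGGG CGTGAACATG CCCTGGCATG GAAATTGGCC"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(f"{gc_content(seq):.1%}")   # GC fraction of this row only
```

Passing the complete gene sequence to the same two functions gives a value that can be compared with the GC content field of the record. A single row is not expected to reproduce the whole-gene figure.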
 
Protein sequence
MNILLLGSGG REHALAWKLA QSPLTGKLYA APGNPGIAQH ADLVALDLTD HASVVTFCET 
NRIGLVVVGP EAPLVDGLTD SLRAAGFSVF GPSRAAAQLE GSKGFTKDLC ARADIPTAGY
VRAKSLEEAR AALAEFGAPV VIKADGLAAG KGVVVAMTME EAEAAVDDMF DGAFGSAGAE
VVVEEFLTGE EASFFALTDG ATIVPFASAQ DHKRVGDGDT GPNTGGMGAY SPARVLTPEL
EAQALSQIIA PTVKAMADEG MPYSGVLYAG LMLTEQGPKL IEYNARFGDP ECQVLMMRLE
SDLVELLLAC ADNKLSSIQP PRFSNEVAMT VVMAAQGYPG TPKKGGRIDH IPAAEAGGAK
VFHAGTVLSG DGVLSANGGR VLNVTAKGPT VKSARDAAYA AVDAISFPEG FCRRDIGWRE
IEREEAASV
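
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a sketch, the translation can be reproduced with a small codon table. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as start codons), so the standard assignments are used here. The `translate` function is an illustrative helper rather than part of any database tool, and it does not model alternative start codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last three blocks of the gene sequence above.
print(translate("ATGAATATCC TGCTGCTGGG ATCGGGTGGG"))  # MNILLLGSGG
print(translate("ATCGAGCGCG AGGAAGCCGC TAGCGTCTGA"))  # IEREEAASV
```

The two outputs match the first and last blocks of the protein sequence shown in the record.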