Gene Saro_3263 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3263 
Symbol 
ID3917521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3483884 
End bp3484942 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content69% 
IMG OID640446047 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_498532 
Protein GI87201275 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCCGC CCGGCGAAAC CATCGGTATC CTTGGCGGCG GCCAGCTTGG CCGGATGATG 
GCGCTGGCGG CTGCCAACCT CGGCTACCGC TGCCATGCCT ATGCGCCAGA GGCCGATTCG
ATTGCCGCGG ATGTCTGCGC CGCCTTCACC CAGGGTGCCT ATGACGACGA GGCGGCGCTG
GCGGCGTTTG CCGCGCAGTG CGCGGTCGTC ACCTACGAGT TCGAGAATGT CGCCGCCGCG
CCGCTCGCCG CAGTCAGTGC CCACGCGCCG CTCCATCCGC CCGCCCGCGC GCTCGAAGTG
GCGCAGGACA GGGTGAGCGA GAAGTCCTTT GTCGAAGCGC TGGGCGGCCG CCCCGCGCCG
TGGGCGCAGG TCGATTCGCT TGAAGATCTC AAGGCCGCCG TCGCCCGGAT CGGCGCGCCC
GGCATCCTCA AGACCCGGCG CGACGGCTAT GACGGCAAGG GCCAGTGGCG CATCGCCCAG
CCCGGCGACG CCGATGCGCT CGATCTTTCG GCCAAGCCGC TGATCTACGA AGGCTTCGTC
ACTTTCGAAG CCGAGTTCTC GGTCATCCTC GTGCGCACCG AGGCTGGTGA GGTGCGTTTC
TGGGATTCGG CCGAGAACGT CCACAAGGCC GGCATCCTCG ACCGCTCGAC CGTGCCAGCC
GCGCCGGTGA TCCTGGCGCA GGTGGACGAG GCCCGCGCCC TTGCCGCGCG GGTGGCCGAT
GCGCTTGGCT ATGTCGGCGT TCTCACGCTC GAATTCTTCG CCACGGCCGA TGGCCCGGTG
TTCAACGAGA TGGCACCCCG CGTGCACAAC TCGGGCCACT GGACCATCGA GGGCGCGCTC
ACCAGCCAGT TCGAGAACCA CGTTCGCGCG ATTTGCGGGT TGCCGCTGGG CTCGACCGCG
CTCGCCGCGA AGGGGGTGGT CATGGACAAC CTCATCGGCG ACGACGCGCA CGACTGGCCC
GCCATCCTTT CCGACCCGGC GAACCACCTC CACCTCTATG GCAAGGCGGC GGTGCGACCC
GGGCGCAAGA TGGGCCACGT CACCCGGCTG GTACTGTGA
 
Protein sequence
MIPPGETIGI LGGGQLGRMM ALAAANLGYR CHAYAPEADS IAADVCAAFT QGAYDDEAAL 
AAFAAQCAVV TYEFENVAAA PLAAVSAHAP LHPPARALEV AQDRVSEKSF VEALGGRPAP
WAQVDSLEDL KAAVARIGAP GILKTRRDGY DGKGQWRIAQ PGDADALDLS AKPLIYEGFV
TFEAEFSVIL VRTEAGEVRF WDSAENVHKA GILDRSTVPA APVILAQVDE ARALAARVAD
ALGYVGVLTL EFFATADGPV FNEMAPRVHN SGHWTIEGAL TSQFENHVRA ICGLPLGSTA
LAAKGVVMDN LIGDDAHDWP AILSDPANHL HLYGKAAVRP GRKMGHVTRL VL