Gene Sala_0680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0680 
Symbol 
ID4082977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp688516 
End bp689586 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content72% 
IMG OID638009039 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_615734 
Protein GI103486173 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.233007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTTT TCGCTTCGCT CACCGACGCC GCCTATGCGC TCGCCCGCCC GCTCGTCCAC 
GCCACCGATG GCGAGGCCGC GCATAATCTG ACGCTCGCCG CGCTCCAGCC GCTGCCGCGC
GCGCGCCATG CCCTGACCAG CCCGATGCTC GCGACCGAGC TTGCCGGACT GCGCTTCCCC
AACCCGGTCG GGCTGGCCCC CGGTTTCGAC AAGGACGCGC GCGTCGCGCA TGCGATGCCG
CATTTCGGCT TCGGCTTTGT CGAGGTCGGC ACGCTCACCC CGCTGCCGCA GGAGGGCAAT
CCGCGCCCAC GGCTGTTCCG GCTGGTCGAG GATCGCGCGA TCATCAACCG CATGGGCTTC
AACAATGGCG GACAGGTCGC CGCCGCCGAG CGCATCGCCT GCCTGCGCCG CCATGGGCTG
CCGGTGCCGC TCGGCATCAA TATCGGCGCG AACAAGGACA GCGCCGACCG CATCGCCGAC
TATGCGAAGG GCACGGCGGC GATGGCGCCG CTCGCCGATT ATCTTACCGT CAATATCAGC
TCGCCGAACA CGCCCGGACT GCGCGCGCTG CAGGACAGGG GGGCGCTCGA GGCGCTGCTC
GACGGCGTCG CCGCGGCGCA GCCGGCGGGG GCGGCGAAGC CCGTCTTCCT GAAGGTCGCA
CCCGACCTCG AACCCGCCGA CATCGACGAC ATTGTGGCGG TGGCGCTCGA TAGGGGGCTC
GCGGCGGTGA TCGTGTCGAA CACGACCGTA GCCCGGCCGC CGCTGGCCTC GCGCCACGCC
GTCGAAGCCG GTGGCCTGTC GGGCGCGCCG CTCGCGCAGC TCGCGCTTCA GTGCGTGCAG
GATTTCCGCG CCGCGAGCGG CGGCAGGCTG CCGCTGATCG CCGCGGGCGG GATCGCCTCT
GCCGAACAGG CCTGGGAACG CATTCGCGCG GGAGCAAGCC TGGTGCAGGT CTATTCGGCG
ATGGTCTTTG AAGGGCCGGG TCTTGCGAGC CGCATCGCAC GCGGGCTGGA GACGCTGGCG
GCGCGCGACG GGTTTGCGCG GGTGAGCGAC GCGGTGGGGG CGGGCGCCTG A
 
Protein sequence
MSLFASLTDA AYALARPLVH ATDGEAAHNL TLAALQPLPR ARHALTSPML ATELAGLRFP 
NPVGLAPGFD KDARVAHAMP HFGFGFVEVG TLTPLPQEGN PRPRLFRLVE DRAIINRMGF
NNGGQVAAAE RIACLRRHGL PVPLGINIGA NKDSADRIAD YAKGTAAMAP LADYLTVNIS
SPNTPGLRAL QDRGALEALL DGVAAAQPAG AAKPVFLKVA PDLEPADIDD IVAVALDRGL
AAVIVSNTTV ARPPLASRHA VEAGGLSGAP LAQLALQCVQ DFRAASGGRL PLIAAGGIAS
AEQAWERIRA GASLVQVYSA MVFEGPGLAS RIARGLETLA ARDGFARVSD AVGAGA