Gene Sala_0923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0923 
Symbol 
ID4083133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp936265 
End bp937488 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content69% 
IMG OID638009284 
Productamidohydrolase 
Protein accessionYP_615974 
Protein GI103486413 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.671663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTGA GGCCGCTCCA TATCGCCAAC GCGCTGCTGG TCGATGGCGA TACGCCGCGT 
CCGGGCAGCC TGCTGGCGGT CGACGGCCGC ATCGCCGCGA TCGACCCGGC TGACATCCCC
GAAGGCGCCG AAACCGTCGA TGCCAGGGGT CAGTGGCTCG CGCCGGGAAT CATCGACCTT
GGCGTCTTTG CGACCGACAA GCCCGCCTTT CACTTCGGCG GCATCACGCG CGCCGCGCTG
ATGCCCGACA ATGGTCCGCT CGACGGCGTC GGCCTTGTCG AGCGCGCGGC GAAGGGCGGC
AAACCCGACC TCTGGGTCCA TCCCCTCGCG GCCGCGACCA AGGGCCTCGA GGGCCGCGAG
CTCGCGGAAA TCGGCCTGAT GAAACAGGCG GGTGCGCGTG CCGTTGCCAC CGGCCGCGCC
CGCGTCGCCG ACAGCGGAGT GATGCGCCGC GTGCTCGCCT ATGCCGCCTC GTTGGGGCTC
GTGACGATCA TCCATGCCGA GGATGAAGGG CTGACCGCCG GCGCCGTCGC AACCGACGGC
GAGATGGCGA CGCGGCTTGG CCTGTCGTCG GCGCCCGCGA TCGCCGAAGC GATGGCGATC
GCGCGCGACC TGTCGCTCGT CGAGGAAACC GGCGCGCCGG TGCATTTCCG CCAGGTCACG
ACCGCGCGCG GGCTCGACCT GATCCGCGCC GCCAAGGCAA AGGGACTGCC CGTGCTTTGC
GGCATCACCC CCGCGCATCT GTTCCTGTCG GATACGGCAA TCGGCGATTT CCGGACCTTT
GCGCGGCTTT CACCGCCGCT GCGCAGCGAA GACGATCGCC GTGCCTGCCT TGCGGCGGTC
GTCGACGGCA CGATCGACGT TCTGTCTTCA GGCCACGACC CGCGCGGCCC CGAGGACAAG
CGCCTGCCAT TTGCCGAAGC ACTGCCCGGC ATGGCGGGAG CCGAAACCTT GCTCGCCATG
GGCCTGAACC TCGTCCGCGA CGGACATATC ACGCCTGGCC GTCTGTTCGA GATGCTTGCC
GCCATCCCAG CCTGCCTGCT CGGTGTCGAC GCGGGCCGCC TTGTAGCGGG CGGGGAAGCC
GACCTCATCC TCGTCGACCC CGACATCCCG TGGCAGGTCG ATGCAAAGAA GATGGCGACC
TGGGCGGGCA ACACTCCATT CGACGGCATG CCTGTCCAGG GCCGCGCCAC CATGATGTGG
AAGGGCGGAA AGCGGATCCG CTGA
 
Protein sequence
MELRPLHIAN ALLVDGDTPR PGSLLAVDGR IAAIDPADIP EGAETVDARG QWLAPGIIDL 
GVFATDKPAF HFGGITRAAL MPDNGPLDGV GLVERAAKGG KPDLWVHPLA AATKGLEGRE
LAEIGLMKQA GARAVATGRA RVADSGVMRR VLAYAASLGL VTIIHAEDEG LTAGAVATDG
EMATRLGLSS APAIAEAMAI ARDLSLVEET GAPVHFRQVT TARGLDLIRA AKAKGLPVLC
GITPAHLFLS DTAIGDFRTF ARLSPPLRSE DDRRACLAAV VDGTIDVLSS GHDPRGPEDK
RLPFAEALPG MAGAETLLAM GLNLVRDGHI TPGRLFEMLA AIPACLLGVD AGRLVAGGEA
DLILVDPDIP WQVDAKKMAT WAGNTPFDGM PVQGRATMMW KGGKRIR