Gene Sala_1691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1691 
Symbol 
ID4081094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1780530 
End bp1782191 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content66% 
IMG OID638010065 
Producturocanate hydratase 
Protein accessionYP_616737 
Protein GI103487176 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCC TCGATAACAG CCGCGTGATC CGCCCGGCGA CCGGCCCGGA GATCAGCGCG 
AAAAGCTGGC TTACCGAAGC CCCGATGCGG ATGCTGATGA ACAACCTCCA TCCCGACGTC
GCCGAAGCGC CGCACGAGCT GGTCGTCTAT GGCGGCATCG GTCGCGCCGC GCGCGACTGG
GAAAGCTATG ACCGGATCGT CGAGACGCTC AGACGGCTCG AAGGCGACGA GACATTGCTC
ATCCAGTCGG GCAAGCCCGT GGGCGTATTC CGCACCCACG CGGACGCGCC GCGCGTGCTG
CTCGCCAATT CGAACCTCGT CCCCCAATGG GCGAATTGGG AGCATTTTCA CGAGCTCGAT
AAAAAGGGCC TGATGATGTA CGGGCAGATG ACCGCGGGCA GCTGGATCTA TATCGGCAGC
CAGGGCATCG TGCAGGGAAC CTACGAAACC TTCGTCGAAA TGGGCCGCCA GCATTATGGC
GGCGACCTTT CGGGTCGCTG GCTGCTGACC GCAGGACTCG GCGGCATGGG CGGCGCGCAG
CCGCTCGCGG CGGTGATGGC GGGCGCGAGC TGTCTCGCGA TCGAATGCCA GCCGAGCCGC
ATCGAGATGC GCCTGCGCAC CGGCTATCTC GACAGGCAGG CGGCGAGCAT CGACGAGGCG
CTGGCGATGA TCGAAGCGAG CCACGCCGAG GACAAGCCGG TCTCGGTCGG CCTGCTCGGC
AACGCCGCCG AAATCCTGCC GGAGATCGTC CGTCGCGGCA TCCGCCCCGA CCTCCTGACC
GACCAGACCT CCGCGCACGA TCCGGTGAAT GGCTACCTCC CCGCGGGCTG GAGCCTCGAC
CAATGGTTTG CGAAGCGCGA GAGCGATCCG TCCGCAGTCG CGAAAGCGGC AAAAGCCTCG
ATGGCGGTGC ATGTTCGGGC GATGCTCGAC CTGCACGCCG CAGGTGTTCC GACGACCGAT
TATGGCAATA ATATCCGCCA GATGGCGAAA GACGAGGGTG TCGAAAATGC CTTCGACTTC
CCCGGCTTCG TTCCCGCCTA TGTCCGCCCG CTCTTCTGTC GCGGTATCGG CCCCTTCCGC
TGGGTGGCGC TGTCGGGCGA TCCCGAGGAC ATCTACCGGA CCGACGCGAG GGTGAAGCAA
CTGCTCCCCG ACAACACCCA CCTTCACAAC TGGCTCGACA TGGCGCGCGA ACGCATCCAG
TTCCAGGGCC TGCCTGCGCG CATCTGCTGG GTCGGGCTCG GCGACCGCCA CCGCCTCGGC
CTCGCCTTTA ACGAGATGGT CGCGTCGGGC GAATTGAAAG CGCCGATCGT GATCGGCCGC
GACCATCTCG ATTCGGGCTC GGTCGCCTCG CCCAACCGGG AGACCGAGGC AATGCGCGAT
GGCAGCGATG CGGTCAGCGA CTGGCCGCTG CTCAATGCGC TCCTCAACAC CGCATCGGGC
GCGACCTGGG TGTCGCTCCA TCACGGCGGC GGGGTCGGCA TGGGCTATTC GCAGCACAGC
GGCATGGTGA TCGTCGCCGA CGGCACACCC GAAGCGGCGA AGCGGCTCGA GCGCGTGCTG
TGGAACGATC CCGGAACCGG GGTCATGCGC CACGCCGACG CGGGGTATGA CATCGCCATC
GACTGCGCGC GCGAAAAGGG CCTCGACCTG CCAAGCATCT GA
 
Protein sequence
MTRLDNSRVI RPATGPEISA KSWLTEAPMR MLMNNLHPDV AEAPHELVVY GGIGRAARDW 
ESYDRIVETL RRLEGDETLL IQSGKPVGVF RTHADAPRVL LANSNLVPQW ANWEHFHELD
KKGLMMYGQM TAGSWIYIGS QGIVQGTYET FVEMGRQHYG GDLSGRWLLT AGLGGMGGAQ
PLAAVMAGAS CLAIECQPSR IEMRLRTGYL DRQAASIDEA LAMIEASHAE DKPVSVGLLG
NAAEILPEIV RRGIRPDLLT DQTSAHDPVN GYLPAGWSLD QWFAKRESDP SAVAKAAKAS
MAVHVRAMLD LHAAGVPTTD YGNNIRQMAK DEGVENAFDF PGFVPAYVRP LFCRGIGPFR
WVALSGDPED IYRTDARVKQ LLPDNTHLHN WLDMARERIQ FQGLPARICW VGLGDRHRLG
LAFNEMVASG ELKAPIVIGR DHLDSGSVAS PNRETEAMRD GSDAVSDWPL LNALLNTASG
ATWVSLHHGG GVGMGYSQHS GMVIVADGTP EAAKRLERVL WNDPGTGVMR HADAGYDIAI
DCAREKGLDL PSI