Gene Saro_1680 details

Gene Information

Locus tag: Saro_1680
Symbol:
ID: 3916255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1764996
End bp: 1766339
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 65%
IMG OID: 640444421
Product: ring hydroxylating dioxygenase, alpha subunit
Protein accession: YP_496954
Protein GI: 87199697
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
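
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 1764996
END_BP = 1766339


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed present)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1344 447
```

Both results agree with the listed gene length (1344 bp) and protein length (447 aa).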


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.801472
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCTTGC GCCTTTCCGA CCTTATCGAC GACCGCCCTT CGCAAGGCGT GTTCCGCGTG 
AACCGCGCGA TCTTCACCGA CCAAAGCGTG TTCGACGCGG AAATGCGCCG CCTGTTCGAG
GGCGGATGGG TCTTTCTCGG CATGGAATCG CAGGCGTCCG GTCCGCACGA TTTCTTCACC
ACCTCCGCCG GGCGCGTGCC GGTCATGGTC CAGCGCGACG GGGAGGGCGT GTTGCGCGCC
TTCGTCAATT CCTGCCCGCA CAAGGGCGCG CGCCTGGCGC AAGTGCGGCA AGGCAACGCG
CGGCTGCACG TCTGTCCGTA CCATAGCTGG TCGTTCGACA GCGCCGGCCG CAACAAGGCC
GTGAAATGGA AGGCGGCCGG GTGCTATTCC GATGCCTTCG ACCGCGACGA TCATGGCCTC
GCGGTCCTCC CGCGCTTTGA AGGCTATCGC GGCTTCCTGT TCGGCAGCGT TTCGCCTGAA
GTCCCGCCGC TGGCCGAACA TCTCGGCGAA GCCGCCAAGC TGCTCGATCT CGTCGCGGAT
CAGAGCGAGG AAGGGCTGGA ACTCGTGCCC GGGCAGGTGA CCTTCACCTA TCAGGCGAAC
TGGAAGCTCC AGCTCGAGAA CTGTTCCGAC GCCTATCACT TCACATCCGC CCACCCATCC
TACATCCGCG TGCTGGAACG GCGGCAGAAG GAGATCAGCG AAGAGGTCGT TGCCTCGGTG
TGGGAGAACA GCGACTACTG GAAGGAGGAC ACCAAGGGCG TCGGCGGCGG CAGCTTCAGC
ATGGCAAATG GCCACGTGCT GAACTGGGGT GTGTTTGGCG TCACCCCCGC GATCCCGCTC
TACGAACGCG CGGCGCAACT GGCGGAGCGT GTGGGTGAGG GGAAGCGCGA CTGGATGTTC
AACATGCGCA ACCTCACGAT CTTTCCCAAC CTGCAGGTCG CGGAGAACGC GTCGAGCCAG
CTTCGCGTGA TCCGCCCGAT TTCGCCGGCG CTTACCGAAA TGCGCACCTG GTGCATCGCG
CCCAAGGGGG AAAGCGACGC TGCCCGCCGC CAGCGCATTC GGCAGTACGA GGACTTCTTC
AACCCCACCG GCATGGCAAC GCCCGACGAC ACCGTGTCTT ACGAAAACTG CCAGATCGGC
TTTGCCGGCA CGACCGAACC CTGGCTCCAG GGGTATGCGC GGGGGATGGA GGCCTCGGTC
GAAGGCGGCA ACCGCTTTTC CGAACGTATC GGACTGGAAC CGCAGCGCAG CGTTCTGGCG
GACTCGCAGC TTTGCGACGA AACGCTCTAC CACAGCTACT ACCGCGCCTG GGCCGCGCGC
ATGGCACCGG AGTTCGCGGC ATGA
 
Protein sequence
MTLRLSDLID DRPSQGVFRV NRAIFTDQSV FDAEMRRLFE GGWVFLGMES QASGPHDFFT 
TSAGRVPVMV QRDGEGVLRA FVNSCPHKGA RLAQVRQGNA RLHVCPYHSW SFDSAGRNKA
VKWKAAGCYS DAFDRDDHGL AVLPRFEGYR GFLFGSVSPE VPPLAEHLGE AAKLLDLVAD
QSEEGLELVP GQVTFTYQAN WKLQLENCSD AYHFTSAHPS YIRVLERRQK EISEEVVASV
WENSDYWKED TKGVGGGSFS MANGHVLNWG VFGVTPAIPL YERAAQLAER VGEGKRDWMF
NMRNLTIFPN LQVAENASSQ LRVIRPISPA LTEMRTWCIA PKGESDAARR QRIRQYEDFF
NPTGMATPDD TVSYENCQIG FAGTTEPWLQ GYARGMEASV EGGNRFSERI GLEPQRSVLA
DSQLCDETLY HSYYRAWAAR MAPEFAA
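
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: `translate_cds` and `gc_percent` are hypothetical helper names, the codon assignments are those of translation table 11 (the same sense and stop codons as the standard code), and the first codon is assumed to be the initiation codon and is therefore written as M even when it is GTG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (translation table 11 assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(dna: str) -> str:
    """Drop the spaces and line breaks used in the 10-base block layout."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS fragment that starts at the initiation codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        # Assumption: the first codon is the start codon and is read as Met.
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
first_bases = "GTGACCTTGC GCCTTTCCGA CCTTATCGAC"
print(translate_cds(first_bases))  # MTLRLSDLID
```

Pasting the full gene sequence into `translate_cds` and `gc_percent` allows a comparison with the listed protein sequence and GC content.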