Gene Saro_3526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3526 
Symbol 
ID5077675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp142258 
End bp143625 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content67% 
IMG OID640481250 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_001165912 
Protein GI146275752 
COG category[R] General function prediction only 
COG ID[COG1524] Uncharacterized proteins of the AP superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.34184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGGCC TTTCGACCGC AAAGGCCCTC GTCGCGGGCG TCGTGCTGCT TGGCAGCACG 
GCGCCCGTCG CGGCGAGCGC CGCGACCCGC ACGACGCCGG TCCTGCTCAT CTCGATCGAC
GGCCTGCGGC CCGGCGACGT TCTGGAAGCC GAACGGCGCG GTCTCAAGAT CCCCAACCTG
CGCCGTTTCC TCAAGGAAGG CAGCTACGCC ACCGGCGTCA CCGGAAACCT GCCCACGGTC
ACCTATCCCA GCCACACCAC GCTGATCACG GGCGTCGCCC CGGCGCGCCA CGGCATCGTC
TCGAACACGA CCTTCGATCC GAAGCAGGTG AACTATGGCG GCTGGTACTG GTATGCCGAG
GACATCAGGA CGGGCACCCT GTGGGATGCA GCCCACAAGG CCGGGCTTTC CACCGCCAAC
GTCCATTGGC CGGTGAGCGT CGGCGTCAAG GCCTTGTCCT ACAACCTTCC CCAGATCTGG
CGCTCTGGCC ACGCCGACGA CCGCAAGCTC GTCCGCGCGC TGTCCACGGA CGGCCTCTAT
GACGCGCTCG AACACGATTG CGGCGCCTAT GCCGATGGCA TCGACGAAGG CATCGCCGGC
GACGAGACCC GCGCCAGGTT CGCCGCCCGC CTGATCGAAA CGAAGAAGCC CGATTTCGTC
ACCGTCTATC TGGCCGCGCT CGACCACGAG GAGCATCTTT TCGGTCCGGG GTCAGCGCAG
GCCAATGCCG TTCTCGAACG GCTCGACGCG GCGGTCGGCA CATTGGTATC GGCGGAACTG
GCAGCACGTC CCGATGCCAC CATTGCCGTC GTCAGCGATC ACGGCTTCGT CGCGACCGAT
ACCGAGGTCA ACCTCTTCCG CCCCTTCATC GACGCGGGCC TGATCGCGCT GGGACCGGAC
GGCAAGGTTG CTTCCTGGGA AGCGATGCCA TGGCCGTCGG GTGGCTCCAT CGCGGTTGTG
CTTGCGCGGC CGGACGACGC CGCGCTTGTC ACCAGGGTAG AGGCCCTGCT CGCCGGCCTC
GCCGCCGATC CGCAGGCCCG CATCGCCAGC GTCATCGGCA AGGCCGACAT CGCGCGGCTG
GGCGCAAACC CTCAGGCATC GTTCTATGTC GACCTGAAGC CCGGCGCACT GGCGGGCAAC
TTCGCGGCCG ATGCACCACT CGCCAAGCCG TCGCGCTACA AGGGCATGCA CGGCTATTTC
CCGGCGATGC CGGAAATGCG CTCGACCTTT CTGGTGATGG GCAAGTCCGT CGCCCCGGCA
CGCAACCTTG GCGAGATCGA CATGCGCGCG ATCGCACCGA CACTGGCGAA GGCAATGGGG
GCCGAGCTGC CCGGCGCCGA AGCAAAGGCC ATCCCGCTCG GAAAGTGA
 
Protein sequence
MRGLSTAKAL VAGVVLLGST APVAASAATR TTPVLLISID GLRPGDVLEA ERRGLKIPNL 
RRFLKEGSYA TGVTGNLPTV TYPSHTTLIT GVAPARHGIV SNTTFDPKQV NYGGWYWYAE
DIRTGTLWDA AHKAGLSTAN VHWPVSVGVK ALSYNLPQIW RSGHADDRKL VRALSTDGLY
DALEHDCGAY ADGIDEGIAG DETRARFAAR LIETKKPDFV TVYLAALDHE EHLFGPGSAQ
ANAVLERLDA AVGTLVSAEL AARPDATIAV VSDHGFVATD TEVNLFRPFI DAGLIALGPD
GKVASWEAMP WPSGGSIAVV LARPDDAALV TRVEALLAGL AADPQARIAS VIGKADIARL
GANPQASFYV DLKPGALAGN FAADAPLAKP SRYKGMHGYF PAMPEMRSTF LVMGKSVAPA
RNLGEIDMRA IAPTLAKAMG AELPGAEAKA IPLGK