Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3526 |
Symbol | |
ID | 5077675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 142258 |
End bp | 143625 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640481250 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_001165912 |
Protein GI | 146275752 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.34184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGGCC TTTCGACCGC AAAGGCCCTC GTCGCGGGCG TCGTGCTGCT TGGCAGCACG GCGCCCGTCG CGGCGAGCGC CGCGACCCGC ACGACGCCGG TCCTGCTCAT CTCGATCGAC GGCCTGCGGC CCGGCGACGT TCTGGAAGCC GAACGGCGCG GTCTCAAGAT CCCCAACCTG CGCCGTTTCC TCAAGGAAGG CAGCTACGCC ACCGGCGTCA CCGGAAACCT GCCCACGGTC ACCTATCCCA GCCACACCAC GCTGATCACG GGCGTCGCCC CGGCGCGCCA CGGCATCGTC TCGAACACGA CCTTCGATCC GAAGCAGGTG AACTATGGCG GCTGGTACTG GTATGCCGAG GACATCAGGA CGGGCACCCT GTGGGATGCA GCCCACAAGG CCGGGCTTTC CACCGCCAAC GTCCATTGGC CGGTGAGCGT CGGCGTCAAG GCCTTGTCCT ACAACCTTCC CCAGATCTGG CGCTCTGGCC ACGCCGACGA CCGCAAGCTC GTCCGCGCGC TGTCCACGGA CGGCCTCTAT GACGCGCTCG AACACGATTG CGGCGCCTAT GCCGATGGCA TCGACGAAGG CATCGCCGGC GACGAGACCC GCGCCAGGTT CGCCGCCCGC CTGATCGAAA CGAAGAAGCC CGATTTCGTC ACCGTCTATC TGGCCGCGCT CGACCACGAG GAGCATCTTT TCGGTCCGGG GTCAGCGCAG GCCAATGCCG TTCTCGAACG GCTCGACGCG GCGGTCGGCA CATTGGTATC GGCGGAACTG GCAGCACGTC CCGATGCCAC CATTGCCGTC GTCAGCGATC ACGGCTTCGT CGCGACCGAT ACCGAGGTCA ACCTCTTCCG CCCCTTCATC GACGCGGGCC TGATCGCGCT GGGACCGGAC GGCAAGGTTG CTTCCTGGGA AGCGATGCCA TGGCCGTCGG GTGGCTCCAT CGCGGTTGTG CTTGCGCGGC CGGACGACGC CGCGCTTGTC ACCAGGGTAG AGGCCCTGCT CGCCGGCCTC GCCGCCGATC CGCAGGCCCG CATCGCCAGC GTCATCGGCA AGGCCGACAT CGCGCGGCTG GGCGCAAACC CTCAGGCATC GTTCTATGTC GACCTGAAGC CCGGCGCACT GGCGGGCAAC TTCGCGGCCG ATGCACCACT CGCCAAGCCG TCGCGCTACA AGGGCATGCA CGGCTATTTC CCGGCGATGC CGGAAATGCG CTCGACCTTT CTGGTGATGG GCAAGTCCGT CGCCCCGGCA CGCAACCTTG GCGAGATCGA CATGCGCGCG ATCGCACCGA CACTGGCGAA GGCAATGGGG GCCGAGCTGC CCGGCGCCGA AGCAAAGGCC ATCCCGCTCG GAAAGTGA
|
Protein sequence | MRGLSTAKAL VAGVVLLGST APVAASAATR TTPVLLISID GLRPGDVLEA ERRGLKIPNL RRFLKEGSYA TGVTGNLPTV TYPSHTTLIT GVAPARHGIV SNTTFDPKQV NYGGWYWYAE DIRTGTLWDA AHKAGLSTAN VHWPVSVGVK ALSYNLPQIW RSGHADDRKL VRALSTDGLY DALEHDCGAY ADGIDEGIAG DETRARFAAR LIETKKPDFV TVYLAALDHE EHLFGPGSAQ ANAVLERLDA AVGTLVSAEL AARPDATIAV VSDHGFVATD TEVNLFRPFI DAGLIALGPD GKVASWEAMP WPSGGSIAVV LARPDDAALV TRVEALLAGL AADPQARIAS VIGKADIARL GANPQASFYV DLKPGALAGN FAADAPLAKP SRYKGMHGYF PAMPEMRSTF LVMGKSVAPA RNLGEIDMRA IAPTLAKAMG AELPGAEAKA IPLGK
|
| |