Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2971 |
Symbol | |
ID | 3917406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3191032 |
End bp | 3192723 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640445749 |
Product | protein serine/threonine phosphatases |
Protein accession | YP_498240 |
Protein GI | 87200983 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase [COG0631] Serine/threonine protein phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.59777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGGAA GTGGCGAACT CGTGGTGGCG GCAGGCTTCT CGAGCCTGAC CGGACCGCGA GCCGACAATC AGGATTTCGG CGGCGTGCAT CTGGGCACGG CGCTGGAACG GGCCCGCCAC GGTGCGGTGG CGGCCATTGC CGATGGCGTG TCGGGCGGGC GCGAGGGGCG GGCAGCAGCG GAACTGGCGG TTCGCGCGCT GATCGAAGGC TTCTATGCGA TGCCCGGCAC GCTGGGGCCA GCGCGCGCGA TGCAGGCGCC GCTCGCCGCC TACAACCGCT GGCTTCACGC CCAGGGCCGC GGCGAAACGA TGGCGAACAG TGCCACGACA TTCACCGCAA TTGCGCTGCG CGGTCGCCGC GCCCACCTGG TCCACGTCGG CGACAGCCGC GCGTGGCGCT ACTCCGGCGG GCGCCTGACC TGCCTCACCG CCGACCATAC GCGCCCCGAA CCCGATCTGA ATCACGTGCT CATCCGCGCG CTCGGGATCG AACCCGAACT GCGGCTCGAC CACTCCGATC TCGAACTTGC CGAGCACGAC CGCCTGGTCC TCACGACCGA CGGCATCCAC GCCGTCCTTT CGGCAAAGCG GATCGCCGCC ATCCTTGCCG AAAGCGCCAG CGCAGAAGCG ACCGCCGAAG CGCTTGCCGA AGCCGCCATC GCCGCGGGCG GCCGCGACAA TGCCACAGCC GTCGTCCTCG ACATCGTCCG ACTGCCCGCC CCCGATCACG ACGGCATCCT CGCAGGCCTC TCCGCCCTGC CCTTCGCCGA CCCGCCCCGC CCCGGCGAAA GCATTGACGG CTTTCGCGCC GAACGCATCC TCTCCGAAGG GCGCCATGCC GTCCTTCTGA TCGCCACCGA TTGCGAGGAC GGTAGCCGCG TGGTGCTGAA GTTCCCGCGC CGAGAAATCC TGTCCGACCG GGCGCTCCGC CTGGCGTTCG CACGCGAAAT GCTGCTCGCC CAGCGTGTAT CGAGCCCGTT CATCCTGGCA GCGCACCCGG TCCGGCCAGA TCGCCAGAGC GCGCTCTACG GCGTCCAGCC GTTTCTCGAA GGCGAGACCA TGGCCCAGCG GCTGGAACGC GGCCTGCCCT CGCTGCGCAC CGCGCTCGAC ACAGCGATCA AGCTGACGCG CGGGGTTGCT GCCCTGCATC GCCTTGAAGT CGTCCACCGC GACATCAAGC CCGACAATGC GATCCTGACT GCCGACGGCG GCCTGCGCCT CATCGATCTC GGCGTCGCGC GACTGCCGAA GGTCGAGGAT TTCCACGCCG ACGAAATCCC CGGAACGCCG GGTTTCATGG CCCCCGAACA GTTCGAAGGC AACGCCGGCG ATGCCCTGAC CGACCAGTTT GCGCTCGGCG TCACCCTCTA CCGCTGGTTC ACCGGCAAGT GGCCGTTTGG CGAACAGGAG GCCTTCCAGC GGCCGCGATT CAACCGCCCT GCACCGCCCT CTCGCCACCG TCCGGAAATT CCCAGCTGGC TCGATGACGC GATCCTCACC GCGATCCAGC CCGACCGGGA CAAGCGTTTC GGTGACGTGA TCGAACTCCT GCGTGCGCTC GAAGGCGGCG GAACGCTTGC CAGCGGCCCG AAGCGGGACA TACCCCTGAT CGAGCGGGAT CCGGTCCGTT TCTGGCAGAT CGTCAGCGCA TTGCTCGGCG CGGCGCTGAT CGCCTCGTTG CTTCTGCGCT GA
|
Protein sequence | MRGSGELVVA AGFSSLTGPR ADNQDFGGVH LGTALERARH GAVAAIADGV SGGREGRAAA ELAVRALIEG FYAMPGTLGP ARAMQAPLAA YNRWLHAQGR GETMANSATT FTAIALRGRR AHLVHVGDSR AWRYSGGRLT CLTADHTRPE PDLNHVLIRA LGIEPELRLD HSDLELAEHD RLVLTTDGIH AVLSAKRIAA ILAESASAEA TAEALAEAAI AAGGRDNATA VVLDIVRLPA PDHDGILAGL SALPFADPPR PGESIDGFRA ERILSEGRHA VLLIATDCED GSRVVLKFPR REILSDRALR LAFAREMLLA QRVSSPFILA AHPVRPDRQS ALYGVQPFLE GETMAQRLER GLPSLRTALD TAIKLTRGVA ALHRLEVVHR DIKPDNAILT ADGGLRLIDL GVARLPKVED FHADEIPGTP GFMAPEQFEG NAGDALTDQF ALGVTLYRWF TGKWPFGEQE AFQRPRFNRP APPSRHRPEI PSWLDDAILT AIQPDRDKRF GDVIELLRAL EGGGTLASGP KRDIPLIERD PVRFWQIVSA LLGAALIASL LLR
|
| |