Gene Information |
Locus tag | Saro_0601 |
Symbol | |
ID | 3915613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 645976 |
End bp | 647553 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640443331 |
Product | sulfotransferase |
Protein accession | YP_495882 |
Protein GI | 87198625 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
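The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the CDS ends with a single stop codon. The function names are illustrative and are not part of the database.

```python
# Values copied from the record above; helper names are illustrative only.
START_BP = 645976
END_BP = 647553


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1578 525
```

Both results match the Gene Length (1578 bp) and Protein Length (525 aa) fields of the record.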
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACAG CGCCTTCCGG ACTGGTCGAC GCCGCCCGCG GTGCCCTTGT CCGGGGCGAT CTGGAAGCGA CGAATCGCGC CGCCGCCGCG ATGGTGGCGG CAAGTCCCGG GGACGCCGAA GGCCACTTTC TGCTGGGGGT GGCCGAAGCC GGTGTCGGGC GTATCAAGGC GGGCATCGGG CACATCGAGC GGGCGGTGAC GCTCGATCCG AGAGGCGAAT ACCTCGCGCA CCTCGCCAAG TTCTATTGCC TTGTGCGCAG GGACCGCGAC GCAGCGAACG CGCTGCGCAG GGCCGAGGCC GCACCGCCCG CCGACGCGCT GGGCCGCGAT ACGATGGGCT GCGTCTTTGC CCGGCTCGGC GACCATGCTG CCGCGCTTCC CCATTTCGCC GAGGCCGTGC GGCTGCGCCC CGACATTGCC GAATATCGCT ACAACGAGGC GGTTACCCTG AACTTCCTCG GCCGCGTCGA TGAGGCCGAG GCGGCCATCG AGGCGCTGCT GGCGCAAGCG CCCGGCCATG CGCGCGCGCA TCACCTGCTG GCGGGCCTGC GCCGACAGAG CGCTGAGCGC AACCACGTGG CGCGGCTTGG CCAGGCAAGG GCTCGGGCCC GGGGCGGGCG TGACCGGCTC CTGCTGGGCT ATGCGCTGGC CAAGGAACTG GAGGACATCG GCGAATACGA CCACGCTCTC GACACGCTGT GCGAGGCCAA TGCCGAACAC CGGCGCAACC TGCCCTATGC GTTCGAGCGC GACGCCGCGA TCTTCGATGC GATCGAGGAA AGCTGGCCGA TGTTGCGCGA TGCGGCACCG ACGGCGCCGA GCAAGGCGTC CCCGATCTTC GTCATCGGCA TGCCCCGCAC CGGCACGACG CTGGTCGATC GCATCCTCGG TTCGCACCCC GAAGTCGAAA GCGCGGGCGA GTTGCAGGCA ATGCCGCTGG CGGTGAAGAG GGCGGCGGCG ACCAACAGCG CGACGGTCAT GGACCCGGAA ACGATCCGCG CGGCCACCGG AGCCGACATG GCGGCGGTCG GTCGGGATTA TCTGGAGCGT GCCCGCCACC ATCTGCGCGG CGGGGCTGCG CGCTTTACCG ACAAGTTTCC CGGCAACTTC CACTATGCGG GCTTCATTGC CCGTGCCCTG CCCGAAGCGC GGATCGTCTG CCTGCGGCGT CATCCGATGG ACACGGTGCT GAGCAATTTC CGCAACCTCT TCGCGGTCGG CTCGCGGTAC TACGACTACA GCTACGACCT GCTCGACATA GCGGCCTATT ATGCCCGGTT CGACCGCCTG ATGGCGTTCT GGCGCGAGGC GCTGCCCGGG CGCGTGCTGG AGCTGCGCTA CGAGGATCTG GTCGCGGACC AGGAAGGGCA GACCCGGCGG CTTCTGGAGC ACTGCGGGCT CGGCTGGTCG GAAACATGCC TGGAATTCCA CAGCAACGCC GCGCCGGTTT CGACGCCGAG CGCGGCGCAG GTCCGCCGTC CGATCTACGC CGATTCCGTC GCGCGCTGGA AGCGGCATGG AGAGGTGCTG GGGCCTGTCG CGCGCTTCTT CGAGCAGACC GGCATCGCGA TCGATTGA
|
Protein sequence | MTTAPSGLVD AARGALVRGD LEATNRAAAA MVAASPGDAE GHFLLGVAEA GVGRIKAGIG HIERAVTLDP RGEYLAHLAK FYCLVRRDRD AANALRRAEA APPADALGRD TMGCVFARLG DHAAALPHFA EAVRLRPDIA EYRYNEAVTL NFLGRVDEAE AAIEALLAQA PGHARAHHLL AGLRRQSAER NHVARLGQAR ARARGGRDRL LLGYALAKEL EDIGEYDHAL DTLCEANAEH RRNLPYAFER DAAIFDAIEE SWPMLRDAAP TAPSKASPIF VIGMPRTGTT LVDRILGSHP EVESAGELQA MPLAVKRAAA TNSATVMDPE TIRAATGADM AAVGRDYLER ARHHLRGGAA RFTDKFPGNF HYAGFIARAL PEARIVCLRR HPMDTVLSNF RNLFAVGSRY YDYSYDLLDI AAYYARFDRL MAFWREALPG RVLELRYEDL VADQEGQTRR LLEHCGLGWS ETCLEFHSNA APVSTPSAAQ VRRPIYADSV ARWKRHGEVL GPVARFFEQT GIAID
|
| |
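The gene and protein sequences above can be cross-checked against each other and against the GC content field. The sketch below is one possible way to do this in plain Python, not the database's own method. It assumes the sequences are pasted as shown, in whitespace-separated blocks of ten. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code; the alternative start codons that table 11 permits are not handled, which is harmless here because this gene starts with ATG. All names are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGACGACAG CGCCTTCCGG ACTGGTCGAC"))  # prints: MTTAPSGLVD
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) checks the two fields against each other; `gc_content` on the same input can be compared with the GC content field.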