Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3926 |
Symbol | |
ID | 5077410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | + |
Start bp | 97655 |
End bp | 99322 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640481033 |
Product | hypothetical protein |
Protein accession | YP_001165695 |
Protein GI | 146275534 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACGC TGACCAAATT GCGCGCCCGG GTTTCACCGC AAGCGATCAC CAAGGTGACC CGTATCTTCA ACGGGACCCT CGACGACATC TTCGCCGAAC TCTTCCAGAA CGCCCGGCGC GCCGGCGCCA CACGCGTCGC GGTGACGGCC GAGCACCTGG ACGATGCCTG CCTGATCACC GTCGATGACG ACGGCGCAGG TATTGCTGAT CCCATCGATC TCGTCTCGCT CGGCCAATCC GATTGGCCCG GTGAATGCCG GTCACGTGAG GACCCTGCCG GCATGGGCTT CTTCAGCCTC GCCGGGCTTG ACACGGTGGT CAGTTCCACC AGTGCTGCTG GCTCGTTCTG GCTGGCGATC GCAGGTGATG CCTGGACCGG CGAGGCCGAC ATTGATGTGC TGCCGTGGGA TGGTCCGCGC GGCACGGCAA TCGCGTTCCA CTTTCCGCCC GGACCGGATG GCAAGCTCGA GCGGACCGTT GAGGCAGCGG CGCGGTTCTT CCCCCTGCCG GTCACTTACA ACGGCAAGGA TATCGCCCGC GCCGACTTCC TCGCCGATGC CTACAAGATC ATCGAGCGCG ATGGCTTCCG GATCGGGGTG TTCCGCGACC GGCATTCGCC GCACGTCGCG ACGCTCAATT TCCACGGGGT GACGCTCAAG CACGCGTTCC CGGTGATCAA GGAAGTCCAC CACACCCAGT GGAGCGTCCA GGTCGATATC GTCGATGCCC CGGACCTTGT TCTCGTCCTG CCCGCCCGCA AGGAGATCTA CCGCAATACC GCGCTCGACC ATCTCGTCGC GCTGTGCCGC CGTGCGATCT TCTCGGTCAT CTACGCCGAG CCGCTGCACA GGCTGAGTTT TGAGGACTGG CTCGAGGCAC GGTTCTACTC CGACGATTTC CCTCAAGCGG CGCGGCAACT GCCGCTCTGG TCGCCGTCCA CCGCCCGCGA AGACTATCGC CAGGTCCCGG CCTTCGCCGA TCTCGAGCCG GGCGCGACCA TCTACGATGA CACGGACTCC TATGATTCCG TGACGTTCGG CCGCGCGCTC CGGCGCTCGA ATGGCGGGGA ACCACACAAG CTCGGCGGAC CCGACCCCCG TGCGTTCCAT GAGCCGATCA CCAATTTCAT CGGCTATCCG TGGTATGACG CCCTGTCCTG CTTCATCCGC ACTGGCGAGT GCCTCACGCA CGACGGCGAT CAGGCGGCCG CGAGTGAGCC CGATGCCCTG ACCCAGAGGC CGGACGCTAT CCGGATCGAA CTGACCGATC AGCACGGCAA CCGCCTCGAC GTCGAGACCG ACTTCGTCAT CCAGGAGGGC GACGATTCCT GGGGCGATCC CGACTGCGCG GTGATTGCAG TTACCCGGGG CTCGGAACTC GATCCGAACG ATCTCACCGA CCTCATCATC GATGCGGTGT TCTCGCCTTC GGACGATTCC GATGCCGACA GCTACGACAC CCAGGAGACC CGCTTTCGCC ACGACGCTGC CGTGCGGGCC CATGCCATCC TTGAAGGCGA TGACGCCGCC ATTCTCGCTG GCATCCGCAT GGCCTTCGCC GACCGTGTTG CCTGGCGCAT CCCGCATGGC CGCAAGCTGC AGCTGACCTG GTCGAGCAGC GGCAATGACC TCACCCTTGT CGTCGCGGGG GAGGGCGCCA ACCAATGA
|
Protein sequence | MTTLTKLRAR VSPQAITKVT RIFNGTLDDI FAELFQNARR AGATRVAVTA EHLDDACLIT VDDDGAGIAD PIDLVSLGQS DWPGECRSRE DPAGMGFFSL AGLDTVVSST SAAGSFWLAI AGDAWTGEAD IDVLPWDGPR GTAIAFHFPP GPDGKLERTV EAAARFFPLP VTYNGKDIAR ADFLADAYKI IERDGFRIGV FRDRHSPHVA TLNFHGVTLK HAFPVIKEVH HTQWSVQVDI VDAPDLVLVL PARKEIYRNT ALDHLVALCR RAIFSVIYAE PLHRLSFEDW LEARFYSDDF PQAARQLPLW SPSTAREDYR QVPAFADLEP GATIYDDTDS YDSVTFGRAL RRSNGGEPHK LGGPDPRAFH EPITNFIGYP WYDALSCFIR TGECLTHDGD QAAASEPDAL TQRPDAIRIE LTDQHGNRLD VETDFVIQEG DDSWGDPDCA VIAVTRGSEL DPNDLTDLII DAVFSPSDDS DADSYDTQET RFRHDAAVRA HAILEGDDAA ILAGIRMAFA DRVAWRIPHG RKLQLTWSSS GNDLTLVVAG EGANQ
|
| |