Gene Information |
Locus tag | Saro_3627 |
Symbol | |
ID | 5077775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 251128 |
End bp | 252951 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640481350 |
Product | hypothetical protein |
Protein accession | YP_001166012 |
Protein GI | 146275852 |
COG category | |
COG ID | |
TIGRFAM ID | |
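The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length counts the terminal stop codon; neither convention is stated in the record, although both reproduce its values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Saro_3627.
length_bp = gene_length(251128, 252951)
print(length_bp)                  # 1824, matching "Gene Length"
print(protein_length(length_bp))  # 607, matching "Protein Length"
```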
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGATG CGGTTCGGGG TGCCGAAGGC AAGGTGGTGC GCATCGGCGG CGCCAGCGGG GCGCTGAACG ACAGCGCCAT CGCGGTGCCC GCGCTGCTGA CCGTGCCGGG ACTCGATTAC CTCGCCTTCG ACTATCTCGG CGAAGGCGCC ATGGGCATCT TCCGGCGCAT GAAGCAGGCC GATCCGGCAT CGGGCTTCCT GCCGGACTTC GTCGACATCC ATGTCGGACC GTATCTGGCC GAACTCAAGG CGCGCGGCAT TCGCGTCGTC GCCAATGCCG GCGGCATGAA CCCCGAAGGG CTGGCGGAAC TCATCCGCAA GCGCGGCCGG GAACAGGGGC TGGACCTGCG GGTGGCCACG GTGACCGGCG ACGATGTCGA AGCGCTGGTT CCCGCGCTGA GGGCCGAGGG CTTGCGCGAC ATGTACAACG GCATGCCCCT GCCCGAAGGC GGCATCGGGC TTCACGCCTA TCTCGGCGCG TTTCCCATCG CGCGGGCGCT GGCGGCGGGC GCGGACATGG TCATTACCGG GCGCGTGGTC GATTCCGCGC TCATCCTGGG GCCGCTGATT CACGAGTTCG GCTGGGGCGC CGAGGACTAC GACCTGCTGG CGGCGGGCAC TGTCGCCGGC CATCTTCTGG AATGTGGCGC GCAGGCGACG GGCGGCACCT TCACCGACTG GCAGGACGTG CCCGACTGGG CGAACATCGG TTTTCCCGTG GGCGAATGCC ATGCCGACGG CAGCGTGGTG ATGACCAAGC CCGAAGGCAC GGGCGGTCTC GTTTCGGTAG GGACGATCGC GGAGCAGCTT CTCTACGAGG TCGGCGATCC GCAAGCCTAC ATCGTTCCAG ACGTGGTCTG CGACTTCACC GGCGTGACTG TGGAGCAGGT CGGGCCGAAC CGCGTGCGCG TGGCCGGCGC GAAAGGCTAT CCACCTACCG GATCGCTCAA GCTGTGCGGC ACTTATGACG ACGGCTGGCG CTCGGTCGCG CTGATCCCCG TCAGCGGGAT GGATGCGGCG GCCAAGGCGC GGCGCACGGC GGACGCGCTG CTGGAACGCA CGGGCCGGAT GCTGCGCGAA CGCAACTGGG GCGAGTGGCG GATGACGCAC AGCGAGGTCA TCGGCACCGA GACCGCATGG GGGCCGCGCG CACAGGCGCT TGGCCCCCGC GAAGTCCTGC TCAAGATCGT CGTCGATCAC GACAATCCTG CCGCCTGCAT GTTGTTCGGG CGCGAGCAGA CGACCGCAAT CATGAACATG GCCGTCGGAA CGTCGATTGC CCCGATCATC GCCGCGCCGC GCGCCTTTCC GCTGACCGGC ATGGTCTTCG GCCTGATCGG GCGCGACCGG GTCAAGGCGC GGGTGTGGCT GGACGGGACC GAGCTGGACT TTGACGATCC TGTCCGCGCC GGGTTCGATC CGGCGAAGAT CGTGCGGCCC GCCGCGCCGG CATTGCCGGA AACGTCAGGG CCGCTGGTCG AAGTGCCGCT TGTCCGGCTC GCCTGGGCAA GGAGCGGCGA CAAGGGACGC CTGTTCAACG TTGGCGTGAT CGCGCGCGAG GCGCGGTTCC TGCCGTGGAT CCGCGCGAGC CTGACCCAGG CAGCAGTGAC CGACTGGTAC CGGCACCTGT TCGACGATCC TGCCCATGCG CGGCTGGAGA TCTTCGACGT GCCCGGCTGC CACGCGATCA ACATCCTGGC CCACGATGCG CAGGGCGGCG GCATCAACGT CTCGCCGCGC CTCGATGCCG CGGCCAAGAG CATGGCCCAG CACCTGCTGG AAATGCCGGT GCGAGTGCCG 
CAAGCGATGA TCGAAAGAGG ATAA
|
Protein sequence | MGDAVRGAEG KVVRIGGASG ALNDSAIAVP ALLTVPGLDY LAFDYLGEGA MGIFRRMKQA DPASGFLPDF VDIHVGPYLA ELKARGIRVV ANAGGMNPEG LAELIRKRGR EQGLDLRVAT VTGDDVEALV PALRAEGLRD MYNGMPLPEG GIGLHAYLGA FPIARALAAG ADMVITGRVV DSALILGPLI HEFGWGAEDY DLLAAGTVAG HLLECGAQAT GGTFTDWQDV PDWANIGFPV GECHADGSVV MTKPEGTGGL VSVGTIAEQL LYEVGDPQAY IVPDVVCDFT GVTVEQVGPN RVRVAGAKGY PPTGSLKLCG TYDDGWRSVA LIPVSGMDAA AKARRTADAL LERTGRMLRE RNWGEWRMTH SEVIGTETAW GPRAQALGPR EVLLKIVVDH DNPAACMLFG REQTTAIMNM AVGTSIAPII AAPRAFPLTG MVFGLIGRDR VKARVWLDGT ELDFDDPVRA GFDPAKIVRP AAPALPETSG PLVEVPLVRL AWARSGDKGR LFNVGVIARE ARFLPWIRAS LTQAAVTDWY RHLFDDPAHA RLEIFDVPGC HAINILAHDA QGGGINVSPR LDAAAKSMAQ HLLEMPVRVP QAMIERG