Gene Information |
Locus tag | Saro_3129 |
Symbol | |
ID | 3918171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3338468 |
End bp | 3340807 |
Gene Length | 2340 bp |
Protein Length | 779 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640445913 |
Product | hypothetical protein |
Protein accession | YP_498398 |
Protein GI | 87201141 |
COG category | [S] Function unknown |
COG ID | [COG5448] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02217] conserved hypothetical protein TIGR02217 |
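The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS with one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 3338468, 3340807
length = gene_length_bp(start_bp, end_bp)
print(length)                              # 2340
print(expected_protein_length_aa(length))  # 779
```

Under these assumptions both results agree with the Gene Length (2340 bp) and Protein Length (779 aa) fields.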
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0416541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCATT GGCTCGCCAA GCGCCGAACG GTGCAGCAGA CCGACATGAT CCAGCGGTTC GACCCGCGCT TCTGGACCGT GAACTTCCCG CGACCGGCGA TGGCCTCGGT CGTGACGACA GCCGCTGATG CCATGCGCGT CGACGTGGTG TTCCAGAAGG CAGACGACCT TGTCGGCCTG ATCTGGGAAT CTGAGGACAG GTGGGACCAT CCGCTGCTCG CCTACGAAAC GCGGCAGGAC TATTCACGGC TGACGTTGAC CTTTCGCTGG CGCTCGTCAG GCATAGTCGC GCTCAACGCT GTCAATGGTC CGACGCTGAC AATCGAGGGC CGCGATGCGA GCGGCACGGC ACGGGCCTGG TACGTCCGGC TGTGGAACTA TGCAGACGGA TCGCCGACCG ATGCGCTGAT CAGACTGCCG TTTTCGGACC TTGCCGGTGG TTTCCTGCTG CCGTCGGAGG CGGACCCGGT CCACCCCGCT GACATCGACC GCATGTTCAT CTCGCTCGTC CCGCCGGGTT ACGCGCCGGG CAGCGACGCG GCCTATGCAA ATGCGGTCGA GGGCTGGGCG GAACTGTCGG ACATCCACTG CGAAGGGCAT CGCCCGATGC TCGAGATCGG TGACGTCATG GTGCCACCGC ATGGCCTGGC GATCTGCACC GGCTATGACG ATGCCTACAA CCTGACGCCG GCGCGTGTCC TGCGGCAGGT GCGCGGTCTG GGGTACCGCG GGAGCATCAA CCACTACATC GGGATGAGCC ATTTCTTCCC GCTGGCACCT GATGGCGTGG GCGGCTTCGT CGTGGACGGG GCGTTGCCTG CAATGAACGC CGCGGCGAAG GCGTGGCAAG AGGCGTTCTT CACCGACGCC TGCGCGATGG GATATACCGT CATCGCCTCC CAGTCCTACG AGTTGCTGGC ACAGCATTGC CCCGATGCCT GGCAGCAACG TGCCCAGGAT GGAACGCCCG CGAGGACGGG CTGGTCGCCG CCATCGGCGT TGCTGTCGCC GGCCAATGCC GAGGCCATGG CATGGGTACG CAAGGTCGGT GTCGAACTCG TTGCGCTGCT CAAGGCGGCG GGGTTACCTG TACGCCATCA GGTGGGCGAG CCGTGGTGGT GGGTGACGGC CGACCGCAGG ATCTGCATCT ACGACAACGC GGCAAAAGCG GCGCTGGACG GCGACCCGGT GGACATTCCC GATCTTGGCG CGCCGCTGAC GGCGGCGCAG AAGAGCCTGC TCGATGCGGC GGGCACCATC CTTGCGCAAT CTACCGCGGA TCTCGCCGAG GCGGTCCGAA CCGCCGCCGG GGCCGCAGGA GCGGAGACGC TGCTGCTGGC ATTCCTGCCG ACGGTGCTTG ACCCCGCCAC CCCCGACGCG CGGCGCGCCA ACTTGCCGGT CGGCTGGGCC AGCCCCGCAT TCGATGTCCT GCAATTGGAG GACTATGACT GGGTCACGAC GGGCCGGCAG TCGCTGCGCG ACGAAGGCAG GCGCATTGCC GAGGAACGCC TGGGCTACCC GCGCGACCGC CAGCACTACC TTTCGGGATT TGTCCTGACT GGAACGAACG CCGCAGTCGA ATGGCCGAGG ATCGATGCGG CAGCAAGCGA AGCCGTCGCG CTTGGCGTTG CGGAAACCTT CATCTGGGCA CTGCCGCAGG TTTCCCGAGA CGGCTTCGTC CGGCTTCCCG AGCCAACCGG AGACAACCCC ATGCAATCCT TCGATGACGT CCTGTTCCCC CTGTCGCTCG GCCGGGATGC CTCCGTCACG CCGGAATTTT CGACGAACGT GACGATCACG 
GCTTCGGGTT TCGAGCGACG CAACAGCCTA TGGTCGGACG CGCGACTGCA ATTCGACGTG GGACCTGGCG TCCGTTCCGA AGCGGAGCTT GGTGAACTGA TCGCCTTTTT CCGCGCCCGG CGCGGACAGG CCCGCGGGTT CCGCCTGCGC GATCCGTCCG ACTTCAGTTC CAACGGCATG ACCGGCACAC CCACCCCTAC CGACCAGATC CTCGGGACCG GCGACGGGGC AACAGCGCGC TTCGCACTGG TCAAGCGTTA TGGCGACAGC GAGGATGCCC AGCGGCGCCG AATCACCCGC CCGCGCGCCG AAACGCTGCG CGTGAGCATC GACAATGTGG AAACCGGCGA CTTCACGCTG GCGCCGCTTG GCTACATCAC GCTGGCCAGC GCCCCACCCT CCGGTGCAGT CGTGCGCGCG GGTTTCCTGT TCGACGTGCC GGTGCGCTTT GCCGAAGACC GCATCGATAT TTCGGGCGCG GAGTTCGCGG CCGGAGAAGC GCCGAGCGTT CCGCTGGTCG AACTGCGAGA AGACGCGTGA
|
Protein sequence | MSHWLAKRRT VQQTDMIQRF DPRFWTVNFP RPAMASVVTT AADAMRVDVV FQKADDLVGL IWESEDRWDH PLLAYETRQD YSRLTLTFRW RSSGIVALNA VNGPTLTIEG RDASGTARAW YVRLWNYADG SPTDALIRLP FSDLAGGFLL PSEADPVHPA DIDRMFISLV PPGYAPGSDA AYANAVEGWA ELSDIHCEGH RPMLEIGDVM VPPHGLAICT GYDDAYNLTP ARVLRQVRGL GYRGSINHYI GMSHFFPLAP DGVGGFVVDG ALPAMNAAAK AWQEAFFTDA CAMGYTVIAS QSYELLAQHC PDAWQQRAQD GTPARTGWSP PSALLSPANA EAMAWVRKVG VELVALLKAA GLPVRHQVGE PWWWVTADRR ICIYDNAAKA ALDGDPVDIP DLGAPLTAAQ KSLLDAAGTI LAQSTADLAE AVRTAAGAAG AETLLLAFLP TVLDPATPDA RRANLPVGWA SPAFDVLQLE DYDWVTTGRQ SLRDEGRRIA EERLGYPRDR QHYLSGFVLT GTNAAVEWPR IDAAASEAVA LGVAETFIWA LPQVSRDGFV RLPEPTGDNP MQSFDDVLFP LSLGRDASVT PEFSTNVTIT ASGFERRNSL WSDARLQFDV GPGVRSEAEL GELIAFFRAR RGQARGFRLR DPSDFSSNGM TGTPTPTDQI LGTGDGATAR FALVKRYGDS EDAQRRRITR PRAETLRVSI DNVETGDFTL APLGYITLAS APPSGAVVRA GFLFDVPVRF AEDRIDISGA EFAAGEAPSV PLVELREDA
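The GC content field in the gene information can in principle be recomputed from the gene sequence. The sketch below is one way to do that, assuming GC content means the percentage of G and C bases over all bases; the helper name `gc_percent` is hypothetical. It strips the spaces that separate the 10-base blocks in this record. The example input is only the first two blocks of the gene sequence, used for illustration, so its result is not the gene-wide value.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence above (illustration only).
prefix = "ATGAGCCATT GGCTCGCCAA"
print(gc_percent(prefix))  # 55.0
```

Passing the full gene sequence, with its line breaks and spaces intact, would give the value to compare against the reported 66%.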