Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2423 |
Symbol | |
ID | 3916742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2604974 |
End bp | 2606890 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640445178 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_497693 |
Protein GI | 87200436 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3669] Alpha-L-fucosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.204978 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCTGC CTGCCCTTTC CCGTCGCACC GTGCTGTCGG GCTCGCTCCT TGCAGGCGCC GCCGCCTGCA CGCCCAAGGC GGGCGGCCTT GCCTCCTCGC CTGTCGTGCC CGCACTGCCC GCGCCATGGG GCGCCGTGCC GCATCCGCGC CAGGTGAAAT GGCACGACCG GCGCATGTAC GCCTTCATCC ATTTCTCGAT GAACACCTTC ACCGACAAGG AATGGGGTTT CGGTGACGAA GACCCCGCCA TGTTCAACCC CACCGATTTC GACGCCGACC AGATCGTCGG CGCGGCGGTG GCGGGCGGAC TTACCGGCCT CATCATCACC GCCAAGCACC ACGACGGCTT CTGCCTCTGG CCCACCACGC TGACCGAACA CTGCGTGCGC AATTCGCCCT GGCGCGGGGG CAAGGGCGAT GTCGTCGGCG AACTGGAGGC CGCGTGCCGC CGCGCCGGGA TCAACTTCGG CGTGTATCTC TCGCCGTGGG ATCGCAACCG CGCCGATTAC GGCAAGCCGT CCTACGTCGA ATACTATCGC GCCCAGTTGA CCGAATTGTG CACCCGCTAT GGCAAGCTGT TCGAGGTGTG GTTCGACGGC GCGAACGGCG GCGACGGCTT TTATGGCGGC GCGCGCGAGA CGCGGCAGAT CGATGCGCCG AAGTATTACA ACTGGCCCGG CATCATCGAA CTCGTCCACT CGCTCCAGCC GGATGCCTGC ACCTTCGACC CGCTCGGCGC GGACATCCGG TGGGTCGGCA ACGAGGAAGG CCATGCCGGC GATCCCTGCT GGCCAACCAT GCCGAACGCG CCCTACGAAA TGGACAAGGG CTATACCGGC GTGCGCGGCG CGGAACTGTG GTGGCCGGCG GAAACCGATG TCTCGATCCG TCCCGGCTGG TTCTACCACG CCGACGAGGA CACCCAGGTC AAGACGCCGC AGAAGCTGAT GGAGATGTTC GACCGCTCTG TCGGCCATGG CAGCAACTTC CTGCTGAACC TGCCGCCCGA CCGCCGTGGC CGGATTCCCG ATCGCGACGT CGCCAGCCTC AAGGCCTTCG GCGATGCGAT CCGCGCCACG TTTGCGCAGG ATCTGGCGCG CGGGGCGCTT GCCAGTGCCA GCGCGGACAT CGGCTCCACC GCCGCCAGCG CCATCGACGG CAATCCCGAC ACGTTCTGGT GCGCGCCCGC CGAAGCGCGC GACGCCGCGC TCGCGCTGGA ACTCCAGCCC GGGACCCGGT TCGACACGAT CGTCTTGCGC GAATGGCTGC CGCTGGGACT GCGCACCACG ACTTTCGCCA TCGACATCGC CGACGACGGC GGCGAATGGC GCGAAATCGC GCGCAAGGAC ATGGTCGGCC CCGAACGCCA TGTCCGCCTG CCCGCGCCCG TCTCTCCGCG CCGTGTGCGC TTCCGTGCCA TCGCGGCAGA GGCAGGGCCG ACGCTGCGGG AATTCGCGCT CTACCTGTCG TCAGCGCCCA TCGAACTGCC GCCCGCGGTG CCTTCGGACC CCAGCATCGT CTCGCGCCGC CGCTGGAAGA TCGTGGCCGC CAGCGCACCC GGGGCCGATG CCGTGCTCGA TGAAAACCCG AAGAGCGCGT GGACAGCGCC CGCCACGGCT TCGCTGACCA TCGACCTCGG CGGGGAAGAG AAGCTCGCGG GCTTCACCCT GACGCCCACG CGCCACATCG ATCCGCAAGC CGCCCCGCCG GCGCGCTGGC ACGTCGAGAC GAGCCTCGAC GGCAAGTGCT GGAGCAAGGC GGAAGAAGGC GAGTTCCAGA ACATCAACTA TGCCCGCGCG ACGCAGCGCA TCGCCTTTTC CGCGCCGCGC AACGCCCGCT ACCTGCGCCT CGCCTTCCCG CGCCCGGCCG TGCCCGCACC GGCCATCGCC GTGGCGACAA TCGGTGCCTT TCGCTAG
|
Protein sequence | MTLPALSRRT VLSGSLLAGA AACTPKAGGL ASSPVVPALP APWGAVPHPR QVKWHDRRMY AFIHFSMNTF TDKEWGFGDE DPAMFNPTDF DADQIVGAAV AGGLTGLIIT AKHHDGFCLW PTTLTEHCVR NSPWRGGKGD VVGELEAACR RAGINFGVYL SPWDRNRADY GKPSYVEYYR AQLTELCTRY GKLFEVWFDG ANGGDGFYGG ARETRQIDAP KYYNWPGIIE LVHSLQPDAC TFDPLGADIR WVGNEEGHAG DPCWPTMPNA PYEMDKGYTG VRGAELWWPA ETDVSIRPGW FYHADEDTQV KTPQKLMEMF DRSVGHGSNF LLNLPPDRRG RIPDRDVASL KAFGDAIRAT FAQDLARGAL ASASADIGST AASAIDGNPD TFWCAPAEAR DAALALELQP GTRFDTIVLR EWLPLGLRTT TFAIDIADDG GEWREIARKD MVGPERHVRL PAPVSPRRVR FRAIAAEAGP TLREFALYLS SAPIELPPAV PSDPSIVSRR RWKIVAASAP GADAVLDENP KSAWTAPATA SLTIDLGGEE KLAGFTLTPT RHIDPQAAPP ARWHVETSLD GKCWSKAEEG EFQNINYARA TQRIAFSAPR NARYLRLAFP RPAVPAPAIA VATIGAFR
|
| |