Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1621 |
Symbol | |
ID | 3918729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1693643 |
End bp | 1694503 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640444361 |
Product | putative signal peptide protein |
Protein accession | YP_496895 |
Protein GI | 87199638 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2342] Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.174255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGGCC CCTGGTTCGA TCGTCGTCGC GTGATAGCGG CCCTCGCGGC CAGCCCGCTG GCATTCGGCC GGCTCGCGCA CGCCGCTGCG CCCTGGCGCT GGGCGGTCGA TTATGGTGCG AAGACTGATC CGGCGCTGGC CCGCCAGTTC GATCTGCTAG TGCTGGAACC AGATCACGCG CGCCCTATCG AAGCCTTGCG GGGGCCTGGC GCGAAGCTGC TCGGCTATCT CAGCCTGGGC GAAGTGGAGC AGGCGCGACC CTATGTCGGC AGATTGCGCA AGGCGGGCGC GCTGATCGCG GCCAATCCGA ACTGGCCCGA TGCCCGAATG GTCGACCTGC GCCACGCGCT CTGGACATCG CTGGTGGTGG AGGAGATCAT TCCCGCCATC CTTGCCAAGG GCTATGACGG CATCTTCTTC GATACGCTCG ACAACGCCGA GGCCATGCAA CACGCTGATC CCGTGAAGAT GGCCGGGATG GTCGACGCCG CCGCCGCCCT GGTTCGCGCC ATCCGTGCCC GCTTCCCGCC GATCACGCTG ATGATGAACC GTGGCTATGC GCTACTGCCG GCGGTCGCCC CCCACGTCGA TGTGGTCCTA GGGGAGGCGA TGGCATCGAA GTGGGACTTT GCGAAGAAGG CCTACGTCCG CACTACTCCG TCCGATTGGG AATGGCAGGC TGCCAGGCTG CGCGAAGCGA AGCTTGCCAA TCCCGCGTTG CGCCTGACCG TGCTCGACTA TTGGGACGAG GCCGACCGCG ACACCGTTGC CGCGCTTTAC CATTGCGAGC GCGAGGCCGG GTTCCACCCC TATGTCGCCA CGCTGGCGCT TGATCGCATC CATCCGGAGC TTGCCGCATG A
|
Protein sequence | MMGPWFDRRR VIAALAASPL AFGRLAHAAA PWRWAVDYGA KTDPALARQF DLLVLEPDHA RPIEALRGPG AKLLGYLSLG EVEQARPYVG RLRKAGALIA ANPNWPDARM VDLRHALWTS LVVEEIIPAI LAKGYDGIFF DTLDNAEAMQ HADPVKMAGM VDAAAALVRA IRARFPPITL MMNRGYALLP AVAPHVDVVL GEAMASKWDF AKKAYVRTTP SDWEWQAARL REAKLANPAL RLTVLDYWDE ADRDTVAALY HCEREAGFHP YVATLALDRI HPELAA
|
| |