Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1184 |
Symbol | |
ID | 3916481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1225715 |
End bp | 1226656 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640443920 |
Product | endo alpha-1,4 polygalactosaminidase, putative |
Protein accession | YP_496463 |
Protein GI | 87199206 |
COG category | [S] Function unknown |
COG ID | [COG3868] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTGGC ATAGCGCCGC AGCCGCCTCG CAAACGGGGC TTGCCGCCCT TGCCGTGGCA ATGCTGGCGC TGACCGGCTG CGGGGGCAGC GACGGCGGGT CACCGACACC GACGCCCACG GTAACGCCGG CCCCGACACC CACGCCAACG CCAACGCCAA CTCCGACGCC CACCCCGACA CCAACCCCGA CGCCTACCCC CACTCCCACG CCGGGAGGCC TCGCGAGCTG GGACTGGATA TTGCAAAGCC AGGGACTCCC GGCCACGCCA CCCGCCGTGG CCTATCTCGA CGTCGACGGC TTCGATACGC CCAAGAGCTA CGTCACGCTG GCCAAGGCGG CGGGCACGAA GACGATCTGC TATCTGAACG TCGGCACGGC AGAGAACTTC CGCCCGGACT ATTCCCAGTT TTCCGCGATT TCGGGCCTGC TCGGCAACAG CTATCCGGGG TTTTCCGGCG AACGCTATAT CGACATCCGG CGCTATCCCG AGTTCATCCA GATCATGGAC AACCGGCTGG TCATGTGCCG GGACAAGGGC TTCGACCTTG TCGAATTCGA TGTGATGGAC GCCTTCGAGG ACGGTGCCTC CACGGTGGGC TTCCCGCTGA CCGAAGCCGA CATGATCGCA TATGTGACCG CCCTGTCCGC GCGGGCGCGG GGCTATGGGC TGAAGCCGGT GCAGAAGAAT GCAGGCGGTT CCTCGGCAAA GATCGTGACG CTGTTCGACG CGGTGCTGTT CGAGGATTGC GTTTTGGGCA ACTTCTGCTC GGACGACGCG CCCTACATCG CCGCGGGCAA GCCGGCGTTC AACGCGGAGT ATCCGGAAAA CTGGGGCAGC TTCGACCGGG CCAGGGTTTG CGCCACATCC GCAGCGGCGA AGATTTCGAC GATCATCACG GTTATCGACC TGGACCGCCC GGCGCCGGAT CGATGTACCT GA
|
Protein sequence | MSWHSAAAAS QTGLAALAVA MLALTGCGGS DGGSPTPTPT VTPAPTPTPT PTPTPTPTPT PTPTPTPTPT PGGLASWDWI LQSQGLPATP PAVAYLDVDG FDTPKSYVTL AKAAGTKTIC YLNVGTAENF RPDYSQFSAI SGLLGNSYPG FSGERYIDIR RYPEFIQIMD NRLVMCRDKG FDLVEFDVMD AFEDGASTVG FPLTEADMIA YVTALSARAR GYGLKPVQKN AGGSSAKIVT LFDAVLFEDC VLGNFCSDDA PYIAAGKPAF NAEYPENWGS FDRARVCATS AAAKISTIIT VIDLDRPAPD RCT
|
| |