Gene Information |
Locus tag | Saro_3221 |
Symbol | |
ID | 3917479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3439260 |
End bp | 3440513 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640446005 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_498490 |
Protein GI | 87201233 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
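
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3439260
end_bp = 3440513

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1254 bp) and Protein Length (417 aa).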
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0404695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCGC CGGTTCCCGC AGCTGACTTC GAGTTCCCCA CCGAAATCCC CGGCCTGCCG CTGGCGGGCA GGAAAGTCCT GATCGTCGTC GAGAACCTGC CGCTGCCCTT CGACCGGCGT GTCTGGCAGG AAGCGCGCAC ACTGAAGGCC GCCGGGGCGC AAGTCTCGAT CATTTGCCCC ACGGGCAAAG GTTACGAGAA GCGCTTCGAA GTCATCGACG GCATCGACAT CCACCGCCAT CCCCTGCCCA TCGAGGCGAG CGGCGCACTG GGGTTCCTGC TGGAATATGG CGCGGCCCTG TTCTGGGAAA CGGTGCTGGC TTGGAAGATA TTCCTCAAGC GCGGCATCGA CGTGATCCAG GGGTGCAATC CGCCCGACCT GATCTTCCTT GTCGCCCTGC CCTTCAAGCT TCTGGGCGTC AAATATATCT TCGACCATCA CGACATCAAT CCCGAGCTTT ACGAAGCGAA GTTCGACAAG CGGGGCTTCT TCTGGAAGTT GATGGTCCTG TTCGAAAAGT TGACGTTCAA GGCCGCCGAC GTGTCGATGG CCACCAATCA TTCCTATCGC AAGATCGCCA TCGAGCGGGG CGGCATGGAC CCGGATAAGG TGTTCGTCGT CCGCTCAGGT CCGGATCTCA GCCGGCTGAA GCGGGTACCT CCGGTCGAAA GCTGGAAGAA CGGGCGCAAG CACCTCGTCG GATATGTCGG GGTGATGGGC GACCAGGAGG GAATAGACCT TCTAATCGAT GCGGTGGACC ATATCGTGCG CGTGATGGGC CGAGACGACA TCCAGTTCTG CCTTGTCGGC GGAGGGCCAA GCCTCGCCAA GCTAAAGGCA CTGGTCGCGG AAAAGGGCTT GGCCGACTTC ATCCAGTTCA CCGGCCGCGC ACCCGATCAG GACCTGTTCG AAGTTCTTTC GACGATGGAC GTCGGGGTCA ATCCGGACCG CGTCAACGCG ATGAACGACA AGTCCACCAT GAACAAGATC ATGGAGTACA TGAGCCTCGA GAAGCCCATC GTGCAGTTCG ACGTGACCGA GGGGCGCTTT TCCGCGCAGG AAGCCTCGCT CTATGCGCGC GCGAACGATC CGGTCGACAT GGCGGAAAAG ATCGTCGAGC TGATCGGAGA TCCGGAACGA CGGGCCCGCA TGGGCGCACT CGGCCGCATG CGCGTGGAGA CCGAACTGAA TTGGGGGCAC CAGATCGCCC CGTTGATCGC CGCGTATCGC AAGGCGCTCT GCCTTGCCGA CTGA
|
Protein sequence | MNAPVPAADF EFPTEIPGLP LAGRKVLIVV ENLPLPFDRR VWQEARTLKA AGAQVSIICP TGKGYEKRFE VIDGIDIHRH PLPIEASGAL GFLLEYGAAL FWETVLAWKI FLKRGIDVIQ GCNPPDLIFL VALPFKLLGV KYIFDHHDIN PELYEAKFDK RGFFWKLMVL FEKLTFKAAD VSMATNHSYR KIAIERGGMD PDKVFVVRSG PDLSRLKRVP PVESWKNGRK HLVGYVGVMG DQEGIDLLID AVDHIVRVMG RDDIQFCLVG GGPSLAKLKA LVAEKGLADF IQFTGRAPDQ DLFEVLSTMD VGVNPDRVNA MNDKSTMNKI MEYMSLEKPI VQFDVTEGRF SAQEASLYAR ANDPVDMAEK IVELIGDPER RARMGALGRM RVETELNWGH QIAPLIAAYR KALCLAD
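
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a rough sketch of how the blocks might be joined and how a GC fraction and a simple CDS sanity check could be computed. The function names are hypothetical, and the start-codon check is a simplification: it accepts only ATG, although translation table 11 also permits alternative start codons.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, standard stop.

    Assumption: only ATG is treated as a start codon here; TAA, TAG and
    TGA are the stop codons of translation table 11.
    """
    seq = clean_sequence(raw)
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# Usage with the first two blocks of the gene sequence in this record.
first_blocks = "ATGAACGCGC CGGTTCCCGC"
print(clean_sequence(first_blocks))
print(gc_fraction(first_blocks))
```

Passing the full Gene sequence field to `gc_fraction` should give a value comparable to the GC content listed in the record, up to rounding.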