Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1626 |
Symbol | |
ID | 3918734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1699641 |
End bp | 1701191 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640444366 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_496900 |
Protein GI | 87199643 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0797094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGAC GCGAAAGCGA TGTCTGCGTC ATCGTCGAAG GCGCCTATCC CTATGTTACG GGCGGCGTTG CAAGCTGGCT TCAGGAATTG ATCACCAGCT TGCCGGAACT GACCTTTTCC GTCGTGGCGA TCAAGGCCGA CGAGGAACCA CAGAAATGGA ACGTCGAACC GCCCCCAAAC GTGATCGAGG TCGTGGAGGT CCCGCTGTCG TTCGCCCCGC GCAGACCGGC CGCGCTACCG CCGAGCCTTG CCGACCGGAT AGGGCGCCTG CTCCTGCGTT TCCTTCAGGA GGGCCAGCCC GAAATACTGC GAACCCTGGT GGCCGAGCTG GCCGCGCTCG ATCGCAAGCC GCATCCTGGC GATGTCATGT CCAGCGCGCA GATGTTCTCG ATCCTCACGG AACATTACCG CGAAGCATTC CCCTCCGCTT CGTTCCATCA TTTCTTCTGG GCAACCCGGA TCCTCTTGGG AGGCTTGCTC GCCGTTCTTC TGGCGCCGCT GCCGAGGGCA CGCACCTACC ACACCCTGTC CACCGGGTTC GCTGGCCTGC TTGCGGCGCG CGCGCGGCAT GAAACCGGAC GCCCCGCGTT CCTTACCGAG CACGGCATCT ATCTGCTCGA GCGGCAGATC GAGATCATGA TGGCGGAATG GATGGGGGAT CAGATCGACA ACGGACTGGC GCTTGAACGC GAACAGCATG ATCTGCGCGA CCTGTGGGCG GCTGCGTTCG AAAGCTACGC CCGAGGGTGT TACGACGTCT GCCATCCGAT CATTGCGCTT TACGGCGCCA ACAGCGAAGT CCAGGCGCGC ATGGGGGCTC GGCGCAAGAG CCTGCGGGTC ATCCCCAACG GCATCCGGCC TGAACGCTTC GAGGGCGTAG TCTCCCGCCG TGACGAACAG CGGCCGCTCA TTGCTCTCAT CGGCCGCGTC GTACCTATCA AGGACATCAA GACCTTCATT CGTGCCGCAG GTCTTGTTCA CGCGGCATTC CCCGATGCGC GCTTTGCGGT GCTCGGCCCC CGGGATGAAG ACGTGGATTA CGCTCTCGAC TGTACTGCGC TTGTCGACGA ACTCGGACTC GGAGACGTGA TCGCGTTTCC CGGCCGGGTC AATGTGGTCG ACTGGATGCC GAAGATCGAC ATACTCGTCC TGACCAGCCT ATCGGAAGCC CAGCCGCTGG TCATTCTCGA GGCAGGCGCA TGCGGCATTC CATCGGTTGC GCCCGATGTT GGCAGTTGTC GCGAACTGAT CGAAGGCAAT AAGCCGGGCG AACCCCATGG CGGTATCATC ACGGCTCTTG TCGATCCCGA GGCGACCGCC GCAGCGCTTC TGCGTCTGCT GCGCCATCCT GATTTGCGCG CCTCGATGGG CGAGGTGATG CGGGCGCGGG TCCATGCGGA TTACGACTGG TCGGGTATCG TGGAACAGTA TCGTCGTATC TATTCGGGGA AGGAGGAGCC TCGCCAAATG GCGGTGGTCG GTTCGCCACC GGTCCCTCGT GCGACCTTGC GCGATGTCGC GAACGCGATT CGCGGACCGT CGAAGGGGTG A
|
Protein sequence | MTGRESDVCV IVEGAYPYVT GGVASWLQEL ITSLPELTFS VVAIKADEEP QKWNVEPPPN VIEVVEVPLS FAPRRPAALP PSLADRIGRL LLRFLQEGQP EILRTLVAEL AALDRKPHPG DVMSSAQMFS ILTEHYREAF PSASFHHFFW ATRILLGGLL AVLLAPLPRA RTYHTLSTGF AGLLAARARH ETGRPAFLTE HGIYLLERQI EIMMAEWMGD QIDNGLALER EQHDLRDLWA AAFESYARGC YDVCHPIIAL YGANSEVQAR MGARRKSLRV IPNGIRPERF EGVVSRRDEQ RPLIALIGRV VPIKDIKTFI RAAGLVHAAF PDARFAVLGP RDEDVDYALD CTALVDELGL GDVIAFPGRV NVVDWMPKID ILVLTSLSEA QPLVILEAGA CGIPSVAPDV GSCRELIEGN KPGEPHGGII TALVDPEATA AALLRLLRHP DLRASMGEVM RARVHADYDW SGIVEQYRRI YSGKEEPRQM AVVGSPPVPR ATLRDVANAI RGPSKG
|
| |