Gene Information |
Locus tag | Saro_0729 |
Symbol | |
ID | 3918553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 770732 |
End bp | 772492 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640443461 |
Product | glycosyl transferase family protein |
Protein accession | YP_496010 |
Protein GI | 87198753 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
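The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    # The stop codon occupies one codon but contributes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the Saro_0729 record above.
gene_len = span_length(770732, 772492)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)  # 1761 586
```

Under these assumptions the computed values agree with the Gene Length (1761 bp) and Protein Length (586 aa) fields.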
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.155064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCGC GCAGCTTTCT GGGCGGGTTC TGCATCGCCT TTTTCCGCGA TCCCGTCCGG GCGCTGGTGG CAGCGTTCTG GTTCGCAACC GGCAAGCGAG TGCGGGCGCG CAATCGGCTG CACCTGGTGC TGACCATGCC GGGGACGGCC TACGATCGGT GGATCGCGGA AGTGGAGGGG GCGGATCTTC CCGAGGATCG CCCGGCCGAC CGGGCTTCGG GGCAGGCATC GCCGCGCTTC AACGTGCTGA TCTGCGTGCC TGACGACGAC GCCGCCCGGA TTATCCGCCG ACAAATCGAA GCGCTGATCG CACAGGGTTG GCCGTACTGG ACCGCGAACA TCTTCGTGGG CGCAGACGTG ACCATGCCCG ACTTGCCGGA TGATCCCCGC ATTCGCTTGC TGCCTTCGAT GGGGGACGGC GAGACCCTCA AGGAGACGTA CCCTGCCTTG GAGACAAAGG GCGGTTACGT CGTGCCCCAT CACCAGGGCG CGATCCTTTC CGGCAGCGCC CTCCAGCGCA TGGCCGATGC CATTCGTGGA AGCGGCAATC CCACGATCAT CTTTGGCGAC CACGACCATC TCGCGGGTCT GAACCGCAGG CACACGCCAT GGTTCAAGCC GCAATGGAAC GCGGAAATGA TCCTCGCCCA GGACTATGTG ACGCAAGTCC TGGCAATCCG TGAGGACGAG GCACAGGCGC TGCGGCTCGA CGGCGGGTCT TGCGCTGCCT ATGCCCTTCT CCTTGAACTG AGCCGAACGC CGGGTTTCAC GGCAGTACGC GTTCCCCATA TCCTTGCCCA CGTCGTCGAC GATCACGCTC CGGATGCTTC CGCCATCCGG GCGATTGTCG CGCAGCACGT CGCGCATCGA GCCGGGATCG CAACGGCAGG CCCGTTCGGC ACGGTCCGCG TCGCCTGGCC GTTGCCCGAT CCCTTGCCAC TGGTGAGCGT GATCGTGCCT ACGCGGGACC AGCCGAGGCT TCTTCGTGCC TGCATGGACG GCCTTTTGCG CGACACGCTC TACGCGCCCA TGGAAATCCT CGTGGTCGAC AACGGGACCA CCGATCGGCA AGCGCTGGCA CTAATCCGCG AACACAGTGC CGACCCGCGA GTGAGGGTCC TGTCGGCACC GGGCCCCTAC AATTATTCCA GGTTGAACAA CCGGGCGGTC CGAGAAGCTG CGGGAGAGTA CGTCTGCCTG CTGAACAACG ATACGCAAGT CATCAAGGGG ACATGGCTGC ATGAGATGAT GCGGCAGGCA TCCCGACCGG AAGCGGGCGC TGTAGGTGCC ATGCTGCTCT ATCCGGACCA CACGATCCAG CATGCAGGCG TCGTGGTCGG AATGGGAGAG GCAGCAGGCC ATGCGCATCG CTTCCAAAGC GCAGATGGCG CGGGCTTCTT CGCCCAGGCC CATGTCCAGA GATACGTGAG CGCCGTCACC GCCGCCTGCC TCGTGGTCAA GCGCGAGAAG TTCCTCGCGG TCGACGGGCT GGACGAGGAG GGGCTGCCGA TCGCCTTCAA CGACGTCGAC CTCTGCCTGA AGCTGCAGCG GGAGGGGTGG CGCAACCTTT ATTGTCCGCA GGCGGTGATG GTACACCATG AATCGAAGTC CCGTGGCAAG GATTTCGCGC CGGACCAGCG TGACCGCTAC ATGCGGGAGT TGTCGGTACT GCAAGGCCGG TGGGGCACAG CCCGGTATCA GGACCCGCTG CACCACCCGC GCCTGAAGCG CTCCAGCGAA ACCTATATCC TCGATTATTG A
|
Protein sequence | MSARSFLGGF CIAFFRDPVR ALVAAFWFAT GKRVRARNRL HLVLTMPGTA YDRWIAEVEG ADLPEDRPAD RASGQASPRF NVLICVPDDD AARIIRRQIE ALIAQGWPYW TANIFVGADV TMPDLPDDPR IRLLPSMGDG ETLKETYPAL ETKGGYVVPH HQGAILSGSA LQRMADAIRG SGNPTIIFGD HDHLAGLNRR HTPWFKPQWN AEMILAQDYV TQVLAIREDE AQALRLDGGS CAAYALLLEL SRTPGFTAVR VPHILAHVVD DHAPDASAIR AIVAQHVAHR AGIATAGPFG TVRVAWPLPD PLPLVSVIVP TRDQPRLLRA CMDGLLRDTL YAPMEILVVD NGTTDRQALA LIREHSADPR VRVLSAPGPY NYSRLNNRAV REAAGEYVCL LNNDTQVIKG TWLHEMMRQA SRPEAGAVGA MLLYPDHTIQ HAGVVVGMGE AAGHAHRFQS ADGAGFFAQA HVQRYVSAVT AACLVVKREK FLAVDGLDEE GLPIAFNDVD LCLKLQREGW RNLYCPQAVM VHHESKSRGK DFAPDQRDRY MRELSVLQGR WGTARYQDPL HHPRLKRSSE TYILDY
|
| |
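The gene sequence is listed in blocks of ten bases and, although the gene lies on the minus strand, it is shown in coding orientation (it begins with ATG and ends with TGA). A minimal sketch of how the blocked sequence could be cleaned, translated, and checked for GC content follows. It is not the database's own pipeline: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs only in which codons may act as starts).

```python
from itertools import product

# Amino acid assignments in TCAG codon order; identical for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked_sequence: str) -> str:
    """Remove the whitespace that separates the ten-base blocks."""
    return "".join(blocked_sequence.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = clean("ATGAGCGCGC GCAGCTTTCT GGGCGGGTTC")
print(translate(prefix))  # MSARSFLGGF
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.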