Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2071 |
Symbol | |
ID | 3917718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2210563 |
End bp | 2211738 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640444823 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_497344 |
Protein GI | 87200087 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.637859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGCGAC TGCTCTCGAT CTCGACGCTC TACCCCGCGC CCGGTCGCAC CGGCTTCGGG CGCTTCGTGG CGCGACAGAT GGAGGCGCTG GCGGCGCGCG GGGACTGGCA GGTGACGGTG ATCAACCCCA TCGGCCTACC GCCGCTGCCG ATCAGGCGCT ACGCCGCCTT GCGCGCGATA CCGGCGCAGG AACAGCAGGG CGGCGTGACC GTGCATCACC CGCGTTTCAC GCTTGTCCCG GGTCTTTCGG GGCCGATCAA TCCCGCGCTG ATCGCCCGGG CGGTCGTGCC GCTGGCAAGG CAGCTACATG CGCAAACGCC ATTCGACATG GTGGACGCGC AGTTCTTCTA TCCCGATGGC CCGGCAGCGG CGAAAGTCGC GGCGGCACTC GACCTGCCCT TCGCGATCAA GGCACGCGGA TCCGACATTC ACCTGTGGGG CGAGCGACGG CTTGCGGTGG CACAGATGCG GCGGGCGGCG GCCGGGGCTT CGGCCCTGCT GTCCGTATCC GCCGCGCTGG CGCGCGACAT GGCCGCGCTC GGTATGCCGG ATGACCGCAT CCGCGTGCAC TACACCGGGC TAGACGGCAG CCGCTTCCGC TTGCAGGACC AGGCGCAGGC GCGCCGGGTG GTGGCGCATC TGGTGCCCGG CGACGGCAGG CTGCTCCTCT GCGTCGGCGC GCTGCTCGCG ATCAAGGGAC AGGATCTGGC GATCCGTGCG CTTGCCCTTT TGCCGCCGGA CGTGCGTCTC GCGCTTGCGG GAACGGGGCC GGATGATGCG GCGTTGCGCG CTCTCGTCGC CGAACTCGGT CTCGAACACC GCGTGCATTT CCTCGGCGCG GTGGAGCACG ACGCCCTGCC CGCGCTGCTT GCTGCAGCCG ACGCGATGGT GCTGCCGTCC GAGCGCGAAG GCCTTGCCAA TGCCTGGATC GAAGCGCTCG CCTGCGGCGC GCCGCTGGTA ATTCCCGACG TCGGCGGTGC GCGCGAAGTT GTTCGCGGAA CCAGCGCCGG CCGTGTCGTG GCGCGCAATC CCGGGGCGAT CGCACAGGCC ATCTTGGACC TGCTAGCCGC CCCGCCCGCA CGCGATGCCG TCGCGGCGAA TGTCGCGAGC TTCAGTTGGG ACGCGAATGC CGCGGCGCTT GCAGCGATCT ACGAAGAAGC GGCGACGAAG CCTTAA
|
Protein sequence | MKRLLSISTL YPAPGRTGFG RFVARQMEAL AARGDWQVTV INPIGLPPLP IRRYAALRAI PAQEQQGGVT VHHPRFTLVP GLSGPINPAL IARAVVPLAR QLHAQTPFDM VDAQFFYPDG PAAAKVAAAL DLPFAIKARG SDIHLWGERR LAVAQMRRAA AGASALLSVS AALARDMAAL GMPDDRIRVH YTGLDGSRFR LQDQAQARRV VAHLVPGDGR LLLCVGALLA IKGQDLAIRA LALLPPDVRL ALAGTGPDDA ALRALVAELG LEHRVHFLGA VEHDALPALL AAADAMVLPS EREGLANAWI EALACGAPLV IPDVGGAREV VRGTSAGRVV ARNPGAIAQA ILDLLAAPPA RDAVAANVAS FSWDANAAAL AAIYEEAATK P
|
| |