Gene Saro_3069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3069 
Symbol 
ID3916683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3288488 
End bp3289687 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content68% 
IMG OID640445851 
Productglycosyl transferase, group 1 
Protein accessionYP_498338 
Protein GI87201081 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.687268 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAACA CACCGACCCC GCCCAAGGTC CTGCTGCTGT TGACCTCGCT CCACGGCGGC 
GGCGCCGAAC GTGTCGCCGT GCATCTGCTG AACCGCCTTC AAGGGCGCTT CGACATGCGC
ATGGGCCTGC TCCGCGCCTC GGGCCCCTAC CTCGACCAGG CCGACCGGTC GCGGCTGATA
GTGGCGCCGG AAGGCGAGAC GCACTTCAAC TTCGACGGTC CCAATTCCGC CAATTACCGC
CCCGGAAAGC TGGTCGGCAG CGCAGTGCGG GCACCGCTCG CATTCCGCAG GATGATCCGC
GAAACGCAGC CTGACGTCGT GCTGAGCTTC CTCAAGGGCA CCAACCTGCT GGTCTGGCTG
GCGCTGATGA ACATGGGCCG CGCCCGACCG CGCTGGATCG CGCGCGAAGG CAACAACGTG
CTGGCCGTCA TCCGCGAGGA AGCGCCCAAC GGCGCCGTGG CGCGGGCATC GCGTGACCTT
ACGGCCAAGG CCTATCGGCG GGCCGATGCC GTTCTCGCAA ATTCCACCGA CATGGCCGCG
GGGCTGATCA CCGATCTCGA TCTCGATCCC GCGAAGATGC GGATGATCAA CAATCCCATC
GACATCGACG GCATACGCGA GGCAGCGGGC GAGAGCCTTC CGGGCGCGCC CAACCGGCCC
TTCATCCTGA CCGCGGGCCG GCTCGAATAC CAGAAGGCGC ACGAGGTGCT GCTGCGCGCC
TTCGCGCGGA GCGAAGCGTG GCGCACGCAC GCGCTGGTGA TCCTCGGCAA GGGGAGCCGG
CTGGGCGAAC TGCACCGCCT CGCCGCACAG CTCGGCATCG GCGAGTACGT GCGCTTCATC
GGCTTCGTCC CCAACCCCTA TGCCTGGATG GCGCGCGCCG ATCTGTTCGT GCTGCCTTCG
CGGTGGGAAG GATTTCCGAC CGTGGCGGCC GAGGCGATGG CCTGCGGCAC GCCCCTGCTG
CTGACCGACT GCAGATTCGG CGCGCGCGAT ATCGTGGAGC CCGGAGTGAC CGGGGAACTG
GTGCCAGTGA ACGACGAGGC AGCGCTGGCC ACCGAAATCG CGGCACTGCT GGCTTCGCCG
GAGCGGCGCA GTGCGCTGGC ACGGGCCGGA CGCGAGAAGG TGGAACGGTT CAGGCTTGAA
CGAATGCTGG AAGCCTACGC TGCCCTCTTC GACGAACAGT TCGCCGCGCG TCGTCGTTAA
 
Protein sequence
MNNTPTPPKV LLLLTSLHGG GAERVAVHLL NRLQGRFDMR MGLLRASGPY LDQADRSRLI 
VAPEGETHFN FDGPNSANYR PGKLVGSAVR APLAFRRMIR ETQPDVVLSF LKGTNLLVWL
ALMNMGRARP RWIAREGNNV LAVIREEAPN GAVARASRDL TAKAYRRADA VLANSTDMAA
GLITDLDLDP AKMRMINNPI DIDGIREAAG ESLPGAPNRP FILTAGRLEY QKAHEVLLRA
FARSEAWRTH ALVILGKGSR LGELHRLAAQ LGIGEYVRFI GFVPNPYAWM ARADLFVLPS
RWEGFPTVAA EAMACGTPLL LTDCRFGARD IVEPGVTGEL VPVNDEAALA TEIAALLASP
ERRSALARAG REKVERFRLE RMLEAYAALF DEQFAARRR