Gene Saro_3178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3178 
Symbol 
ID3918220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3398543 
End bp3399751 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content69% 
IMG OID640445962 
Productglycosyl transferase, group 1 
Protein accessionYP_498447 
Protein GI87201190 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGCG TGCTTCATGT TCTCGACCAT TCGCTGCCGC TCCACAGCGG CTATACCTTC 
CGCACCCGCG CCATTCTGAA GGCGCAGGAG GCGATGGGCA TCGAGGTGCG TGGCGTCACC
GGCCGGCGCC ATGTCGCCCC TCCCGCCGCG CAGGAGCCGG AAGAGAGCGA CGGCCTGCGC
TTCTATCGCA CCCCTGGCGC AGCCGGGAAC CTGCCGCTTG TACGGGAATG GGCCGAAGTG
TCGGCCCTTG CCCGGCGCAT CGTCGAGGTC GCCGGTGAAT GGCGACCCGA TATCCTCCAC
GCGCACTCGC CCGCCTTGTG CGGCCTCGCG GCGGTAAAGG CCGGGCGCAA GCTCGGCATT
CCGGTGGTCT ACGAAATCCG CGCCTTCTGG GAGGATGCGG CCGTCGGCAA CGGGACAGGC
CGCGAAGGCA GCCTCAAGTA CCGCGTCACG CGGGCGATGG AGAACGACGT CGTCGGTGCC
GCCGCCCGTG TCGTCACGAT CTGCGAGGGT CTTCGCCAGG ACCTGGTCGG ACGCGGATTC
GCGCCCGAGA AGCTGTCGAT CATGCCCAAC GGCGTCGACC TCGATCTGTT CGGCGCGCCC
CTGCCGCGCG ACCTTGGCCT CGCGCAGGAG CTTGGTCTCG GCGACGGACC CGTCATCGGC
TTCCTGGGCA GCTTTTATCC CTACGAGGGG CTGGACGATC TTGTCGACGC AATGCCGGCG
ATCGCCGGCG CGGTGCCTGG CGCCACGCTC CTGCTCGTCG GCGGAGGGCC GGCGGAAGCA
GACCTTCGTG CCCGCGCCGC CGCTTCGCCG GCGGCCCCGG CGATCCGTTT CGTCGGCCGC
GTGCCTCATC ACGAGGTGGA CCGCTATTAT TCGCTGGTCG ATGTCGTCTG CTATCCGCGC
AAGGCCATGC GCCTTACCGA AATGGTGACC CCGCTCAAGC CGCTTGAGGC GATGGCTCAG
GGCAAGCTCG TGGCGGCGTC CGACGTCGGC GGGCACCGCG AACTTGTCAC CGATGGCGAG
ACCGGGGCGC TGTTCCCCCC GGACGACCCT GCGGGTCTCG CCGCCGCGCT TGTTTCGCTG
CTCGCCGGGC GCGACGGCTG GGAGGAAAGG CGTGCGACGG CGAGGGCGTT CGTCCGGGAC
CGGCACGATT GGGCGATCAA TGTGCGGCGT TATCAGGACG TTTACCAAGC CTTGTTACCG
AGTCCTTGA
 
Protein sequence
MTRVLHVLDH SLPLHSGYTF RTRAILKAQE AMGIEVRGVT GRRHVAPPAA QEPEESDGLR 
FYRTPGAAGN LPLVREWAEV SALARRIVEV AGEWRPDILH AHSPALCGLA AVKAGRKLGI
PVVYEIRAFW EDAAVGNGTG REGSLKYRVT RAMENDVVGA AARVVTICEG LRQDLVGRGF
APEKLSIMPN GVDLDLFGAP LPRDLGLAQE LGLGDGPVIG FLGSFYPYEG LDDLVDAMPA
IAGAVPGATL LLVGGGPAEA DLRARAAASP AAPAIRFVGR VPHHEVDRYY SLVDVVCYPR
KAMRLTEMVT PLKPLEAMAQ GKLVAASDVG GHRELVTDGE TGALFPPDDP AGLAAALVSL
LAGRDGWEER RATARAFVRD RHDWAINVRR YQDVYQALLP SP