Gene Saro_3225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3225 
Symbol 
ID3917483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3444277 
End bp3445527 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content67% 
IMG OID640446009 
Productglycosyl transferase, group 1 
Protein accessionYP_498494 
Protein GI87201237 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.503811 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGGCG CCGGGTGCCG GCCGAAGTGC CCCAGGGCAC ATCCGGAAGG CCAAGTGACC 
ACGCTCTTCC ACCCCAGGCC GGACATCCTG CACATCAGCG GCGACTTCCC GGACCCGATC
AATCCTTCCA AGACCCCGGT CATCCGCACG CTGCTGGAAA TGACCGACGA TACGTTCGCT
CATCGGGTCA TTTCGCTCAA TCGCAAGTCG CCCGGTCCCG TAGCCCTGTC CGCCACGTTG
CTGGGACGCT CTCCGCTTTC GCGCATTGTC CCTTTCGAAC GGGGCATGGC AGCCGAGTAC
CTGGCCCCGC CGATGGGGCT GCTTCACAAG ACCCTCTTGC ACCGGGTAGC CGACGCGCTG
GTCGAACGGC TCGCGGAAGG TCCACGGCCC GCCCTGATCG TGGGCCACAA GCTTTCCATC
GAGGGGATCG TGGCGCGCCG GATGGCACGC AGGCTCGACA TTCCGTTCGC GATCTCGATC
CAGGGCAACA CCGATACGCG TATCGTGGAT GCCCGCCCGG ACTTGCGCGG CGAACTGCGA
CGCGTTTTCC ACGAGGCGTC CGTCGCATTT CCCTTCGCGC CGTGGGCGTT GCGGCGGATC
GAGGCACGGC TGGGCCGGCG CCGGGGCCTC ACAACGATGC TTCCCTGCCC TACCGACATC
GACGAGCCTG TGGCCCCGGC GATGGACGGC AATCAGCTGG TCTCGGTGTT CCACCTGCGC
AACCATGGCC TCAAGAACCT GCGTGGCCTT GCCTTGGCGA TGCGCGAGCT TGCCTCGACC
GACCCGGATA TCCGGCTTTC GATCATCGGC GGCGGGAGCG AGGAAGACTT CGGGACATGC
CGCGCGATCC TCGGCGACCT GCCAAACGTG GAACTGGCCG GTGCGATGGA CCGGCGCCAA
CTGCGCGAAG CGCTCGGGGG GGCGGCAGGC TTCGTCCTGC CCTCGCTGCG CGAAAGCTTC
GGGCTCGTTT TCATTGAAGC GCTGTTCTGC GGTCTGCCTG TGGTCTATCC GACCGGCAGG
GCAGTGGACG GCTATTTCGA CGGTGAGCCC TTCGCCATCG GCGTCGACCC CCGGCAACCG
GGGCGGATCG CGGAGGGTAT GCGCACGCTG GTGCGCGAGC AGGCGCCGCT CAAACGTGCG
CTCGCACGCT GGCAACAGGA CGGTCGCGCT CGGCAATTCA CCCGACCGGA AATCGCCGCA
TCCTTCGCCG CAGGGCTTAC CGCCGCCGCA GAAGCCGGAA CGCGCCCGTA G
 
Protein sequence
MDGAGCRPKC PRAHPEGQVT TLFHPRPDIL HISGDFPDPI NPSKTPVIRT LLEMTDDTFA 
HRVISLNRKS PGPVALSATL LGRSPLSRIV PFERGMAAEY LAPPMGLLHK TLLHRVADAL
VERLAEGPRP ALIVGHKLSI EGIVARRMAR RLDIPFAISI QGNTDTRIVD ARPDLRGELR
RVFHEASVAF PFAPWALRRI EARLGRRRGL TTMLPCPTDI DEPVAPAMDG NQLVSVFHLR
NHGLKNLRGL ALAMRELAST DPDIRLSIIG GGSEEDFGTC RAILGDLPNV ELAGAMDRRQ
LREALGGAAG FVLPSLRESF GLVFIEALFC GLPVVYPTGR AVDGYFDGEP FAIGVDPRQP
GRIAEGMRTL VREQAPLKRA LARWQQDGRA RQFTRPEIAA SFAAGLTAAA EAGTRP