Gene Saro_0942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0942 
Symbol 
ID3918028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp989648 
End bp990808 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content68% 
IMG OID640443676 
Productglycosyl transferase, group 1 
Protein accessionYP_496221 
Protein GI87198964 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGG CCGCCCCGCT CAGGATTCTC CACCTTCATT CCACGTTCGA TGCCGGGGGC 
AAGGAACGCC GCGCGGTGGC GCTGATGAAC CGCTGGGCAC GGCAAGGCGG GGCAAGGGCG
ATCGAGCACC ACATCGTTTC GGCACAGCCC GGCTCGATGG GGGCAAGGAG CCTGATCGAC
AAGCGCGTCG CCGTGTTCTT CCCTTTCGGG TTCCCGCCGC TTGCAGGCAA ATTGCGCCTT
TCGCGCCTGC AACAGCTTGC ACGTGCGATG CGCGGCTTCG ACCTTGTTCT CACCTATAAC
TGGGGCGCGA TGGACGCAGC GATGGCGCAT GCCGTCTTCG CCCCGTCGAT GGGCCTTGCG
CCGCTGGTCC ATCACGAGGA CGGCTTCAAC GCCGATGAAG CGGGCGGCCT CAAGCGTGGC
CGCAACTGGT ATCGCATGGT CGCGCTTTCT CGCGCCAGCG CGCTGGTAGT TCCGTCGCGC
GGGCTGGAAG AGATCGCGCT GGGCTCCTGG CGCCAGCCGC GTGCCCGCGT ACATCGCATC
CTCAACGGCA TCGATACCGC GGCCTATGCC CGCAAGCCCA GGCCCGACGT GTTGCCGCGC
GTGGTCAAGC GGCCGGGCGA AAAATGGCTC GGCACCCTGG CCGGCCTGCG CGCGGTCAAG
AACCTGCCGC GCATGGTTCG GGCGATGAAG GCGCTGCCGC CCGAATGGCA TCTCGTCATT
CTTGGCGAAG GGCCGGAGCG GGAGGCCATC CTTGCCGAGG CCATGCGGCA GGAGGTCGGT
CACCGCGTCC ACCTGCCGGG CCATGTCGCG GACCCCGCCG CAGCGATTGG CCTTTTCGAT
CTTTTCGCGC TTTCCTCCGA CAGCGAGCAG GCACCGCTTT CCGTGATCGA GGCGATGGCC
GCCGGGCTCG CCGTGGTCAG CCCCGCCGTG GGCGATGTGG CGGACATGGT TTCAGAGGCG
AACCGCCCCT ACGTGATCCC GCCCGGAGAC GACGATGCGC TGGCGGCGGC AGTTCGCGCG
CTGGCGGGCG ATGCGGCGCT TCGTGCGTCG ATCGGCAAAT CCAATCGCGC CCGCGCCCGG
GCCGAGTTCG ACGAAGGCGT CATGGCCGAC CTCTACGCCA GGCTCTACGC GGGGGCGCTC
GGCCGCGACA GCTTCTCGTG A
 
Protein sequence
MSKAAPLRIL HLHSTFDAGG KERRAVALMN RWARQGGARA IEHHIVSAQP GSMGARSLID 
KRVAVFFPFG FPPLAGKLRL SRLQQLARAM RGFDLVLTYN WGAMDAAMAH AVFAPSMGLA
PLVHHEDGFN ADEAGGLKRG RNWYRMVALS RASALVVPSR GLEEIALGSW RQPRARVHRI
LNGIDTAAYA RKPRPDVLPR VVKRPGEKWL GTLAGLRAVK NLPRMVRAMK ALPPEWHLVI
LGEGPEREAI LAEAMRQEVG HRVHLPGHVA DPAAAIGLFD LFALSSDSEQ APLSVIEAMA
AGLAVVSPAV GDVADMVSEA NRPYVIPPGD DDALAAAVRA LAGDAALRAS IGKSNRARAR
AEFDEGVMAD LYARLYAGAL GRDSFS